- See Also
-
Gwern
- “Miscellaneous”, Gwern 2009
- “Novelty Nets: Classifier Anti-Guidance”, Gwern 2024
- “InvertOrNot.com Proposal”, Gwern 2021
- “The Second Apocalypse: Freedom In An Unfree Universe”, Gwern 2017
- “Review Of The Quantum Thief Trilogy”, Gwern 2022
- “Interpreting ‘Suzanne Delage’ As Dracula”, Gwern 2009
- “Research Ideas”, Gwern 2017
- “What Is The Morning Writing Effect?”, Gwern 2011
- “Absolute Unit NNs: Regression-Based MLPs for Everything”, Gwern 2023
- “Danbooru2021: A Large-Scale Crowdsourced & Tagged Anime Illustration Dataset”, Gwern 2015
-
Links
- “Think Before You Speak: Training Language Models With Pause Tokens”, Goyal et al 2023
- “Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation”, Gu et al 2024
- “The Ritualization of the ‘Bank System’ in Japanese TV Animation With Hero or Heroine”, Chang & Tseng 2015
- “Of Coase and Copyrights: The Law and Economics of Literary Fan Art”, Guerra-Pujol 2019
- “The Keats Heuristic: Rhyme As Reason in Aphorism Interpretation”, McGlone & Tofighbakhsh 1999
- “Efficient Training of BERT by Progressively Stacking”, Gong et al 2019
- “Troubling Trends in Machine Learning Scholarship”, Lipton & Steinhardt 2018
- “Progressively Stacking 2.0: A Multi-Stage Layerwise Training Method for BERT Training Speedup”, Yang et al 2020
- “Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data”, Tajwar et al 2024
- “CatLIP: CLIP-Level Visual Recognition Accuracy With 2.7× Faster Pre-Training on Web-Scale Image-Text Data”, Mehta et al 2024
- “Battle Over Startup Leaves Early Investor With No Equity, $2.6 Million Legal Bill § The Information PR War”, McBride 2024
- “Magic and Empiricism in Early Chinese Rainmaking: A Cultural Evolutionary Analysis”, Hong et al 2024
- “Self-Protection As an Adaptive Female Strategy”, Benenson et al 2021
- “The Fire Piston and Its Origins in Europe”, Fox 1969
- “Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion”, Li et al 2024
- “Retrieval Head Mechanistically Explains Long-Context Factuality”, Wu et al 2024
- “Role of Contacts in Long-Range Protein Conductance”, Zhang et al 2019
- “The Pínakes of Callimachus”, Witty 1958
- “The Other Pínakes and Reference Works of Callimachus”, Witty 1973
- “Epistemology and Cognition”, Fetzer 1991
- “VideoGigaGAN: Towards Detail-Rich Video Super-Resolution”, Xu et al 2024
- “How Good Are Low-Bit Quantized LLaMA-3 Models? An Empirical Study”, Huang et al 2024
- “Sam Altman Invests in Energy Startup Focused on AI Data Centers: Investment by OpenAI CEO Highlights Artificial Intelligence’s Electricity Appetite”, Ramkumar 2024
- “Bouvet and Leibniz: A Scholarly Correspondence”, Swiderski 1980
- “How Accurate Are Drug Cryptomarket Listings by Content, Weight, Purity and Repeat Purchase?”, Barratt et al 2024
- “The Cell Assembly: Mark II”, Milner 1957
- “Learning With Intelligent Teacher: Similarity Control and Knowledge Transfer”, Vapnik & Izmailov 2015
- “Abstract Proof Search”, Cazenave 2000
- “Iterative Widening”, Cazenave 2001
- “Width and Serialization of Classical Planning Problems”, Lipovetzky & Geffner 2012
- “Classical Planning Algorithms on the Atari Video Games”, Lipovetzky et al 2015
- “Planning for Novelty: Width-Based Algorithms for Common Problems in Control, Planning and Reinforcement Learning”, Lipovetzky 2021
- “Learn Your Tokens: Word-Pooled Tokenization for Language Modeling”, Thawani et al 2023
- “SpaceByte: Towards Deleting Tokenization from Large Language Modeling”, Slagle 2024
- “Machine Learning Reveals the Control Mechanics of an Insect Wing Hinge”, Melis et al 2024
- “Effects of Gene Dosage on Cognitive Ability: A Function-Based Association Study across Brain and Non-Brain Processes”, Huguet et al 2024
- “Design of Highly Functional Genome Editors by Modeling the Universe of CRISPR-Cas Sequences”, Ruffolo et al 2024
- “It Pays to Cheat: Tactical Deception in a Cephalopod Social Signaling System”, Brown et al 2012
- “Where Is It Like to Be an Octopus?”, Carls-Diamante 2022
- “Up in the Air: Presence and Collection of EDNA from Air and Air Conditioner Units”, Goray et al 2024
- “Magenta Green Screen: Spectrally Multiplexed Alpha Matting With Deep Colorization”, Smirnov et al 2023
- “Regulatory Effect of Osteocytes on Extramedullary and Bone Marrow Adipose Tissue Development and Function”, Lecka-Czernik et al 2024
- “Epigenetic Age Oscillates during the Day”, Koncevičius et al 2024
- “Incentive Engineering: for Computational Resource Management”, Drexler & Miller 1988
- “Why Do People Make Noises in Bed?”, Anikin 2024
- “Towards Generated Image Provenance Analysis Via Conceptual-Similar-Guided-SLIP Retrieval”, Xia et al 2024
- “Mechanism of Enhanced Lipolysis in Adipose Tissue of Exercise-Trained Rats”
- “The Influence of Acutely Administered Nicotine on Cue-Induced Craving for Gambling in At-Risk Video Lottery Terminal Gamblers Who Smoke”, McGrath et al 2013
- “Higher-Order Truths about Chmess”, Dennett 2006
- “How Deviant Can You Be?”, Samuelson 1968
- “Teacher Forcing Recovers Reward Functions for Text Generation”, Hao et al 2022
- “JEPA: Self-Supervised Learning from Images With a Joint-Embedding Predictive Architecture”, Assran et al 2023
- “Efficient Video and Audio Processing With Loihi 2”, Shrestha et al 2023
- Wikipedia
- Miscellaneous
- Link Bibliography
See Also
Gwern
“Miscellaneous”, Gwern 2009
“Novelty Nets: Classifier Anti-Guidance”, Gwern 2024
“InvertOrNot.com Proposal”, Gwern 2021
“The Second Apocalypse: Freedom In An Unfree Universe”, Gwern 2017
“Review Of The Quantum Thief Trilogy”, Gwern 2022
“Interpreting ‘Suzanne Delage’ As Dracula”, Gwern 2009
“Research Ideas”, Gwern 2017
“What Is The Morning Writing Effect?”, Gwern 2011
“Absolute Unit NNs: Regression-Based MLPs for Everything”, Gwern 2023
“Danbooru2021: A Large-Scale Crowdsourced & Tagged Anime Illustration Dataset”, Gwern 2015
Danbooru2021: A Large-Scale Crowdsourced & Tagged Anime Illustration Dataset
Links
“Think Before You Speak: Training Language Models With Pause Tokens”, Goyal et al 2023
Think before you speak: Training Language Models With Pause Tokens
“Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation”, Gu et al 2024
“The Ritualization of the ‘Bank System’ in Japanese TV Animation With Hero or Heroine”, Chang & Tseng 2015
The Ritualization of the ‘Bank System’ in Japanese TV Animation with Hero or Heroine
“Of Coase and Copyrights: The Law and Economics of Literary Fan Art”, Guerra-Pujol 2019
Of Coase and Copyrights: The Law and Economics of Literary Fan Art
“The Keats Heuristic: Rhyme As Reason in Aphorism Interpretation”, McGlone & Tofighbakhsh 1999
The Keats heuristic: Rhyme as reason in aphorism interpretation
“Efficient Training of BERT by Progressively Stacking”, Gong et al 2019
“Troubling Trends in Machine Learning Scholarship”, Lipton & Steinhardt 2018
“Progressively Stacking 2.0: A Multi-Stage Layerwise Training Method for BERT Training Speedup”, Yang et al 2020
Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup
“Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data”, Tajwar et al 2024
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
“CatLIP: CLIP-Level Visual Recognition Accuracy With 2.7× Faster Pre-Training on Web-Scale Image-Text Data”, Mehta et al 2024
“Battle Over Startup Leaves Early Investor With No Equity, $2.6 Million Legal Bill § The Information PR War”, McBride 2024
“Magic and Empiricism in Early Chinese Rainmaking: A Cultural Evolutionary Analysis”, Hong et al 2024
Magic and Empiricism in Early Chinese Rainmaking: A Cultural Evolutionary Analysis
“Self-Protection As an Adaptive Female Strategy”, Benenson et al 2021
“The Fire Piston and Its Origins in Europe”, Fox 1969
The Fire Piston and Its Origins in Europe:
View PDF:
“Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion”, Li et al 2024
Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion
“Retrieval Head Mechanistically Explains Long-Context Factuality”, Wu et al 2024
Retrieval Head Mechanistically Explains Long-Context Factuality
“Role of Contacts in Long-Range Protein Conductance”, Zhang et al 2019
“The Pínakes of Callimachus”, Witty 1958
“The Other Pínakes and Reference Works of Callimachus”, Witty 1973
“Epistemology and Cognition”, Fetzer 1991
View PDF (31MB):
/doc/psychology/linguistics/1991-fetzer-epistemologyandcognition.pdf
“VideoGigaGAN: Towards Detail-Rich Video Super-Resolution”, Xu et al 2024
“How Good Are Low-Bit Quantized LLaMA-3 Models? An Empirical Study”, Huang et al 2024
How Good Are Low-bit Quantized LLaMA-3 Models? An Empirical Study
“Sam Altman Invests in Energy Startup Focused on AI Data Centers: Investment by OpenAI CEO Highlights Artificial Intelligence’s Electricity Appetite”, Ramkumar 2024
“Bouvet and Leibniz: A Scholarly Correspondence”, Swiderski 1980
“How Accurate Are Drug Cryptomarket Listings by Content, Weight, Purity and Repeat Purchase?”, Barratt et al 2024
How accurate are drug cryptomarket listings by content, weight, purity and repeat purchase?
“The Cell Assembly: Mark II”, Milner 1957
“Learning With Intelligent Teacher: Similarity Control and Knowledge Transfer”, Vapnik & Izmailov 2015
Learning with Intelligent Teacher: Similarity Control and Knowledge Transfer
“Width and Serialization of Classical Planning Problems”, Lipovetzky & Geffner 2012
“Classical Planning Algorithms on the Atari Video Games”, Lipovetzky et al 2015
“Planning for Novelty: Width-Based Algorithms for Common Problems in Control, Planning and Reinforcement Learning”, Lipovetzky 2021
“Learn Your Tokens: Word-Pooled Tokenization for Language Modeling”, Thawani et al 2023
Learn Your Tokens: Word-Pooled Tokenization for Language Modeling
“SpaceByte: Towards Deleting Tokenization from Large Language Modeling”, Slagle 2024
SpaceByte: Towards Deleting Tokenization from Large Language Modeling
“Machine Learning Reveals the Control Mechanics of an Insect Wing Hinge”, Melis et al 2024
Machine learning reveals the control mechanics of an insect wing hinge
“Effects of Gene Dosage on Cognitive Ability: A Function-Based Association Study across Brain and Non-Brain Processes”, Huguet et al 2024
“Design of Highly Functional Genome Editors by Modeling the Universe of CRISPR-Cas Sequences”, Ruffolo et al 2024
Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences
“It Pays to Cheat: Tactical Deception in a Cephalopod Social Signaling System”, Brown et al 2012
It pays to cheat: tactical deception in a cephalopod social signaling system
“Where Is It Like to Be an Octopus?”, Carls-Diamante 2022
“Up in the Air: Presence and Collection of EDNA from Air and Air Conditioner Units”, Goray et al 2024
Up in the air: Presence and collection of eDNA from air and air conditioner units
“Magenta Green Screen: Spectrally Multiplexed Alpha Matting With Deep Colorization”, Smirnov et al 2023
Magenta Green Screen: Spectrally Multiplexed Alpha Matting with Deep Colorization
“Regulatory Effect of Osteocytes on Extramedullary and Bone Marrow Adipose Tissue Development and Function”, Lecka-Czernik et al 2024
“Epigenetic Age Oscillates during the Day”, Koncevičius et al 2024
“Incentive Engineering: for Computational Resource Management”, Drexler & Miller 1988
Incentive Engineering: for Computational Resource Management
“Why Do People Make Noises in Bed?”, Anikin 2024
“Towards Generated Image Provenance Analysis Via Conceptual-Similar-Guided-SLIP Retrieval”, Xia et al 2024
Towards Generated Image Provenance Analysis Via Conceptual-Similar-Guided-SLIP Retrieval
“Mechanism of Enhanced Lipolysis in Adipose Tissue of Exercise-Trained Rats”
Mechanism of enhanced lipolysis in adipose tissue of exercise-trained rats
“The Influence of Acutely Administered Nicotine on Cue-Induced Craving for Gambling in At-Risk Video Lottery Terminal Gamblers Who Smoke”, McGrath et al 2013
“Higher-Order Truths about Chmess”, Dennett 2006
“How Deviant Can You Be?”, Samuelson 1968
“Teacher Forcing Recovers Reward Functions for Text Generation”, Hao et al 2022
Teacher Forcing Recovers Reward Functions for Text Generation
“JEPA: Self-Supervised Learning from Images With a Joint-Embedding Predictive Architecture”, Assran et al 2023
JEPA: Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
“Efficient Video and Audio Processing With Loihi 2”, Shrestha et al 2023
Wikipedia
Miscellaneous
-
https://twitter.com/MartinNebelong/status/1783577204145705007
: -
/doc/philosophy/ethics/1558-bruegeltheelder-landscapewiththefalloficarus.jpg
: -
https://twitter.com/DanHendrycks/status/1782953713461772546
: -
https://developer.mozilla.org/en-US/docs/Web/HTML/Element/pre
-
https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/Width
-
https://lilianweng.github.io/posts/2018-06-24-attention/
:View External Link:
-
https://research.google/blog/safely-repairing-broken-builds-with-ml/
-
https://www.atlasobscura.com/articles/japanese-green-tea-history
-
https://www.atlasobscura.com/articles/parthenogenesis-uruguay-stick-mantis-insect
-
https://www.getlibretto.com/blog/does-it-matter-which-examples-you-choose-for-few-shot-prompting
-
https://www.nytimes.com/2024/04/22/dining/vegan-chef-matthew-kenney.html
-
https://www.snowflake.com/blog/arctic-open-efficient-foundation-language-models-snowflake/
-
https://www.wired.com/story/combined-heart-pump-pig-kidney-transplant-surgery/
:View External Link:
https://www.wired.com/story/combined-heart-pump-pig-kidney-transplant-surgery/
Link Bibliography
-
https://arxiv.org/abs/2310.02226
: “Think Before You Speak: Training Language Models With Pause Tokens”, Sachin Goyal, Ziwei Ji, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar, Vaishnavh Nagarajan -
1999-mcglone.pdf
: “The Keats Heuristic: Rhyme As Reason in Aphorism Interpretation”, Matthew S. McGlone, Jessica Tofighbakhsh -
https://www.bloomberg.com/news/articles/2024-04-24/toptal-founder-and-investor-tensions-spark-lawsuit
: “Battle Over Startup Leaves Early Investor With No Equity, $2.6 Million Legal Bill § The Information PR War”, Sarah McBride -
https://arxiv.org/abs/2404.15574
: “Retrieval Head Mechanistically Explains Long-Context Factuality”, Wenhao Wu, Yizhong Wang, Guangxuan Xiao, Hao Peng, Yao Fu -
https://arxiv.org/abs/2404.14047
: “How Good Are Low-Bit Quantized LLaMA-3 Models? An Empirical Study”, -
https://www.wsj.com/tech/ai/sam-altman-investment-exowatt-energy-startup-ai-data-centers-eeeca766
: “Sam Altman Invests in Energy Startup Focused on AI Data Centers: Investment by OpenAI CEO Highlights Artificial Intelligence’s Electricity Appetite”, Amrith Ramkumar -
2015-vapnik.pdf
: “Learning With Intelligent Teacher: Similarity Control and Knowledge Transfer”, Vladimir Vapnik, Rauf Izmailov -
2001-cazenave.pdf
: “Iterative Widening”, Tristan Cazenave -
https://papers.agoric.com/papers/incentive-engineering-for-computational-resource-management/full-text/
: “Incentive Engineering: for Computational Resource Management”, Eric Drexler, Mark S. Miller