math tag

See Also
Gwern
Links
Miscellaneous
Link Bibliography

[Warning: JavaScript Disabled!]

[For support of key website features (link annotation popups/popovers & transclusions, collapsible sections, backlinks, tablesorting, image zooming, sidenotes etc), you must enable JavaScript.]

See Also

Gwern

“One Man’s Modus Ponens”, Gwern 2012

One Man’s Modus Ponens

“Prediction Markets”, Gwern 2009

Prediction Markets

“Girl Scouts & Good Corporate Governance”, Gwern 2011

Girl Scouts & Good Corporate Governance

“Simulation Inferences”, Gwern 2009

Simulation Inferences

Links

“Wu’s Method Can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry”, Sinha et al 2024

Wu’s Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry

“Tokenization Counts: the Impact of Tokenization on Arithmetic in Frontier LLMs”, Singh & Strouse 2024

Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs

“Leveraging Large Language Models to Boost Dafny’s Developers Productivity”, Silva et al 2024

Leveraging Large Language Models to Boost Dafny’s Developers Productivity

“Solving Olympiad Geometry without Human Demonstrations”, Trinh et al 2024

Solving olympiad geometry without human demonstrations

“Generative AI for Math: Part I—MathPile: A Billion-Token-Scale Pretraining Corpus for Math”, Wang et al 2023

Generative AI for Math: Part I—MathPile: A Billion-Token-Scale Pretraining Corpus for Math

“PRER: Modeling Complex Mathematical Reasoning via Large Language Model Based MathAgent”, Liao et al 2023

PRER: Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent

“TinyGSM: Achieving >80% on GSM8k With Small Language Models”, Liu et al 2023

TinyGSM: achieving >80% on GSM8k with small language models

“Beyond Human Data: Scaling Self-Training for Problem-Solving With Language Models (ReST^EM)”, Singh et al 2023

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models (ReST^EM)

“Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning”, Dutta et al 2023

Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning

“Universal Self-Consistency for Large Language Model Generation”, Chen et al 2023

Universal Self-Consistency for Large Language Model Generation

“Training Chain-Of-Thought via Latent-Variable Inference”, Phan et al 2023

Training Chain-of-Thought via Latent-Variable Inference

“Why Won’t OpenAI Say What the Q^✱ Algorithm Is? Supposed AI Breakthroughs Are Frequently Veiled in Secrecy, Hindering Scientific Consensus”, Hao 2023

Why Won’t OpenAI Say What the Q^✱ Algorithm Is? Supposed AI breakthroughs are frequently veiled in secrecy, hindering scientific consensus

“Positional Description Matters for Transformers Arithmetic”, Shen et al 2023

Positional Description Matters for Transformers Arithmetic

“GPQA: A Graduate-Level Google-Proof Q&A Benchmark”, Rein et al 2023

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

“The Impact of Large Language Models on Scientific Discovery: a Preliminary Study Using GPT-4”, AI4Science & Quantum 2023

The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4

“Implicit Chain-Of-Thought Reasoning via Knowledge Distillation”, Deng et al 2023

Implicit Chain-of-Thought Reasoning via Knowledge Distillation

“Let Models Speak Ciphers: Multiagent Debate through Embeddings”, Pham et al 2023

Let Models Speak Ciphers: Multiagent Debate through Embeddings

“OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text”, Paster et al 2023

OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text

“Distinct Neuronal Representation of Small and Large Numbers in the Human Medial Temporal Lobe”, Kutter et al 2023

Distinct neuronal representation of small and large numbers in the human medial temporal lobe

“Papers With Computer-Checked Proofs”, Bernstein 2023

Papers with computer-checked proofs

“Solving Challenging Math Word Problems Using GPT-4 Code Interpreter With Code-Based Self-Verification”, Zhou et al 2023

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

“Testing GPT-4 With Wolfram Alpha and Code Interpreter Plug-Ins on Math and Science Problems”, Davis & Aaronson 2023

Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems

“Teaching Arithmetic to Small Transformers”, Lee et al 2023

Teaching Arithmetic to Small Transformers

“LeanDojo: Theorem Proving With Retrieval-Augmented Language Models”, Yang et al 2023

LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

“Let’s Verify Step by Step”, Lightman et al 2023

Let’s Verify Step by Step

“A Chiral Aperiodic Monotile”, Smith et al 2023

A chiral aperiodic monotile

“FERMAT: An Alternative to Accuracy for Numerical Reasoning”, Sivakumar & Moosavi 2023

FERMAT: An Alternative to Accuracy for Numerical Reasoning

“What Number Comes Next? The Encyclopedia of Integer Sequences Knows. The ‘Mathematical Equivalent to the FBI’s Voluminous Fingerprint Files’ Turns 50 This Year, With 362,765 Entries (and Counting)”, Roberts 2023

What Number Comes Next? The Encyclopedia of Integer Sequences Knows. The ‘mathematical equivalent to the FBI’s voluminous fingerprint files’ turns 50 this year, with 362,765 entries (and counting)

“How Does GPT-2 Compute Greater-Than?: Interpreting Mathematical Abilities in a Pre-Trained Language Model”, Hanna et al 2023

How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model

“Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition”, Muffo et al 2023

Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition

“The Spinorial Ball: a Macroscopic Object of Spin-1/2”, Bernard-Bernardet et al 2023

The spinorial ball: a macroscopic object of spin-1/2

“How Well Do Large Language Models Perform in Arithmetic Tasks?”, Yuan et al 2023

How well do Large Language Models perform in Arithmetic tasks?

“ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics”, Azerbayev et al 2023

ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics

“OEIS: A Handbook of Integer Sequences 50 Years Later”, Sloane 2023

OEIS: A Handbook of Integer Sequences 50 Years Later

“Solving Math Word Problems With Process & Outcome-Based Feedback”, Uesato et al 2022

Solving math word problems with process & outcome-based feedback

“What Is My Math Transformer Doing? – 3 Results on Interpretability and Generalization”, Charton 2022

What is my math transformer doing? – 3 results on interpretability and generalization

“Broken Neural Scaling Laws”, Caballero et al 2022

Broken Neural Scaling Laws

“Dynamic Prompt Learning via Policy Gradient for Semi-Structured Mathematical Reasoning”, Lu et al 2022

Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning

“Mathematical Proof Between Generations”, Bayer et al 2022

Mathematical Proof Between Generations

“Connecting the Scientific and Industrial Revolutions: The Role of Practical Mathematics”, Kelly & Gráda 2022

Connecting the Scientific and Industrial Revolutions: The Role of Practical Mathematics

“NaturalProver: Grounded Mathematical Proof Generation With Language Models”, Welleck et al 2022

NaturalProver: Grounded Mathematical Proof Generation with Language Models

“HTPS: HyperTree Proof Search for Neural Theorem Proving”, Lample et al 2022

HTPS: HyperTree Proof Search for Neural Theorem Proving

“End-To-End Symbolic Regression With Transformers”, Kamienny et al 2022

End-to-end symbolic regression with transformers

“The Sexes Do Not Differ in General Intelligence, but They Do in Some Specifics”, Reynolds et al 2022

The sexes do not differ in general intelligence, but they do in some specifics

“PaLM: Scaling Language Modeling With Pathways”, Chowdhery et al 2022

PaLM: Scaling Language Modeling with Pathways

“Impact of Pretraining Term Frequencies on Few-Shot Reasoning”, Razeghi et al 2022

Impact of Pretraining Term Frequencies on Few-Shot Reasoning

“Exact Number Concepts Are Limited to the Verbal Count Range”, Pitt et al 2022

Exact Number Concepts Are Limited to the Verbal Count Range

“Formal Mathematics Statement Curriculum Learning”, Polu et al 2022

Formal Mathematics Statement Curriculum Learning

“Deep Symbolic Regression for Recurrent Sequences”, d’Ascoli et al 2022

Deep Symbolic Regression for Recurrent Sequences

“Counting and the Ontogenetic Origins of Exact Equality”, Schneider et al 2022

Counting and the ontogenetic origins of exact equality

“A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More”, Drori et al 2021

A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More

“What Is the Point of Computers? A Question for Pure Mathematicians”, Buzzard 2021

What is the point of computers? A question for pure mathematicians

“Scaling Language Models: Methods, Analysis & Insights from Training Gopher”, Rae et al 2021

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

“Linear Algebra With Transformers”, Charton 2021

Linear algebra with transformers

“Training Verifiers to Solve Math Word Problems”, Cobbe et al 2021

Training Verifiers to Solve Math Word Problems

“MiniF2F: a Cross-System Benchmark for Formal Olympiad-Level Mathematics”, Zheng et al 2021

MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics

“A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers”, Miao et al 2021

A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers

“SymbolicGPT: A Generative Transformer Model for Symbolic Regression”, Valipour et al 2021

SymbolicGPT: A Generative Transformer Model for Symbolic Regression

“Basins With Tentacles”, Zhang & Strogatz 2021

Basins with tentacles

“Behavioral and Neuronal Representation of Numerosity Zero in the Crow”, Kirschhock et al 2021

Behavioral and Neuronal Representation of Numerosity Zero in the Crow

“MathBERT: A Pre-Trained Model for Mathematical Formula Understanding”, Peng et al 2021

MathBERT: A Pre-Trained Model for Mathematical Formula Understanding

“Constructions in Combinatorics via Neural Networks”, Wagner 2021

Constructions in combinatorics via neural networks

“NaturalProofs: Mathematical Theorem Proving in Natural Language”, Welleck et al 2021

NaturalProofs: Mathematical Theorem Proving in Natural Language

“Are NLP Models Really Able to Solve Simple Math Word Problems?”, Patel et al 2021

Are NLP Models really able to Solve Simple Math Word Problems?

“Measuring Mathematical Problem Solving With the MATH Dataset”, Hendrycks et al 2021

Measuring Mathematical Problem Solving With the MATH Dataset

“TacticZero: Learning to Prove Theorems from Scratch With Deep Reinforcement Learning”, Wu et al 2021

TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning

“Proof Artifact Co-Training for Theorem Proving With Language Models”, Han et al 2021

Proof Artifact Co-training for Theorem Proving with Language Models

“LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning”, Wu et al 2021

LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning

“How the Slowest Computer Programs Illuminate Math’s Fundamental Limits: The Goal of the ‘Busy Beaver’ Game Is to Find the Longest-Running Computer Program. Its Pursuit Has Surprising Connections to Some of the Most Profound Questions and Concepts in Mathematics”, Pavlus 2020

How the Slowest Computer Programs Illuminate Math’s Fundamental Limits: The goal of the ‘busy beaver’ game is to find the longest-running computer program. Its pursuit has surprising connections to some of the most profound questions and concepts in mathematics

“The Empirical Metamathematics of Euclid and Beyond”, Wolfram 2020

The Empirical Metamathematics of Euclid and Beyond

“MMLU: Measuring Massive Multitask Language Understanding”, Hendrycks et al 2020

MMLU: Measuring Massive Multitask Language Understanding

“Generative Language Modeling for Automated Theorem Proving”, Polu & Sutskever 2020

Generative Language Modeling for Automated Theorem Proving

“A Promising Path Towards Autoformalization and General Artificial Intelligence”, Szegedy 2020

A Promising Path Towards Autoformalization and General Artificial Intelligence

“Lights and Shadows”, Ciechanowski 2020

Lights and Shadows

“Singing Euclid: the Oral Character of Greek Geometry”, Blåsjö 2020

Singing Euclid: the oral character of Greek geometry

“Mathematical Reasoning via Self-Supervised Skip-Tree Training”, Rabe et al 2020

Mathematical Reasoning via Self-supervised Skip-tree Training

“Remembering John Conway’s FRACTRAN, a Ridiculous, yet Surprisingly Deep Language”, Braithwaite 2020

Remembering John Conway’s FRACTRAN, a ridiculous, yet surprisingly deep language

“Radical Solutions: French Mathematician Évariste Galois Lived a Full Life. When He Wasn’t Trying to Overthrow the Government, He Was Reinventing Algebra”, Brook & Macfarlane 2020

Radical Solutions: French mathematician Évariste Galois lived a full life. When he wasn’t trying to overthrow the government, he was reinventing algebra

“Learning to Prove Theorems by Learning to Generate Theorems”, Wang & Deng 2020

Learning to Prove Theorems by Learning to Generate Theorems

“Transformers As Soft Reasoners over Language”, Clark et al 2020

Transformers as Soft Reasoners over Language

“Neural Arithmetic Units”, Madsen & Johansen 2020

Neural Arithmetic Units

“Generative Language Modeling for Automated Theorem Proving § Experiments”, Polu & Sutskever 2020 (page 11 org openai)

Generative Language Modeling for Automated Theorem Proving § Experiments

“Deep Learning for Symbolic Mathematics”, Lample & Charton 2019

Deep Learning for Symbolic Mathematics

“Do NLP Models Know Numbers? Probing Numeracy in Embeddings”, Wallace et al 2019

Do NLP Models Know Numbers? Probing Numeracy in Embeddings

“Ternary Circuits: Why R=3 Is Not the Optimal Radix for Computation”, Etiemble 2019

Ternary circuits: why R=3 is not the Optimal Radix for Computation

“MAWPS: A Math Word Problem Repository”, Koncel-Kedziorski et al 2019

MAWPS: A Math Word Problem Repository

“Learning to Reason in Large Theories without Imitation”, Bansal et al 2019

Learning to Reason in Large Theories without Imitation

“Analysing Mathematical Reasoning Abilities of Neural Models”, Saxton et al 2019

Analysing Mathematical Reasoning Abilities of Neural Models

“Paul Erdős’s Mathematics As a Social Activity”, Rekvenyi 2019

Paul Erdős’s mathematics as a social activity

“Fancy Euclid’s Elements in T_eX”, Slyusarev 2019

Fancy Euclid’s Elements in T_eX

“Reinventing the Wheel: Discovering the Optimal Rolling Shape With PyTorch”, Wiener 2019

Reinventing the Wheel: Discovering the Optimal Rolling Shape with PyTorch

“The First Printed Math Books”, Boardley 2019

The First Printed Math Books

“Making of Byrne’s Euclid”, Rougeux 2018

Making of Byrne’s Euclid

“Best Practices: Formal Proofs, the Fine Print and Side Effects”, Murray & Oorschot 2018

Best Practices: Formal Proofs, the Fine Print and Side Effects

“Mastering Chess and Shogi by Self-Play With a General Reinforcement Learning Algorithm”, Silver et al 2017

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

“From Boiling Lead and Black Art: An Essay on the History of Mathematical Typography”, Smith 2017

From boiling lead and black art: An essay on the history of mathematical typography

“Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems”, Ling et al 2017

Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems

“Solving General Arithmetic Word Problems”, Roy & Roth 2016

Solving General Arithmetic Word Problems

“DeepMath—Deep Sequence Models for Premise Selection”, Alemi et al 2016

DeepMath—Deep Sequence Models for Premise Selection

“A Relatively Small Turing Machine Whose Behavior Is Independent of Set Theory”, Yedidia & Aaronson 2016

A Relatively Small Turing Machine Whose Behavior Is Independent of Set Theory

“The LEGO Counting Problem”, Eilers 2016

The LEGO Counting Problem

“Too Good to Be True: When Overwhelming Evidence Fails to Convince”, Gunn et al 2016

Too good to be true: when overwhelming evidence fails to convince

“Probabilistic Integration: A Role in Statistical Computation?”, Briol et al 2015

Probabilistic Integration: A Role in Statistical Computation?

“Random Gradient-Free Minimization of Convex Functions”, Nesterov & Spokoiny 2015

Random Gradient-Free Minimization of Convex Functions

“Prizes and Productivity: How Winning the Fields Medal Affects Scientific Output”, Borjas & Doran 2015

Prizes and Productivity: How Winning the Fields Medal Affects Scientific Output

“Is There a Curse of the Fields Medal?”, Kollár 2015

Is There a Curse of the Fields Medal?

“The Misfortunes of a Trio of Mathematicians Using Computer Algebra Systems—Can We Trust in Them?”, Durán et al 2014

The Misfortunes of a Trio of Mathematicians Using Computer Algebra Systems—Can We Trust in Them?

“Neural Networks, Manifolds, and Topology”, Olah 2014

Neural Networks, Manifolds, and Topology

“Finite Time Blowup for an Averaged Three-Dimensional Navier-Stokes Equation”, Tao 2014

Finite time blowup for an averaged three-dimensional Navier-Stokes equation

“Homotopy Groups of Suspended Classifying Spaces: An Experimental Approach”, Romero & Rubio 2013

Homotopy groups of suspended classifying spaces: An experimental approach

“On Unsettleable Arithmetical Problems”, Conway 2013

On Unsettleable Arithmetical Problems

“The Algebraic Combinatorial Approach for Low-Rank Matrix Completion”, Király et al 2012

The Algebraic Combinatorial Approach for Low-Rank Matrix Completion

“Mind Switches in Futurama and Stargate”, Evans & Huang 2012

Mind Switches in Futurama and Stargate

“Vividness in Mathematics and Narrative”, Gowers 2012

Vividness in Mathematics and Narrative⁠:

View PDF:

/doc/math/2012-gowers.pdf

“How to Write a 21^st Century Proof”, Lamport 2011

How to Write a 21^st Century Proof

“Jewish Problems”, Khovanova & Radul 2011

Jewish Problems

“The Cosmic Distance Ladder”, Tao 2010

The Cosmic Distance Ladder

“Coolex: The Coolest Way to Generate Combinations”, Ruskey & Williams 2009

Coolex: The coolest way to generate combinations

“Packing Unit Squares in Squares: A Survey and New Results”, Friedman 2009

Packing Unit Squares in Squares: A Survey and New Results

“Desperately Seeking Mathematical Proof”, Nathanson 2009

Desperately seeking mathematical proof

“The Gödel Letter”, Gödel 2009

The Gödel Letter

“11858_2008_132_41_1-Web 45..60”

11858_2008_132_41_1-web 45..60⁠:

View PDF:

/doc/math/2009-sinclair.pdf

“Probing the Improbable: Methodological Challenges for Risks With Low Probabilities and High Stakes”, Ord et al 2008

Probing the Improbable: Methodological Challenges for Risks with Low Probabilities and High Stakes

“The Epic Story of Maximum Likelihood”, Stigler 2007

The Epic Story of Maximum Likelihood

“Overhang”, Paterson & Zwick 2007

“The Monotype 4-Line System for Setting Mathematics”, Rhatigan 2007

The Monotype 4-Line System for Setting Mathematics

“Maximum Overhang”, Paterson et al 2007

Maximum overhang

“Computational Discovery in Pure Mathematics”, Colton 2007

Computational Discovery in Pure Mathematics

“Béla Bollobás: Graphs Extremal and Random [Interview of Béla Bollobás by Y. K. Leong]”, Leong & Bollobás 2007

Béla Bollobás: Graphs Extremal and Random [Interview of Béla Bollobás by Y. K. Leong]⁠:

View PDF:

/doc/math/2007-leong.pdf

“Comment on a Paper by Yucai Su On the Jacobian Conjecture (2005-12-30)”, Moh 2006

Comment on a Paper by Yucai Su On the Jacobian Conjecture (2005-12-30)

“Proof of Two Dimensional Jacobian Conjecture”, Su 2005

Proof of Two Dimensional Jacobian Conjecture

“Online Convex Programming and Generalized Infinitesimal Gradient Ascent”, Zinkevich 2003

Online Convex Programming and Generalized Infinitesimal Gradient Ascent

“EWD1300: The Notational Conventions I Adopted, and Why”, Dijkstra 2002

EWD1300: The Notational Conventions I Adopted, and Why

“Hymne to Hymen”, Descartes & Smith 2002

Hymne to Hymen⁠:

View PDF:

/doc/math/2002-descartes.pdf

“The War of the Frogs and the Mice, or the Crisis of the Mathematische Annalen”, Dalen 2001

The War of the Frogs and the Mice, or the Crisis of the Mathematische Annalen⁠:

View PDF:

/doc/math/2001-vandalen.pdf

“Making Mathematics: The Coffee Connection”, Wieschenberg 1999

Making Mathematics: The Coffee Connection⁠:

View PDF:

/doc/math/1999-wieschenberg.pdf

“An Editor Recalls Some Hopeless Papers”, Hodges 1998

An Editor Recalls Some Hopeless Papers

“How Did Software Get so Reliable without Proof?”, Hoare 1996

How did software get so reliable without proof?

“Light Shadows: Remembrances of Yale in the Early Fifties”, Rota 1996

Light Shadows: Remembrances of Yale in the Early Fifties

“Ten Lessons I Wish I Had Been Taught”, Rota 1996

Ten Lessons I Wish I Had Been Taught

“Riemann Zeta Function Is a Fractal”, Woon 1994

Riemann zeta function is a fractal

“A Visit to Hungarian Mathematics”, Hersh & John-Steiner 1993

A visit to Hungarian mathematics⁠:

View PDF:

/doc/math/1993-hersh.pdf

“What in Heaven Is a Digital Sundial?”, stewart 1991

What in Heaven Is a Digital Sundial?⁠:

View PDF:

/doc/math/1991-stewart.pdf

“How I Was Led to the Frequency Approach”, Hamming 1991

How I was led to the frequency approach⁠:

View PDF:

/doc/math/1991-hamming.pdf

“On the Computational Complexity of the Jones and Tutte Polynomials”, Jaeger et al 1990

On the computational complexity of the Jones and Tutte polynomials

“Factors and Primes: a Specific Numerical Ability”, Hermelin & O’Connor 1990

Factors and primes: a specific numerical ability

“Envisioning Information: Chapter 5, ‘Color and Information’, Pg83-86 [on Oliver Byrne’s Color Diagram Version of Euclid’s Elements]”, Tufte 1990

Envisioning Information: chapter 5, ‘Color and Information’, pg83-86 [on Oliver Byrne’s color diagram version of Euclid’s Elements]

“In Memory of Henry J. Kelley”, Cliff 1989

In memory of Henry J. Kelley

“Dynamical Systems That Sort Lists, Diagonalize Matrices and Solve Linear Programming Problems”, Brockett 1988

Dynamical systems that sort lists, diagonalize matrices and solve linear programming problems

“The Printing of Mathematics”, Wishart 1988

The Printing of Mathematics

“The Emergence of Princeton As a World Center for Mathematical Research, 1896--1939”, Aspray 1988

The Emergence of Princeton as a World Center for Mathematical Research, 1896--1939⁠:

View PDF:

/doc/www/www.ams.org/8a85ac1a003826664827022e225240cccf621019.pdf

“The Aesthetic Viewpoint in Mathematics”

The aesthetic viewpoint in mathematics⁠:

View PDF:

/doc/math/1987-krull.pdf

“John Von Neumann As Seen By His Brother”, Vonneuman 1987

John von Neumann As Seen By His Brother⁠:

View PDF:

/doc/math/1987-vonneumann.pdf

“Review of Yuri I. Manin Yu, A Course in Mathematical Logic 1997”, Boolos 1986

Review of Yuri I. Manin Yu, A course in mathematical logic 1997⁠:

View PDF:

/doc/math/1986-boolos.pdf

“Discrete Hartley Transform”, Bracewell 1983

Discrete Hartley transform

“Are Impossible Figures Possible?”, Kulpa 1983

Are impossible figures possible?

“Bi-Continuous Extensions of Invertible Combinatorial Functions”, Toffoli 1981

Bi-continuous extensions of invertible combinatorial functions

“The Letter S”, Knuth 1980

The Letter S

“Some Proposals for Reviving the Philosophy of Mathematics”, Hersh 1979

Some Proposals for Reviving the Philosophy of mathematics⁠:

View PDF:

/doc/math/1979-hersh.pdf

“Heaviside's Operational Calculus and the Attempts to Rigorise It”, Lützen 1979

Heaviside's Operational Calculus and the Attempts to Rigorise It⁠:

View PDF:

/doc/math/1979-lutzen.pdf

“Social Processes and Proofs of Theorems and Programs”, Millo et al 1979

Social Processes and Proofs of Theorems and Programs

“Life at Low Reynolds Number”, Purcell 1977

Life at low Reynolds number

“Randomness and Mathematical Proof”, Chatin 1975

Randomness and Mathematical Proof⁠:

View PDF:

/doc/math/1975-chaitin.pdf

“The Dangers of Computer-Science Theory”, Knuth 1973

The Dangers of Computer-Science Theory

“Fidelity in Mathematical Discourse: Is One and One Really Two?”, Davis 1972

Fidelity in Mathematical Discourse: Is One and One Really Two?⁠:

View PDF:

/doc/math/1972-davis.pdf

“The Humble Programmer [EWD340]”, Dijkstra 1972

The Humble Programmer [EWD340]

“Assigning Probabilities to Logical Formulas”, Scott & Krauss 1966

Assigning Probabilities to Logical Formulas

“Singular Extremals In Lawden’s Problem Of Optimal Rocket Flight”, Kelley 1963

Singular Extremals In Lawden’s Problem Of Optimal Rocket Flight

“A Steepest-Ascent Method for Solving Optimum Programming Problems”, Bryson & Denham 1962

A Steepest-Ascent Method for Solving Optimum Programming Problems

“Method of Gradients”, Kelley 1962

Method of Gradients

“Gradient Theory of Optimal Flight Paths”, Kelley 1960

Gradient Theory of Optimal Flight Paths

“Toward Mechanical Mathematics”, Wang 1960

Toward Mechanical Mathematics

“Stable Predictor-Corrector Methods for Ordinary Differential Equations”, Hamming 1959

Stable Predictor-Corrector Methods for Ordinary Differential Equations

“The Printing of Mathematics: Aids for Authors and Editors and Rules for Compositors and Readers at the University Press, Oxford”, Chaundy et al 1954

The Printing of Mathematics: Aids for Authors and Editors and Rules for Compositors and Readers at the University Press, Oxford

“Non-Cooperative Games”, Nash 1951

Non-Cooperative Games

“Principles of the Self-Organizing Dynamic System”, Ashby 1947

Principles of the Self-Organizing Dynamic System

“An Essay On The Psychology Of Invention In The Mathematical Field”, Hadamard 1945

An Essay On The Psychology Of Invention In The Mathematical Field

“A More Symmetrical Fourier Analysis Applied to Transmission Problems”, Hartley 1942

A More Symmetrical Fourier Analysis Applied to Transmission Problems

“Leonhard Euler's Elastic Curves”

Leonhard Euler's Elastic Curves⁠:

View PDF:

/doc/math/1933-oldfather.pdf

“On a Problem of Formal Logic”, Ramsey 1930

On a Problem of Formal Logic

“Operational Methods in Mathematical Physics”, Carslaw 1928

Operational Methods in Mathematical Physics

“The Foundations of Mathematics”, Ramsey 1926b

The Foundations of Mathematics

“Cutting a Round Cake on Scientific Principles”, Galton 1906

Cutting a Round Cake on Scientific Principles⁠:

View PDF:

/doc/math/1906-galton.pdf

“On Operators in Physical Mathematics. Part I”, Heaviside 1892

On Operators in Physical Mathematics. Part I⁠:

View PDF:

/doc/math/1892-heaviside.pdf

“Why Momentum Really Works”

Why Momentum Really Works

“Math: OpenAI API Can Do Some Math out of the Gate, but Most Math It Seems It Has to Learn. Many Times, the Numbers That It Spits out Are Just Random. However, including Different Priming Prompts Can Result in Decent Results.”

Math: OpenAI API can do some math out of the gate, but most math it seems it has to learn. Many times, the numbers that it spits out are just random. However, including different priming prompts can result in decent results.

“Oliver Byrne’s Edition of Euclid [Scans]”, Casselman 2024

Oliver Byrne’s edition of Euclid [Scans]

spolu

The examples are indeed extremely simple on purpose (otherwise it’s hard to communicate efficiently what’s happening to non-Metamath experts). That being said, we’re still pretty far away from IMOs; but this is definitely a goal for us, and one we’re actively working towards!

Wikipedia

All horses are the same color⁠:

View External Link:

https://en.wikipedia.org/wiki/All_horses_are_the_same_color
Büchi arithmetic
Backpropagation
Blackboard bold
Commutative diagram
Coreset
Hilbert’s paradox of the Grand Hotel⁠:

View External Link:

https://en.wikipedia.org/wiki/Hilbert%27s_paradox_of_the_Grand_Hotel
Jeep problem
Journal of Integer Sequences⁠:

View External Link:

https://en.wikipedia.org/wiki/Journal_of_Integer_Sequences
Lanchester’s laws
Long line (topology)⁠:

View External Link:

https://en.wikipedia.org/wiki/Long_line_%28topology%29
Mathematical diagram
Metamath
Moving sofa problem⁠:

View External Link:

https://en.wikipedia.org/wiki/Moving_sofa_problem
Neil Sloane⁠:

View External Link:

https://en.wikipedia.org/wiki/Neil_Sloane
Nomogram
On-Line Encyclopedia of Integer Sequences
On-Line Encyclopedia of Integer Sequences § Self-referential sequences⁠:

View External Link:

https://en.wikipedia.org/wiki/On-Line_Encyclopedia_of_Integer_Sequences#Self-referential_sequences
On-Line Encyclopedia of Integer Sequences § Sloane's gap⁠:

View External Link:

https://en.wikipedia.org/wiki/On-Line_Encyclopedia_of_Integer_Sequences#Sloane's_gap
Proof without words
Scottish Book⁠:

View External Link:

https://en.wikipedia.org/wiki/Scottish_Book
Scottish Café⁠:

View External Link:

https://en.wikipedia.org/wiki/Scottish_Caf%C3%A9
Square packing in a square⁠:

View External Link:

https://en.wikipedia.org/wiki/Square_packing_in_a_square
Time-saving bias
Ulam spiral
Wallpaper group

Miscellaneous

Link Bibliography

https://arxiv.org/abs/2402.14903: “Tokenization Counts: the Impact of Tokenization on Arithmetic in Frontier LLMs”, Aaditya K. Singh, D. J. Strouse

link-bibliography
https://arxiv.org/abs/2312.08926: “PRER: Modeling Complex Mathematical Reasoning via Large Language Model Based MathAgent”, Haoran Liao, Qinyi Du, Shaohua Hu, Hao He, Yanyan Xu, Jidong Tian, Yaohui Jin

link-bibliography
https://arxiv.org/abs/2312.06585#deepmind: “Beyond Human Data: Scaling Self-Training for Problem-Solving With Language Models (ReST^EM)”, Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Peter J. Liu, James Harrison

link-bibliography
https://arxiv.org/abs/2312.02179: “Training Chain-Of-Thought via Latent-Variable Inference”, Du Phan, Matthew D. Hoffman, David Dohan, Sholto Douglas, Tuan Anh Le, Aaron Parisi, Pavel Sountsov, Charles Sutton

link-bibliography
https://arxiv.org/abs/2310.06786: “OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text”, Keiran Paster, Marco Dos Santos, Zhangir Azerbayev, Jimmy Ba

link-bibliography
https://cr.yp.to/papers/pwccp-20230907.pdf: “Papers With Computer-Checked Proofs”, Daniel J. Bernstein

link-bibliography
https://arxiv.org/abs/2308.07921: “Solving Challenging Math Word Problems Using GPT-4 Code Interpreter With Code-Based Self-Verification”, Aojun Zhou, Ke Wang, Zimu Lu, Weikang Shi, Sichun Luo, Zipeng Qin, Shaoqing Lu, Anya Jia, Linqi Song

link-bibliography
https://arxiv.org/abs/2307.03381: “Teaching Arithmetic to Small Transformers”, Nayoung Lee, Kartik Sreenivasan, Jason D. Lee, Kangwook Lee, Dimitris Papailiopoulos

link-bibliography
https://arxiv.org/abs/2306.15626: “LeanDojo: Theorem Proving With Retrieval-Augmented Language Models”, Kaiyu Yang, Aidan M. Swope, Alex Gu, Rahul Chalamala, Peiyang Song, Shixing Yu, Saad Godil, Ryan Prenger

link-bibliography
https://arxiv.org/abs/2305.20050#openai: “Let’s Verify Step by Step”, Hunter Lightman, Vineet Kosaraju, Yura Burda, Harri Edwards, Bowen Baker, Teddy Lee, Jan Leike, John Schulman

link-bibliography
https://arxiv.org/abs/2304.02015#alibaba: “How Well Do Large Language Models Perform in Arithmetic Tasks?”, Zheng Yuan, Hongyi Yuan, Chuanqi Tan, Wei Wang, Songfang Huang

link-bibliography
https://arxiv.org/abs/2302.12433: “ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics”, Zhangir Azerbayev, Bartosz Piotrowski, Hailey Schoelkopf, Edward W. Ayers, Dragomir Radev, Jeremy Avigad

link-bibliography
http://neilsloane.com/doc/HIS50.pdf: “OEIS: A Handbook of Integer Sequences 50 Years Later”, Neil Sloane

link-bibliography
https://arxiv.org/abs/2205.12910#allen: “NaturalProver: Grounded Mathematical Proof Generation With Language Models”, Sean Welleck, Jiacheng Liu, Ximing Lu, Hannaneh Hajishirzi, Yejin Choi

link-bibliography
https://arxiv.org/abs/2205.11491#facebook: “HTPS: HyperTree Proof Search for Neural Theorem Proving”, Guillaume Lample, Marie-Anne Lachaux, Thibaut Lavril, Xavier Martinet, Amaury Hayat, Gabriel Ebner, Aurélien Rodriguez

link-bibliography
2022-reynolds.pdf: “The Sexes Do Not Differ in General Intelligence, but They Do in Some Specifics”, Matthew R. Reynolds, Daniel B. Hajovsky, Jacqueline M. Caemmerer

link-bibliography
https://arxiv.org/abs/2204.02311#google: “PaLM: Scaling Language Modeling With Pathways”, Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham

link-bibliography
https://arxiv.org/abs/2202.01344#openai: “Formal Mathematics Statement Curriculum Learning”, Stanislas Polu, Jesse Michael Han, Kunhao Zheng, Mantas Baksys, Igor Babuschkin, Ilya Sutskever

link-bibliography
2022-schneider.pdf: “Counting and the Ontogenetic Origins of Exact Equality”, Rose M. Schneider, Erik Brockbank, Roman Feiman, David Barner

link-bibliography
https://arxiv.org/abs/2112.11446#deepmind: “Scaling Language Models: Methods, Analysis & Insights from Training Gopher”, Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides

link-bibliography
https://arxiv.org/abs/2110.14168#openai: “Training Verifiers to Solve Math Word Problems”, Karl Cobbe, Vineet Kosaraju, Mohammad Bavarian, Jacob Hilton, Reiichiro Nakano, Christopher Hesse, John Schulman

link-bibliography
https://www.quantamagazine.org/how-the-slowest-computer-programs-illuminate-maths-fundamental-limits-20201210/: “How the Slowest Computer Programs Illuminate Math’s Fundamental Limits: The Goal of the ‘Busy Beaver’ Game Is to Find the Longest-Running Computer Program. Its Pursuit Has Surprising Connections to Some of the Most Profound Questions and Concepts in Mathematics”, John Pavlus

link-bibliography
https://arxiv.org/abs/2009.03300: “MMLU: Measuring Massive Multitask Language Understanding”, Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, Jacob Steinhardt

link-bibliography
https://arxiv.org/abs/2009.03393#openai: “Generative Language Modeling for Automated Theorem Proving”, Stanislas Polu, Ilya Sutskever

link-bibliography
https://www.c82.net/blog/?id=79: “Making of Byrne’s Euclid”, Nicholas Rougeux

link-bibliography
2013-romero.pdf: “Homotopy Groups of Suspended Classifying Spaces: An Experimental Approach”, Ana Romero, Julio Rubio

link-bibliography
2013-conway.pdf: “On Unsettleable Arithmetical Problems”, John H. Conway

link-bibliography
2003-zinkevich.pdf: “Online Convex Programming and Generalized Infinitesimal Gradient Ascent”, Martin Zinkevich

link-bibliography
1990-tufte-envisioninginformation-ch5-byrneseuclid.pdf: “Envisioning Information: Chapter 5, ‘Color and Information’, Pg83-86 [on Oliver Byrne’s Color Diagram Version of Euclid’s Elements]”, Edward Tufte

link-bibliography
https://archive.org/details/eassayonthepsych006281mbp: “An Essay On The Psychology Of Invention In The Mathematical Field”, Jacques Hadamard

link-bibliography

[Quote Of The Day]

[Site Of The Day]

[Annotation Of The Day]

[adblock public service announcement]