GPT-3 tag

Gwern Branwen

Gwern

“The Scaling Hypothesis”, Gwern 2020

Links

“RAG vs Fine-Tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture”, Balaguer et al 2024

“Inside the Chaos at OpenAI: Sam Altman’s Weekend of Shock and Drama Began a Year Ago, With the Release of ChatGPT”, Hao & Warzel 2023

“Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation”, Ding et al 2023

“Does GPT-4 Pass the Turing Test?”, Jones & Bergen 2023

“PAIR: Jailbreaking Black Box Large Language Models in 20 Queries”, Chao et al 2023

“Fine-Tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!”, Qi et al 2023

“Non-Determinism in GPT-4 Is Caused by Sparse MoE”, 152334H 2023

“Large Language Models As Superpositions of Cultural Perspectives”, Kovač et al 2023

“AI Is a Lot of Work: As the Technology Becomes Ubiquitous, a Vast Tasker Underclass Is Emerging—And Not Going Anywhere”, Dzieza 2023

“I’m Afraid I Can’t Do That: Predicting Prompt Refusal in Black-Box Generative Language Models”, Reuter & Schulze 2023

“Why Didn't DeepMind Build GPT-3?”, Godwin 2023

“OpenAI’s Sam Altman Talks ChatGPT And How Artificial General Intelligence Can ‘Break Capitalism’”, Konrad & Cai 2023

“GPT-3 As Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities”, Bommarito et al 2023

“HALIE: Evaluating Human-Language Model Interaction”, Lee et al 2022

“TruthfulQA: Measuring How Models Mimic Human Falsehoods”, Lin et al 2021

“‘How GPT-3 Is Shaping Our AI Future’ With Sam Altman/Azeem Azhar (The Exponential View), Wednesday 7 October 2020”

/doc/ai/nn/transformer/gpt/3/2020-10-06-exponentialview-samaltman-152648-s5e01-howgpt3isshapingouraifuture.mp3#t=2205

“Towards Synthesizing Complex Programs from Input-Output Examples”, Chen et al 2017

“Genetics of Caffeine Consumption and Responses to Caffeine”, Yang et al 2010

Sort By Magic

Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.

Beginning with the newest annotation, it uses the embedding of each annotation to attempt to create a list of nearest-neighbor annotations, creating a progression of topics. For more details, see the link.
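
The ordering described above can be sketched as a greedy nearest-neighbor walk over annotation embeddings. This is only an illustration under stated assumptions, not the site's actual implementation: the function names, the use of cosine distance, and the toy 2-dimensional embeddings are all hypothetical, and the clustering and auto-labeling steps are not shown.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two equal-length, non-zero vectors.

    Cosine distance is an assumption; the page does not name the metric.
    """
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def sort_by_nearest_neighbor(items, embeddings, start=0):
    """Greedily order items so each one is followed by its closest unvisited neighbor.

    `start` is the index of the seed item (for example, the newest annotation).
    Ties are broken by the lower index so the result is deterministic.
    """
    if not items:
        return []
    remaining = set(range(len(items))) - {start}
    order = [start]
    while remaining:
        current = order[-1]
        nearest = min(
            remaining,
            key=lambda i: (cosine_distance(embeddings[current], embeddings[i]), i),
        )
        order.append(nearest)
        remaining.remove(nearest)
    return [items[i] for i in order]


# Hypothetical toy data: four annotations with made-up 2-D embeddings.
titles = ["scaling", "caffeine", "jailbreaks", "genetics"]
vectors = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.1, 0.9]]
ordered = sort_by_nearest_neighbor(titles, vectors, start=0)
```

With this toy data, the walk starts at "scaling", moves to the most similar vector, and continues until every item has been visited, so related items end up adjacent. A greedy walk like this is simple but does not guarantee a globally optimal ordering.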