statistical comparison tag

Gwern Branwen

See Also
Gwern
Links
Miscellaneous

[Warning: JavaScript Disabled!]

[For support of key website features (link annotation popups/popovers & transclusions, collapsible sections, backlinks, tablesorting, image zooming, sidenotes etc), you must enable JavaScript.]

Gwern

“GPT-2 Preference Learning for Music Generation”, Gwern 2019

GPT-2 Preference Learning for Music Generation

“Resorting Media Ratings”, Gwern 2015

Resorting Media Ratings

Links

“Predicting the Direction of Phenotypic Difference”, Gokhman et al 2024

Predicting the direction of phenotypic difference

“Diffusion Model Alignment Using Direct Preference Optimization”, Wallace et al 2023

Diffusion Model Alignment Using Direct Preference Optimization

“A General Theoretical Paradigm to Understand Learning from Human Preferences”, Azar et al 2023

A General Theoretical Paradigm to Understand Learning from Human Preferences

“On the Optimal Bounds for Noisy Computing”, Zhu et al 2023

On the Optimal Bounds for Noisy Computing

“Direct Preference Optimization (DPO): Your Language Model Is Secretly a Reward Model”, Rafailov et al 2023

Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model

“Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems”, Feng et al 2023

Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems

“Reputation Inflation”, Filippas et al 2022

Reputation Inflation

“Bayesian Inference of the Climbing Grade Scale”, Drummond & Popinga 2021

Bayesian inference of the climbing grade scale

“PiRank: Learning To Rank via Differentiable Sorting”, Swezey et al 2020

PiRank: Learning To Rank via Differentiable Sorting

“Rank-Smoothed Pairwise Learning In Perceptual Quality Assessment”, Talebi et al 2020

Rank-Smoothed Pairwise Learning In Perceptual Quality Assessment

“Self-Play Learning Without a Reward Metric”, Schmidt et al 2019

Self-Play Learning Without a Reward Metric

“Group Testing: An Information Theory Perspective”, Aldridge et al 2019

Group Testing: An Information Theory Perspective⁠:

View PDF:

/doc/statistics/order/comparison/2019-aldridge.pdf

“Top-K Off-Policy Correction for a REINFORCE Recommender System”, Chen et al 2018

Top-K Off-Policy Correction for a REINFORCE Recommender System

“Comparison Based Learning from Weak Oracles”, Kazemi et al 2018

Comparison Based Learning from Weak Oracles

“OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning”, Henderson et al 2017

OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning

“Analogical-Based Bayesian Optimization”, Le et al 2017

Analogical-based Bayesian Optimization

“Spectral Method and Regularized MLE Are Both Optimal for Top-K Ranking”, Chen et al 2017

Spectral Method and Regularized MLE Are Both Optimal for Top-K Ranking

“The Competitiveness of Games in Professional Sports Leagues”, Wills 2017

The competitiveness of games in professional sports leagues

“Deep Reinforcement Learning from Human Preferences”, Christiano et al 2017

Deep reinforcement learning from human preferences

“PBO: Preferential Bayesian Optimization”, Gonzalez et al 2017

PBO: Preferential Bayesian Optimization

“D-TS: Double Thompson Sampling for Dueling Bandits”, Wu & Liu 2016

D-TS: Double Thompson Sampling for Dueling Bandits

“Just Sort It! A Simple and Effective Approach to Active Preference Learning”, Maystre & Grossglauser 2015

Just Sort It! A Simple and Effective Approach to Active Preference Learning

“On the Complexity of Best Arm Identification in Multi-Armed Bandit Models”, Kaufmann et al 2014

On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

“Bayesian Active Learning for Classification and Preference Learning”, Houlsby et al 2011

Bayesian Active Learning for Classification and Preference Learning

“Case Studies in Bayesian Computation Using INLA”, Martino & Rue 2010

Case studies in Bayesian computation using INLA

“Sorting from Noisy Information”, Braverman & Mossel 2009

Sorting from Noisy Information

“Can People Distinguish Pâté From Dog Food? [preprint]”, Bohannon et al 2009

Can People Distinguish Pâté From Dog Food? [preprint]

“Aggregating Inconsistent Information: Ranking and Clustering”, Ailon et al 2008

Aggregating inconsistent information: Ranking and clustering

“Pure Exploration for Multi-Armed Bandit Problems”, Bubeck et al 2008

Pure Exploration for Multi-Armed Bandit Problems

“Do More Expensive Wines Taste Better? Evidence from a Large Sample of Blind Tastings”, Goldstein et al 2008

Do More Expensive Wines Taste Better? Evidence from a Large Sample of Blind Tastings

“Noisy Sorting Without Resampling”, Braverman & Mossel 2007

Noisy Sorting Without Resampling

“Noisy Binary Search and Its Applications”, Karp & Kleinberg 2007

Noisy binary search and its applications

“Paired Comparison Models for Ranking National Soccer Teams”, Hallinan 2005

Paired Comparison Models for Ranking National Soccer Teams

“Bayesian Adaptive Exploration”, Loredo & Chernoff 2003

Bayesian Adaptive Exploration

“How Dangerous Are Drinking Drivers?”, Levitt & Porter 2001

How Dangerous Are Drinking Drivers?

“Sympercents: Symmetric Percentage Differences on the 100 Log_e Scale Simplify the Presentation of Log Transformed Data”, Cole 2000

Sympercents: symmetric percentage differences on the 100 log_e scale simplify the presentation of log transformed data

“Born Again Group Testing: Multiaccess Communications”, Wolf 1985

Born again group testing: Multiaccess communications⁠:

View PDF:

/doc/statistics/order/comparison/1985-wolf.pdf

“The Rating of Chessplayers, Past and Present (Second Edition)”, Elo 1978

The Rating of Chessplayers, Past and Present (Second Edition)

“Inconsistencies in a Schedule of Paired Comparisons”

Inconsistencies in a Schedule of Paired Comparisons

“Metacritic Has A (File-Drawer) Problem”

Metacritic Has A (File-Drawer) Problem

Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.

Beginning with the newest annotation, it uses the embedding of each annotation to attempt to create a list of nearest-neighbor annotations, creating a progression of topics. For more details, see the link.