Summary
Ted Xiao on Twitter: "The golden days of internet-scale models achieving unprecedented zero-shot results seem to be waning. The new Big Thing is subsequent fine tuning with humans increasingly out of the loop. How does this work? Let’s explore *Prior Amplification* 🔎 (1/N) https://t.co/5JWFik2hgX" (twitter.com)
521 words - html page
One Line
Ted Xiao's Twitter thread argues that zero-shot results from internet-scale models are plateauing and frames Prior Amplification as the successor: fine-tuning a large base model with learned priors, exemplified by GPT-3's RLHF, which fine-tunes against a human preference model learned from "good" dialogue.
Key Points
- Ted Xiao's Twitter thread proposes Prior Amplification as the pattern replacing pure reliance on zero-shot results from internet-scale models: train a base model on large amounts of unstructured data, then learn "good" priors from engineered assumptions or small, high-quality datasets (see the sketch after this list)
- Fine-tuning of GPT-3 has evolved from supervised training on human-curated dialogue to RLHF, in which a human preference model learned from "good" dialogue guides fine-tuning with humans out of the loop
- Xiao argues this recipe, exemplified by Prior Amplification and GPT-3's RLHF, is the new Big Thing in the AI field
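A minimal sketch of the Prior Amplification loop described in the points above, assuming the learned prior acts as a filter over base-model generations; all names here (train_base_model, learn_prior, amplify) are illustrative stand-ins, not terminology from the thread:

```python
# Hypothetical sketch of the Prior Amplification loop: pretrain at scale,
# learn a prior from a small curated set, then use the prior to build a
# new dataset for fine-tuning. Toy implementations throughout.

import random

def train_base_model(unstructured_corpus):
    """Stand-in for large-scale pretraining: here, just memorize the corpus."""
    return list(unstructured_corpus)

def learn_prior(small_curated_set):
    """Learn a 'good' prior from a small, high-quality dataset.
    Here the prior is a trivial filter: keep samples sharing a keyword."""
    keywords = {word for sample in small_curated_set for word in sample.split()}
    return lambda sample: any(word in keywords for word in sample.split())

def amplify(base_model, prior, n_samples=100):
    """Apply the learned prior to base-model generations to produce a new,
    larger training set -- the 'amplification' step."""
    generations = [random.choice(base_model) for _ in range(n_samples)]
    return [g for g in generations if prior(g)]

corpus = ["the cat sat", "stocks fell today", "the dog ran", "rain expected"]
curated = ["the cat purred"]          # small, high-quality seed data

base = train_base_model(corpus)
prior = learn_prior(curated)
new_dataset = amplify(base, prior)    # prior-filtered data for fine-tuning
print(new_dataset)
```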
Summary
89 word summary
Ted Xiao's Twitter thread examines the waning returns of internet-scale models relying on zero-shot results and suggests Prior Amplification as the successor recipe, leveraging "good" priors to combine scale and alignment. The process trains a base model on large amounts of unstructured data, learns priors from engineered assumptions or small, high-quality datasets, and applies those priors to the base model's outputs to create a new dataset for further training. GPT-3 exemplifies the shift: fine-tuning has moved from human-curated dialogue to a human preference model learned from "good" dialogue.
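The preference-model step can be made concrete with a toy sketch. Assuming "good" dialogue is expressed as pairwise comparisons (a common RLHF setup, though the thread does not specify the details), a Bradley-Terry style reward model can be fit and then stand in for human labels during fine-tuning; the featurizer and training loop below are illustrative stand-ins, not OpenAI's pipeline:

```python
# Toy preference model: fit weights so that P(chosen beats rejected)
# = sigmoid(reward(chosen) - reward(rejected)), then use reward() as the
# fine-tuning signal instead of direct human labels.

import math, random

def features(text):
    # Toy featurizer: response length and a politeness marker (illustrative).
    return [len(text.split()), 1.0 if "please" in text else 0.0]

def reward(w, text):
    return sum(wi * xi for wi, xi in zip(w, features(text)))

def train_preference_model(pairs, lr=0.1, steps=200):
    """Gradient ascent on log sigmoid(reward(chosen) - reward(rejected))."""
    w = [0.0, 0.0]
    for _ in range(steps):
        chosen, rejected = random.choice(pairs)
        margin = reward(w, chosen) - reward(w, rejected)
        grad_scale = 1.0 - 1.0 / (1.0 + math.exp(-margin))  # d log sigmoid / d margin
        for i, (xc, xr) in enumerate(zip(features(chosen), features(rejected))):
            w[i] += lr * grad_scale * (xc - xr)
    return w

pairs = [("could you please help", "no"),
         ("here is a detailed answer, please ask more", "go away")]
w = train_preference_model(pairs)
# The fine-tuning stage (not shown) would optimize the dialogue model
# against reward(w, response), keeping humans out of the inner loop.
print(reward(w, "please explain further"), reward(w, "no"))
```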