Reading Notes | A Pretrainer’s Guide to Training Data – Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

[Semantic Scholar] – [Code] – [Tweet] – [Video] – [Website] – [Slide]

Change Logs:

  • 2023-10-03: First draft.