Ben Cottier
The replication and emulation of GPT-3
This is the fourth post in the “Understanding the diffusion of large language models” sequence. This piece explores what was required for various actors to produce a GPT-3-like model from scratch, and when various GPT-3-like models were developed. It also includes a timeline of selected GPT-3-like models and their significance, covering the development of such models (or attempts at producing them) since GPT-3’s release.
This post is one part of the sequence “Understanding the diffusion of large language models.” As context for this post, I strongly recommend reading at least the 5-minute summary…
Understanding the diffusion of large language models: summary
How might transformative AI technology (or the means of producing it) spread among companies, states, institutions, and even individuals? What might the impact of that be, and how can we minimize risks in light of that?
This is the first post in the “Understanding the diffusion of large language models” sequence, which introduces and summarizes the research project.
Background for “Understanding the diffusion of large language models”
This is the second post in the “Understanding the diffusion of large language models” sequence. This piece provides background, including definitions of relevant terms, the inputs to AI development, the relevance of AI diffusion, and other information to contextualize the remainder of the sequence.
Implications of large language model diffusion for AI governance
This is the seventh post in the “Understanding the diffusion of large language models” sequence. While the sequence is primarily descriptive, this post explores how to beneficially shape AI diffusion, and what the project’s findings mean for the governance of transformative AI (TAI).
GPT-3-like models are now much easier to access and deploy than to develop
This is the third post in the “Understanding the diffusion of large language models” sequence. This piece describes some GPT-3-like models that are widely available for download and what resources are required to actually use them.
Questions for further investigation of AI diffusion
This is the eighth post in the “Understanding the diffusion of large language models” sequence. In this post, Ben Cottier lists questions about AI diffusion that, at the time of writing, he thought merited further research.
Drivers of large language model diffusion: incremental research, publicity, and cascades
This is the fifth post in the “Understanding the diffusion of large language models” sequence. This piece describes the most important factors for GPT-3-like model diffusion.
Conclusion and Bibliography for “Understanding the diffusion of large language models”
This is the ninth and final post in the “Understanding the diffusion of large language models” sequence, which presented key findings from case studies on the diffusion of eight language models that are similar to GPT-3. This post provides a conclusion, highlighting key findings from the research, along with a bibliography.