Publications
While our publications are all listed here, they are easier to browse on our research page.
Conclusion and Bibliography for “Understanding the diffusion of large language models”
This is the ninth and final post in the “Understanding the diffusion of large language models” sequence, which presented key findings from case studies on the diffusion of eight language models that are similar to GPT-3. This post provides a conclusion, highlighting key findings from the research, along with a bibliography.
Questions for further investigation of AI diffusion
This is the eighth post in the “Understanding the diffusion of large language models” sequence. In this post, Ben Cottier lists questions about AI diffusion that, at the time of writing, he considers worthy of further research.
Implications of large language model diffusion for AI governance
This is the seventh post in the “Understanding the diffusion of large language models” sequence. While the sequence is primarily descriptive, this post explores how to beneficially shape AI diffusion, and what the project’s findings mean for the governance of transformative AI (TAI).
Publication decisions for large language models, and their impacts
This is the sixth post in the “Understanding the diffusion of large language models” sequence. In this piece, the researcher provides an overview of the information and artifacts that have been published for the GPT-3-like models studied in this project, estimates some of the impacts of these publication decisions, assesses the rationales for these decisions, and makes predictions about how decisions and norms will change in the future.
Drivers of large language model diffusion: incremental research, publicity, and cascades
This is the fifth post in the “Understanding the diffusion of large language models” sequence. This piece describes the most important factors for GPT-3-like model diffusion.
The replication and emulation of GPT-3
This is the fourth post in the “Understanding the diffusion of large language models” sequence. This piece explores what was required for various actors to produce a GPT-3-like model from scratch, and when various GPT-3-like models were developed. A timeline of selected GPT-3-like models, with notes on their significance, traces the development of such models (or attempts at producing them) since GPT-3’s release.
GPT-3-like models are now much easier to access and deploy than to develop
This is the third post in the “Understanding the diffusion of large language models” sequence. This piece describes some GPT-3-like models that are widely available for download and what resources are required to actually use them.
Background for “Understanding the diffusion of large language models”
This is the second post in the “Understanding the diffusion of large language models” sequence. This piece provides background, including definitions of relevant terms, the inputs to AI development, the relevance of AI diffusion, and other information to contextualize the remainder of the sequence.
Understanding the diffusion of large language models: summary
How might transformative AI technology (or the means of producing it) spread among companies, states, institutions, and even individuals? What impact might that spread have, and how can we minimize the resulting risks?
This is the first post in the “Understanding the diffusion of large language models” sequence, which introduces and summarizes the research project.