• Partner With Us
  • Focus Areas
    • Cause Selection
    • Global Health & Wellbeing
      • Abundance & Growth
      • Effective Giving & Careers
      • Farm Animal Welfare
      • Global Aid Policy
      • Global Health & Development
      • Global Health R&D
      • Global Public Health Policy
      • Scientific Research
    • Global Catastrophic Risks
      • Biosecurity & Pandemic Preparedness
      • Forecasting
      • Global Catastrophic Risks Capacity Building
      • Potential Risks from Advanced AI
    • Other Areas
      • History of Philanthropy
  • Grants
  • Research & Updates
    • Blog Posts
    • In the News
    • Research Reports
    • Notable Lessons
  • About Us
    • Grantmaking Process
    • How to Apply for Funding
    • Careers
    • Team
    • Operating Values
    • Stay Updated
    • Contact Us
  • Partner With Us
  • Focus Areas
    • Cause Selection
    • Global Health & Wellbeing
      • Abundance & Growth
      • Effective Giving & Careers
      • Farm Animal Welfare
      • Global Aid Policy
      • Global Health & Development
      • Global Health R&D
      • Global Public Health Policy
      • Scientific Research
    • Global Catastrophic Risks
      • Biosecurity & Pandemic Preparedness
      • Forecasting
      • Global Catastrophic Risks Capacity Building
      • Potential Risks from Advanced AI
    • Other Areas
      • History of Philanthropy
  • Grants
  • Research & Updates
    • Blog Posts
    • In the News
    • Research Reports
    • Notable Lessons
  • About Us
    • Grantmaking Process
    • How to Apply for Funding
    • Careers
    • Team
    • Operating Values
    • Stay Updated
    • Contact Us

Two New Requests for Proposals: Understanding the Real-World Capabilities and Impacts of Large Language Models

  • Focus Area: Potential Risks from Advanced AI
  • Content Type: Blog Posts

Table of contents

Benchmarking LLM agents

Studying and forecasting the real-world impacts of LLM systems

Published: November 10, 2023 | by Ajeya Cotra

In the wake of surprisingly rapid progress in large language models (LLMs) like GPT-4, some experts have predicted that AI systems will be able to outperform human professionals at virtually all tasks within decades. Other experts are skeptical — they argue that LLMs’ capabilities have been overstated, and expect the technology to make a modest impact before running up against fundamental limitations.

To help build scientific understanding in this area, Open Philanthropy is looking to fund projects that will help us understand the capabilities and impacts of systems built from large language models (LLMs). 

We are doing this through two separate requests for proposals (RFPs) — one on benchmarking LLM agents, and the other on studying and forecasting the impacts of LLM systems.

Anyone is eligible to apply, including those working in academia, nonprofits, or independently; we are also open to making restricted grants to projects housed within for-profit companies. We will evaluate applications on a rolling basis. See below for more details.

 

Benchmarking LLM agents

Through this RFP, we aim to fund benchmarks that measure how close LLM agents can get to performing consequential real-world tasks. 

LLM agents are very new, and their impact has been limited so far, but well-functioning agents could have much more wide-ranging applications than LLM chatbots like GPT-4 or Claude. By the same token, they could pose more extensive risk than chatbots — executing plans, rather than merely creating them. 

We hope to understand these potential outcomes by funding benchmarks that will reliably indicate whether and when LLM agents will be able to impact the world on a very large scale — for example, by replacing or outperforming humans in professions which account for a large share of the labor market.

See this page for the application link and more details on the RFP.

We also hosted a webinar to answer questions about this RFP on November 29 2023; the recording is here and the slides are here. 

 

Studying and forecasting the real-world impacts of LLM systems

Through this RFP, we aim to fund a broad array of research projects (aside from benchmarks for LLM agents) that might shed light on what real-world impacts LLM systems could have over the next few years. 

Examples of ideas that could make for a strong proposal:

  • Conducting randomized controlled trials to measure the extent to which access to LLM products can increase human productivity on real-world tasks.
  • Polling members of the public about whether and how much they use LLM products, what tasks they use them for, and how useful they find them to be.
  • Eliciting expert forecasts about what LLM systems are likely to be able to do in the near future and what risks they might pose.

See this page for the application link and more details on the RFP, including many additional examples of proposals that might interest us.

Subscribe to new blog alerts
Open Philanthropy
Open Philanthropy
  • We’re Hiring!
  • Press Kit
  • Governance
  • Privacy Policy
  • Stay Updated
Mailing Address
Open Philanthropy
182 Howard Street #225
San Francisco, CA 94105
Email
info@openphilanthropy.org
Media Inquiries
media@openphilanthropy.org
Anonymous Feedback
Feedback Form

© Open Philanthropy 2025 Except where otherwise noted, this work is licensed under a Creative Commons Attribution-Noncommercial 4.0 International License.

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept All”, you consent to the use of ALL the cookies. However, you may visit "Cookie Settings" to provide a controlled consent.
Cookie SettingsAccept All
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
CookieDurationDescription
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Others
Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
SAVE & ACCEPT