
OpenAI Research | Publication
Dec 18, 2025 · OpenAI introduces a real-world evaluation framework to measure how AI can accelerate biological research in the wet lab. Using GPT-5 to optimize a molecular cloning protocol, the work …
Research | OpenAI
Aug 7, 2025 · OpenAI’s o series models are advanced reasoning AI systems that use chain-of-thought processes to solve complex STEM problems through logical, step-by-step analysis.
OpenAI Newsroom | Research
6 days ago · Stay up to speed on the rapid advancement of AI technology and the benefits it offers to humanity.
PaperBench: Evaluating AI’s Ability to Replicate AI Research | OpenAI
Apr 2, 2025 · Agents must replicate 20 ICML 2024 Spotlight and Oral papers from scratch, including understanding paper contributions, developing a codebase, and successfully executing experiments.
Why language models hallucinate - OpenAI
Sep 5, 2025 · Our new research paper argues that language models hallucinate because standard training and evaluation procedures reward guessing over acknowledging uncertainty.
Introducing Prism - OpenAI
Jan 27, 2026 · Prism is an early step toward that future. We’re excited to learn from researchers using Prism today and to continue building toward tools that help science move faster—together. Try Prism …
To demonstrate that our methodology can scale reliably, we train a 16-million-latent autoencoder on GPT-4 [OpenAI, 2023] residual stream activations. Because improving reconstruction and sparsity is …
Prism | A free, LaTeX-native workspace for scientists | OpenAI
AI that understands your paper. Project-aware AI works across the full context of your manuscript, including past drafts and revisions. It helps clarify ideas, check reasoning, and refine …
CLIP: Connecting text and images | OpenAI
Jan 5, 2021 · We further explore challenges that CLIP poses in our paper and we hope that this work motivates future research on the characterization of the capabilities, shortcomings, and biases of …
We instead use the term in a broader sense and study generalization to unseen datasets. We motivate this as a proxy for performing unseen tasks, as aspired to in the zero-data learning paper of …