Kevin Weil
Bio
VP Science @OpenAI, BoD @Cisco @nature_org, LTC @USArmyReserve Ex: Pres @Planet, Head of Product @Instagram @Twitter ❤️ @elizabeth ultramarathons kids cats math
Platform
Content history
GPT 5.2 ran uninterrupted for *one week* and wrote *3 million* lines of code. The future is going to be awesome
We built a browser with GPT-5.2 in Cursor. It ran uninterrupted for one week. It's 3M+ lines of code across thousands of files. The rendering engine is from-scratch in Rust with HTML parsing, CSS cascade, layout, text shaping, paint, and a custom JS VM. It *kind of* works! It
Wow, the third Erdos problem solved via GPT 5.2 this week!
Weekend win: The proof I submitted for Erdos Problem #397 was accepted by Terence Tao. The proof was generated by GPT 5.2 Pro and formalized with Harmonic. Many open problems are sitting there, waiting for someone to prompt ChatGPT to solve them:
2026 is going to be an exciting year for science! AI will be a meaningful accelerant and it couldn't come at a better time.
Terence Tao confirms: For the first time, an LLM (GPT-5.2 pro) has successfully solved an Erdos problem on its own. This makes me really excited for GPT-5.3 pro. Science is gaining momentum, and the breakthroughs are becoming more significant.
RT dave kasten Fascinating: Maryland becomes first state govt to try to ship a textfile to help LLMs navigate government services (llms.txt) Original tweet: https://x.com/David_Kasten/status/2009305237949931550
RT Fidji Simo The launch of ChatGPT Health is really personal for me. I know how hard it can be to navigate the healthcare system (even with great care). AI can help patients and doctors with some of the biggest issues. More here: https://fidjisimo.substack.com/p/chatgpt-health Original tweet: https://x.com/fidjissimo/status/2008978500557131893
Super cool! GPT-5.2 🤝 @HarmonicMath to solve and formalize Erdos #728.
GPT-5.2 has successfully and fully autonomously resolved* Erdős problem #728 prior to any human previously: https://www.erdosproblems.com/forum/thread/728 *WITH THREE IMPORTANT CAVEATS 🧵👇: 1) The original problem statement is quite ambiguous. The model solved an interpretation of the problem (1/4)
Great advice for 2026 that would have made zero sense in 2025
remember to tackle your todo list in order of urgency: - reply to friends - kick off gpt-5.2-codex-xhigh tasks - brush teeth, make breakfast, etc.
RT Greg Brockman two big themes of AI in 2026 will be enterprise agent adoption and scientific acceleration Original tweet: https://x.com/gdb/status/2006584251521839141
Congrats to the @OpenAI research team—GPT 5.2 is an incredible model!
Nice way to end the year, see you in 2026 for more! (Also good to remember that 6 months ago the models were at 4% on Frontier Math Tier 4...)
This is wild and impressive... and will seem commonplace in 12-24 months.
Codex CLI wrapped 100 billion tokens. That was my usage in just 39 days on one of the laptops where I run OpenAI’s Codex CLI (currently GPT-5.2 Codex xhigh). I have three OpenAI Pro accounts (US$200 each). If I consider two months of subscription, that’s about 6% of what I would
High school student uses AI to discover 1M+ objects humans missed in astronomical data. Head of NASA openly recruiting him through Twitter with a fighter jet ride included. All my worlds colliding. I love everything about this.
@MAstronomers Matteo please apply to work at NASA and I will personally throw in a fighter jet ride as a signing bonus
RT Aaron Levie http://x.com/i/article/2004648738762227713 Original tweet: https://x.com/levie/status/2004654686629163154
RT Poetiq We finally had a moment to run our system with GPT-5.2 X-High on ARC-AGI-2! Using the same Poetiq harness as before, we saw results as high as 75% at under $8 / problem using GPT-5.2 X-High on the full PUBLIC-EVAL dataset. This beats the previous SOTA by ~15 percentage points. Original tweet: https://x.com/poetiq_ai/status/2003546910427361402
This is great @ivanhzhao!
RT Alex Predhome, Track and Field Enjoyer Time for a Christmas classic Original tweet: https://x.com/Predamame/status/2002908392558678344
RT roon the primary criticism of AI you hear has nothing to do with water use or existential risk whatsoever: most people just think it’s fake and doesn’t work and is a tremendous bubble eating intellectual property while emitting useless slop along the way. when GPT-5 came out and perhaps didn’t live up to what people were expecting for a full version bump, the timeline reaction was not mild, it was a full-scale meltdown. there are many intelligent (and unintelligent) people who latched onto this moment to declare AI scaling over, thousands of viral tweets, still a prevailing view in many circles. The financial-cultural phenomenon of machine intelligence is one of the most powerful in decades, and there are a lot of people who would like for its position to be weakened, many outright celebrating its losses and setback. Michael burry of ‘Big Short’ fame, unfortunately the type of guy to predict 12 of the last 3 recessions, has bet himself into insolvency on the AI bubble’s collapse one of the stranger things about this time is that there are very few secrets, and very little reason to be so misinformed. model labs have very little space in between creating new capabilities and launching them to the public. The view among the well informed public and not just “lab insiders” is that machine intelligence is absurdly joyfully smart at so many new things every month. It’s actively contributing on the cutting edge of programming and math and science. Sebastian Bubeck and co’s recent paper reports that GPT5-pro is capable of producing results on the frontier of theoretical physics research, Terry Tao wrote a blog about “vibe-proving” Erdos problems with the auto-formalization AI Aristotle. You can read that these scientists are using it to actively contribute to black hole physics, tighten mathematical bounds in optimization theory, churning morasses of biomedical data into real insight. 
Google Deepmind, from the way they are signalling, seems to be slowly closing a dragn...
The Genesis Mission is a brilliant set of ideas. Very excited to deepen @OpenAI's partnership with the DoE and the National Labs in the name of AI and national security 🇺🇸
OpenAI and the U.S. Department of Energy are expanding their collaboration on AI and advanced computing in support of national scientific priorities. The agreement builds on our work with DOE’s national labs and advances the Genesis Mission to accelerate scientific discovery.
This is super exciting 👀
This is wild! Johannes Schmitt used GPT5 to solve his own open problem on intersection numbers on moduli spaces of curves (the proof turns out to be unexpectedly simple, "low hanging fruit"). He wrote up the paper, being careful to point out which *entire paragraphs* were written
💥 The new ChatGPT imagegen is here! Plus there's a super fun Images section in the ChatGPT app now too. Built-in prompts make it easy to gen great images. Try it and share what you make below!
Science 🤝 GPT-5. Our new FrontierScience benchmark will be a valuable way to measure the performance of AI models on hard chemistry, biology, physics, and more. Plus, GPT-5 operating in a wet lab environment suggested experiments to increase a molecular cloning protocol's efficiency by 79x. Great thread below 👇
Accelerating scientific progress is one of the most impactful ways AI can benefit society. Models can already help researchers reason through hard problems — but doing this well means testing models on tougher evaluations and in real scientific workflows grounded in experiments.
RT Daniel Litt OK, I think GPT 5.2 Pro is actually a step change in usefulness for my applications (algebraic geometry/number theory research). Original tweet: https://x.com/littmath/status/2000636724574302478
RT Sam Altman GPT-5.2 exceeded a trillion tokens in the API on its first day of availability and is growing fast! Original tweet: https://x.com/sama/status/1999624463013544024
Love this from @Opendoor 🇺🇸
Very proud to introduce the @Opendoor Hero’s Home Credit. $4k off closing costs when you buy an Opendoor home using Opendoor Checkout. Exclusively for active-duty military and veterans. https://www.opendoor.com/heroes Live now in Texas. AZ by end of year, and available everywhere
RT Sebastien Bubeck btw the paper described in this thread is on arxiv already: https://arxiv.org/abs/2512.10220 ! Original tweet: https://x.com/SebastienBubeck/status/1999540978676355478
This problem fits in a broader context of understanding THE SHAPE OF LEARNING CURVES. The most basic property of such shapes is that hopefully ... they are decreasing! Specifically from the statistical perspective, assume that you add more data, can you prove that your test loss
RT Marc Andreessen 🇺🇸 It’s time to build. Original tweet: https://x.com/pmarca/status/1997109742200934620
The new National Security Strategy of the United States: "As Alexander Hamilton argued in our republic’s earliest days, the United States must never be dependent on any outside power for core components—from raw materials to parts to finished products— necessary to the nation’s
Want a masterclass in using ChatGPT? Read this account. I work here and helped build these products, and it still blew my mind.
RT Brad Gerstner 🇺🇸🚀 Original tweet: https://x.com/altcap/status/1995860986407096568
Amazing bi-partisan, joint letter from @tedcruz & @CoryBooker calling on all business leaders to contribute to the 50+ million kids accounts set up over next 6 months under the Invest America Act (aka Trump Accounts). 🇺🇸🚀
💯. We want more experienced industry leaders like @DavidSacks in government, especially in critical and highly technical areas like AI.
David Sacks @DavidSacks is a throwback to the era of American greatness in which the most capable private sector citizens selflessly volunteered for government service in moments of peril for a dollar a day. He is a credit to our nation, and we need more like him, not fewer. 🇺🇸
RT Elon Musk Optimus will be the Von Neumann probe Original tweet: https://x.com/elonmusk/status/1994940491054682570
Congrats @Will4Planet and the whole @planet team! Consistently amazing.
Launch success! We've made contact with all 38 satellites launched on today's mission. All healthy so far. Textbook launch. 🌍💫 This pic from the ride up :) Thanks to @SpaceX team for the smooth ride! Thank you @Planet team for building the sats and our missions teams for
One of the cool things about working on AI and raising kids is seeing all the parallels. I watch them learn facts about the world, and then learn to apply them with increasing layers of abstraction. Understanding humor is often a great eval. My 11 yo finally got this joke 🤣
Human: we have a color named after you! Salmon: really? is it silvery blue like my outsides? Human: no, uh– Salmon: wait why is it pink? Human: ... Salmon: WHY IS IT PINK
👀 Noam Brown: Today we at @OpenAI are releasing GPT-5.1-Codex-Max, which can work autonomously for more than a day over millions of tokens. Pretraining hasn't hit a wall, and neither has test-time compute. Congrats to my teammates @kevinleestone & @mikegmalek for helping to make it possible! Link: https://x.com/polynoamial/status/1991212955250327768
RT Andrej Karpathy I am unreasonably excited about self-driving. It will be the first technology in many decades to visibly terraform outdoor physical spaces and way of life. Less parked cars. Less parking lots. Much greater safety for people in and out of cars. Less noise pollution. More space reclaimed for humans. Human brain cycles and attention capital freed up from “lane following” to other pursuits. Cheaper, faster, programmable delivery of physical items and goods. It won’t happen overnight but there will be the era before and the era after.
RT Crémieux Paul Erdős won a bet by proving he wasn't addicted to amphetamines. But he got nothing done: "You've showed me I'm not an addict... You've set mathematics back a month."
RT Sebastien Bubeck Looks like a pretty long CoT Fermat's Library: Andrew Wiles on the morning he discovered how to fix his proof of Fermat's Last Theorem Link: https://x.com/fermatslibrary/status/1986131120623145125
Just imagine where we'll be a year from now... Ethan Mollick: A year ago, I would not have expected the first academic field to seem to reach a consensus that AIs will accelerate research (which is not the same thing as autonomous research) would be math But that appears to be happening based on math professors in my feed and elsewhere. Link: https://x.com/emollick/status/1984388281061282081
Would love to hear more details if you're comfortable sharing, @wtgowers! Timothy Gowers @wtgowers: I crossed an interesting threshold yesterday, which I think many other mathematicians have been crossing recently as well. In the middle of trying to prove a result, I identified a statement that looked true and that would, if true, be useful to me. 1/3 Link: https://x.com/wtgowers/status/1984340182351634571
100% agree with @ErnestRyu here—this is an exciting time in mathematics, and in science more broadly. Ernest Ryu: I firmly believe we are at a watershed moment in the history of mathematics. In the coming years, using LLMs for math research will become mainstream, and so will Lean formalization, made easier by LLMs. (1/4) Link: https://x.com/ErnestRyu/status/1984033423586160889
RT Derya Unutmaz, MD Here is the story of a remarkable, independent treatment suggestion by GPT-5 Pro: repurposing a known drug for a patient with food protein–induced enterocolitis syndrome (FPIES). First, how we came to test this. My close friend, physician-scientist Dr. Oral Alpan, treated the patient described in the published report (link in the thread) for refractory allergic skin disease with the biologic dupilumab, which is approved for that indication. The patient also had FPIES, a food allergy that in his case was triggered by wheat and caused hours of cramping and watery, sometimes bloody, diarrhea. There is no approved treatment for FPIES; patients are advised to avoid trigger foods and prepare for emergencies if accidental exposure occurs. For twenty years, even a small amount of wheat set off the same cascade a few hours later, followed by days of recovery. After starting dupilumab for his skin condition, the patient traveled to France and accidentally ate a baguette. To his surprise, nothing happened! It was his first uneventful wheat exposure in two decades! On return, Dr. Alpan hypothesized that dupilumab might be responsible and supervised an oral food challenge approaching 50 grams of wheat protein. Again, no reaction! When an insurance interruption forced a pause in medication, the old symptoms returned; restarting it restored tolerance. In the peer-reviewed paper published today, Dr. Alpan and his team describe seven additional patients, ages 2 to 58, who responded to dupilumab for their FPIES condition. While this is not definitive proof and represents an observational case series, the findings suggest that dupilumab could be a potential treatment for FPIES. Dr. Alpan intends to contact Regeneron, the manufacturer, to pursue clinical studies hopefully toward FDA approval for this disease. About two months ago, as he had just submitted the paper on this case, Dr. Alpan told me this story. Until today’s publication there were no reports in t...