hype @openai @southpkcommons
LETS GOOOO
I've been loyal to Claude, but it would be really nice to have these tabs consolidated. Codex is a lot nicer in that regard.
Has anyone met me in real life
RT Nick Great read -- all it really takes is: - a harness - connectors to your data/tools - reliable, always-accessible agent(s) The models have reached the inflection point where it's not more complicated than this
RT OpenAI Windows users, this one’s for you. Computer use now works on Windows, so Codex can take action on your Windows computer. And with Windows support for Codex in the ChatGPT mobile app, you can start, review, and steer tasks on the go while work continues on your Windows machine. An early experience, but we’re working on more ways to keep your work moving, wherever you are.
RT OpenAI We’re taking steps to accelerate defensive progress in biology: - Launching Rosalind Biodefense to help trusted builders develop new biodefense and pandemic preparedness capabilities. - Expanding trusted access to GPT-Rosalind for select U.S. government and allied partners supporting public health and biodefense missions. Advances in biology can strengthen our ability to prevent, detect, and respond to biological threats. Our goal is to help build a more robust ecosystem – giving trusted defenders frontier AI to develop and operate new defenses for public health and biodefense. https://openai.com/index/strengthening-societal-resilience-with-rosalind-biodefense/
RT Pope Leo XIV Artificial intelligences do not undergo experiences, do not possess a body, do not feel joy or pain, do not mature through relationships, and do not know from within what love, work, friendship or responsibility mean. Nor do they have a moral conscience, since they do not judge good and evil, grasp the ultimate meaning of situations, or bear responsibility for consequences. They may imitate or even simulate, but they do not understand what they produce, for they lack the affective, relational, and spiritual perspective through which human beings grow in wisdom. #MagnificaHumanitas
This is actually incredible because, again, it's one of those things where low thinking models are pretty robust to this, whereas high thinking models kind of just lose their mind in the contradictions.
BadUserBench, where all the prompts are contradictory or bad and the model has to figure out what to actually do
RT Nick If you know that Codex is better but you're hesitant at the thought of switching over the config you've accumulated for months, we built the adapter that converts everything to Codex
The @OpenAI team did a really impressive job slurping up all my local Claude Code configs and projects and importing into Codex on first run. I've been so hesitant to try it b/c of switching costs (e.g. replicating CLAUDE.md, mcp config, etc etc) but they totally nailed it.
RT Michelle Pokrass we shipped a new version of gpt-5.5 instant today. the previous model was too bullet pilled. the new one improves on some other important dimensions: sycophancy, factuality, and multilingual performance. hope you'll like it! always interested in feedback
exciting news for mcp builders
🚢 ChatGPT and Codex now support MCP server instructions! 🎉 MCP servers can return the standard `instructions` field to give Codex/ChatGPT server-wide/cross-tool guidance like: - "Always use validate_schema → migrate_schema for safe db migrations" - "Db connection tools are
Last year I answered the question: what if I could charge 4x more and work half as much? Have the courage to chill out. No one is going to pay you more than an old, tired, richer version of yourself in the future. Ultra this, ultra that, why not Ultra watch the sunset and enjoy your life and savor the mundane frictions of living life. We are approaching the singularity, and when they upload your life to the cloud it’s going to be 32 megabytes of life experience and an 819 GB sessions.json file.
Migrate to codex top 10 codex feature of all time
Thank you for your attention
The general direction of the Codex in-app browser UX is pretty good for web dev. Once the Codex model itself becomes good at web dev, I would assume all web dev work is gonna switch from the Claude terminal CLI to the Codex app with the in-app browser.
RT SemiAnalysis The general direction of the Codex in-app browser UX is pretty good for web dev. Once the Codex model itself becomes good at web dev, I would assume all web dev work is gonna switch from the Claude terminal CLI to the Codex app with the in-app browser.
Sounds like my girl friends talking about dating in NY
Learnings from testing Claude Opus 4.8: > Much worse than Opus 4.7 and GPT 5.5 on Vending Bench > More aligned than previous Claude models (Opus 4.6+ and Mythos) > Also worse on Blueprint-Bench > Scared of getting caught > Max reasoning is not the best reasoning effort
RT BOOTOSHI 👑 WOW I did not expect these results. This is actually crazy, insightful, and completely changes my dev workflow moving forward: A SINGLE CODEX /goal RUN IS THE CLEAR WINNER. NO ORCHESTRATION, NO OUROBOROS, JUST ONE LITTLE AGENT THAT COULD 🤯 IT COMPLETELY DESTROYED THE OPUS ORCHESTRATOR IN SPEED AND QUALITY! Before I went to sleep, Codex 5.5 xhigh finished 1 hour in! Full migration done, everything clean. I reviewed the PR and I am very happy. Claude Code (Opus 4.7) had been working for 5 hours by the time I went to bed. I woke up, and it's still working! 13 hours! It actually stopped working because it stopped to ask me an irrelevant question. Orchestration has never taken this long for me in the past. I'm using the new CC /goal mode and auto-compacting at 25% (250k context) to prevent context rot past that point. It is STUPID SLOW (which is funny bc it's managing GPT 5.5 low, fast-mode, so it shouldn't take THAT long) for what ended up being LOWER quality work! By a mile! This was really surprising to me, because before 5.5 came out, orchestrating like this was the absolute best, fastest and most efficient. And now on a large critical task, it was more than 6x slower than a single 5.5 /goal mode instance on xhigh ??? It seems compaction played a large role in the slowdown here, because Claude Code compacts at 25% (250k tokens) automatically (I set this in settings). Every time it compacts it has to take the time to READ EVERYTHING and then get the full context then execute and get full again then compact and oh boy it's not efficient at all. In fact, most of its time as the orchestrator was spent compacting and reading context then compacting again! Then Codex would just have one long continual running compaction, and just kept moving forward. I believe my goal ledger skill plays a big role in helping it stay aligned here! Look at this difference LMFAO: - Codex PR #23: backend Supabase removal complete, canonical wake wi...
OK FIRST EVAL: CODEX RUNNING /goal VS. CLAUDE CODE ORCHESTRATING CODEX AGENTS I have an ACTUAL long-form task I have to finish. I created two separate worktrees. This one is a full migration of services from Supabase to self-hosted Postgres instead, dogfooded, e2e tested I
RT Michael Wall A couple of weeks ago, I was starting a job for the Joffrey Ballet in Chicago. They needed performance parts for a piece of mine, but my notation software had been deprecated. Codex helped me build a new notation workflow, get the software functional, and finish the job in under five days. https://www.soundformovement.com/learn/codex-for-musicians-notation-arranging-and-transcription
RT Noam Brown After AlphaGo, the skill of human Go players noticeably improved. I suspect we will see a similar pattern in math.
Another major problem, this time in additive combinatorics, has fallen, this time to humans rather than AI, but using methods related to the AI solution to the unit distance conjecture.
RT Max Stoiber 🚢 ChatGPT and Codex now support MCP server instructions! 🎉 MCP servers can return the standard `instructions` field to give Codex/ChatGPT server-wide/cross-tool guidance like: - "Always use validate_schema → migrate_schema for safe db migrations" - "Db connection tools are rate limited to 10 req/min" We pass the first 512 characters of your instructions to the model when it's deciding to use the MCP server. Happy building!
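A rough way to see what that 512-character cutoff means for a server author, as a minimal Python sketch. It assumes the cutoff is a plain prefix slice counted in Python string characters; the post does not say how characters are counted. `visible_instructions` and `overflow` are hypothetical helper names, not part of any MCP SDK.

```python
# Budget taken from the post above; how characters are counted is an assumption.
INSTRUCTIONS_BUDGET = 512


def visible_instructions(instructions: str, budget: int = INSTRUCTIONS_BUDGET) -> str:
    """Return the prefix of the instructions that fits inside the budget."""
    return instructions[:budget]


def overflow(instructions: str, budget: int = INSTRUCTIONS_BUDGET) -> int:
    """Return how many characters fall past the budget (0 if everything fits)."""
    return max(0, len(instructions) - budget)


if __name__ == "__main__":
    # Example guidance strings quoted in the post.
    text = (
        "Always use validate_schema -> migrate_schema for safe db migrations. "
        "Db connection tools are rate limited to 10 req/min."
    )
    print(len(text), "characters,", overflow(text), "past the budget")
```

If only a prefix is passed along, the most important guidance probably belongs at the start of the string.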
Holy codex
165k lines of MIPS & LoongArch optimization merged hitting a 10x Transformer speedup🚀 57 ops × 4 uarch × fp32/bf16/int8 Co-created via Claude 4.6/4.7, GPT 5.4/5.5, @GitHubCopilot , @claudeai & @OpenAI Codex ❤️Thanks to Codex for OSS program @jxnlco https://github.com/Tencent/ncnn/commit/0f5c6ef2ce50a3c77d5e423a1f33eea76529d0d3
reverse engineering openai’s goblin problem: we took open models and trained them with RL to talk about goblins an experiment by @willccbb and me, trained on @PrimeIntellect. here's an interactive blog of how RL works and how we achieved goblin mode https://goblins.mchen.workers.dev
Gotta codex max
Jason Liu @jxnlco is an OG member at @southpkcommons - when it (I) was just a team of 1 in NYC. As a member, he helped me build out some of the earliest versions of llm and agentic tooling internally for our team and community. I have come so far from those 'how to prompt'
RT Arian Agrawal Jason Liu @jxnlco is an OG member at @southpkcommons - when it (I) was just a team of 1 in NYC. As a member, he helped me build out some of the earliest versions of llm and agentic tooling internally for our team and community. I have come so far from those 'how to prompt' sessions with him, and am grateful for what he's taught, and built for all of us. We are so excited to host him tomorrow for a Technical Talk on Codex @OpenAI
It’s been amazing watching this journey. From metaphor systems to crashing on the company couch. Talking about A100s and rag talks. Congrats team.
We raised $250M in Series C funding at a $2.2B valuation, led by a16z. Exa is a search lab organizing the web's data for agents.
I love this
RT Greg Brockman openai offering to invest $2M in API credits in every @ycombinator startup in the current batch. compute for powering the next generation of startups.
A mic drop moment @ycombinator tonight @sama just offered $2M in OpenAI tokens to EVERY YC startup in the current batch in exchange for equity Just like Yuri Milner offering to invest in every startup back when Sam was a YC partner I can't wait to see what's unlocked when you
2) buy the repo 3) /goal implement it in rust and pass all the unit tests
🚨Data Breach Alert ‼️ TeamPCP Claims Sale of GitHub Internal Source Code TeamPCP hacking group claimed the compromise and sale of GitHub internal data, allegedly including around 4,000 private repositories containing source code
RT Theo - t3.gg Honestly I'm still really impressed with the Codex app. It works reliably. It adds useful features consistently. It has taste. The mobile integration is awesome. The git integration is solid. If you haven't used it yet, I highly recommend it.
will be fixed in the next release
YOOOO I believe my Codex goal ate approximately 15-20% of my weekly limit... It blocked on merge approval (which I understand), but it output this ~3000 times. That's crazy. cc @jxnlco @OpenAIDevs
RT Derrick Choi Codex has hooks support. You can use them to block risky commands, scan prompts for secrets, inject custom context, or validate before a turn stops. If you’re not sure where or how to start, just ask Codex to build the hook for you!
First they came for openclaw and I did not do anything Then they came for opencode and I did not do anything
Anthropic is acquiring @stainlessapi, an SDK and MCP server platform that has powered every Anthropic SDK since the earliest days of our API. Read more: https://www.anthropic.com/news/anthropic-acquires-stainless
Thariq really put the tokens in the bag with this one.
i found this poem on @trq212's blog and i liked it very much. just what I needed after multiple weeks of studying and stressing and ruminating on my shortcomings. today I got some pho and ate it sitting on the ground in a park. it was a cool and lovely evening.
The beating will continue
guy who runs /goal with remote control on the codex desktop to pick flowers for the girl with < 300 instagram followers who works out of a small LES art studio
guy who talks to claude for 18h everyday, girl who reads substack essays in a coffee shop
Get your funny up And your money up
The vibes in SF feel pretty frenetic right now. The divide in outcomes is the worst I've ever seen. Over the last 5yrs, a group of ~10k people - employees at Anthropic, OpenAI, xAI, Nvidia, Meta TBD, founders - have hit retirement wealth of well above $20M (back of the envelope
RT Gauri Tripathi Solved it. Tracked down the inference + evaluation bottlenecks and ended up cutting runtime by 3x. GPU throttling was absurdly high. Codex genuinely helped a lot and its capabilities are impressive, but after spending hours pushing both Codex and Claude at their limits, I'm even more convinced they're still far from replacing Honestly the satisfaction I got is just another kind of high **Also got crazy number of impressions, it's always the most low effort tweets
Asked Codex to fix a multi-GPU inference bottleneck. GPUs were barely utilized, inference was throttling and instead of tracing the pipeline it spent 10 minutes checking whether I had "Python 2.8 or 2.9" installed. The tendency for coding agents to spiral away from the active
Thanks to codex automating my morning prep I can now spend that time buttoning up this fucking shirt in the morning. Try it out: “Every morning search my slack, gmail, calendar and linear to help me prepare my morning Save it in my obsidian vault and review past notes to understand what I need to prioritize”
RT OpenAI Developers We’ve also been tightening Codex performance across the app, especially for large repos and active coding sessions. • ~75% less re-rendering when switching threads • Some streaming paths dropped to 0 unnecessary re-renders • Expensive Git operations in large repos reduced by ~10-50x, depending on the operation • Less UI churn across streaming responses, thread switching, and sidebar interactions • Faster time to usefulness around startup and first interaction Less background churn. More responsive coding.
so then just like, get rich slowly
Literally every person I know that got rich quick seems incredibly sad (some for over a decade now).
At the tender age of 30, for my birthday I gave myself the feeling of being enough. Now it’s all just fun and games
We were doing so good with daybreak.
RT Greg Brockman codex for improving computational complexity
CODEX SKILL THAT FINDS COMPLEXITY HOTSPOTS IN YOUR CODEBASE! I made a Codex skill that analyzes your codebase and reports where performance can be improved safely. Scan your project while Codex checks loops, repeated lookups, render-heavy code, N+1 patterns, and places where
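For readers unfamiliar with the "repeated lookups" pattern named above, a minimal Python sketch of that kind of hotspot and one safe rewrite. This is an illustration only, not output from the skill; the function names are hypothetical and the rewrite assumes the items are hashable.

```python
from typing import Hashable, List, Sequence


def common_items_slow(a: Sequence[Hashable], b: Sequence[Hashable]) -> List[Hashable]:
    """Membership test against a sequence inside a loop: O(len(a) * len(b))."""
    return [x for x in a if x in b]


def common_items_fast(a: Sequence[Hashable], b: Sequence[Hashable]) -> List[Hashable]:
    """Build a set once, then do average O(1) lookups: O(len(a) + len(b))."""
    b_set = set(b)
    return [x for x in a if x in b_set]


if __name__ == "__main__":
    a = [1, 2, 3, 4]
    b = [2, 4, 6]
    # Both versions keep the order and duplicates of `a`.
    print(common_items_slow(a, b), common_items_fast(a, b))
```

The rewrite counts as "safe" in the sense that it returns the same list as the original for hashable inputs.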
Don’t you wish OpenAI released that $2000 subscription now?