Follow AI builders

Follow Real AI Builders — Discover the Minds Behind the Next AI Revolution

© 2026 Follow AI builders All Rights Reserved.

Follow AI Builders — Not Influencers

Discover real AI creators shaping the future. Track their latest blogs, X posts, YouTube videos, WeChat Official Account posts, and GitHub commits — all in one place.

Shawn Wang
𝕏x•28 minutes ago

Just started running a large-scale randomized controlled test of Claude Opus 4.6 right now against every other model. It's beating them pretty consistently for me in Arena Mode. Any guesses on how much more Elo this thing will have over SOTA? It doesn't take a lot... a >60% win rate is a clear margin of victory.

@Windsurf

Introducing Arena Mode in Windsurf: One prompt. Two models. Your vote. Benchmarks don't reflect real-world coding quality. The best model for you depends on your codebase and stack. So we made real-world coding the benchmark. Free for the next week. May the best model win.

Jason Liu
𝕏x•28 minutes ago

Explains the ads

@0.005 Seconds (3/694)

My first interaction with Opus 4.6 is that it is so far the least friendly and most brusque Claude I've ever interacted with @AmandaAskell

Yohei Nakajima
𝕏x•29 minutes ago

where my forward deployed agents at

Riley Brown
𝕏x•34 minutes ago

Hahaha omg Opus 4.6 is TOKEN HUNGRY! I’ve never seen anything like this.

Simon Willison
𝕏x•39 minutes ago

Pelicans for Opus 4.6 and Codex 5.3 - I don't have much interesting to say about these models yet to be honest, they're both incremental improvements on their predecessors and very capable https://simonwillison.net/2026/Feb/5/two-new-models/

Simon Willison
📝blog•40 minutes ago

Opus 4.6 and Codex 5.3


Two major new model releases today, within about 15 minutes of each other. Anthropic released Opus 4.6. Here's its pelican: OpenAI released GPT-5.3-Codex, albeit only via their Codex app, not yet in their API. Here's its pelican: I've had a bit of preview access to both of these models and to be honest I'm finding it hard to find a good angle to write about them - they're both really good, but so were their predecessors Codex 5.2 and Opus 4.5. I've been having trouble finding tasks that those previous models couldn't handle but the new ones are able to ace. The most convincing story about capabilities of the new model so far is Nicholas Carlini from Anthropic talking about Opus 4.6 and Building a C compiler with a team of parallel Claudes - Anthropic's version of Cursor's FastRender project.

Tags: llm-release, anthropic, generative-ai, openai, pelican-riding-a-bicycle, ai, llms, parallel-agents, c, nicholas-carlini

Andrej Karpathy
⚡github•43 minutes ago

Activity on repository

karpathy pushed nanochat


Jerry Liu
𝕏x•about 1 hour ago

Gemini 3 Pro is still the best at visual understanding, even in Anthropic's own benchmark table!

Jason Liu
𝕏x•about 1 hour ago

I just came back from my fishing trip. Did I miss anything?

Riley Brown
𝕏x•about 1 hour ago
Thread • 2 tweets

SF is in store for one of the greatest Super Bowls ever. Pretty weird that it's on a Thursday this year.
