Discover real AI creators shaping the future. Track their latest blogs, X posts, YouTube videos, WeChat Official Account posts, and GitHub commits — all in one place.
We benchmarked Mistral OCR against other frontier and open-weight models on ParseBench 📊 For a model at its price point, it is quite competitive!
- It wins on semantic formatting: understanding strikethroughs, superscripts/subscripts, title hierarchy, links
- It is competitive on content faithfulness (reading order + hallucinations + omissions) and visual grounding (bounding boxes)
- It does ok on tables and doesn't really have chart capabilities.
Of course, some of the frontier models + OCR providers like Azure Doc Intelligence + AWS Textract are a bit more expensive. Check out our full leaderboard on ParseBench: https://www.parsebench.ai/
The data of tokens and uptime recovered by @vercel AI Gateway is truly astonishing
At @goodtacoai we switched from using the Anthropic API to using @vercel AI Gateway for better reliability. Super cool to see this data displayed on the dashboard so we don't have to do anything to figure it out ourselves.
Bitrobot casually dropping the largest humanoid teleop dataset ever collected in real homes HIW-500: Humanoids-in-the-Wild 500 hours check it out here => https://huggingface.co/datasets/BitRobot/HIW-500
1/ Introducing HIW-500 (Humanoids-in-the-Wild 500): the largest open-source humanoid teleop dataset collected in real homes
Built w/ @UnitreeRobotics @huggingface across 12 homes in Southeast Asia, it covers:
> 500+ hrs
> 23K+ episodes
> 10+ TB
> 10+ household tasks
RT Victor E. Nunez tony stark is not texting jarvis. voice lets you give agents more context, faster. the messy stuff is actually the point. i wrote about how we use it with codex today and where this is all going. talk to your computer. be shameless about it. i’ll see you in a few months 🎙️ https://x.com/nunezvice/status/2069813341367808139?s=20
i want to see more like these!!
RT Joyce Zhang One of my big dreams when I first started out was to teach engineers to talk about their feelings at massive scale at Dreamforce, AWS re:Invent, etc I was blown away when @swyx asked me to host an event at @aiDotEngineer this year. Get a free pass: http://bit.ly/connect-ai-2026
very proud to host this session opening night! if you work in tech but have never heard of Touchy Feely, you are exactly the kind of person that needs to see this. You have no idea what a shocking number of tech leaders on here are secret @LITfellows alums and i have seen first
RT Ryan Shea Introducing AI IQ Bio: the most comprehensive set of biotech benchmarks in the world ...& Bio IQ: the most comprehensive "biotech capabilities index" ever produced Benchmark sources include benchmarksdotbio, FutureHouse, SecureBio, Anthropic, OpenAI & more
RT AI Engineer Thinking about AI Engineer World's Fair? Apply to be an Associate! It's a great way to connect with fascinating people across the AI ecosystem, build new relationships, and be part of one of the most exciting AI gatherings of the year. https://www.ai.engineer/associates
happy karpathy agent day for those who celebrate
Inspired by @karpathy’s words on why you - yes YOU - should work on AI Agents