SW

Shawn Wang

0 followers · 828 posts · 39 in the last 7 days

Bio

achieve ambition with intentionality, intensity, & integrity - @dxtipshq - @sveltesociety - @aidotengineer - @latentspacepod - @cognition + @smol_ai

Platforms

𝕏 Shawn Wang

Content history

SW
Shawn Wang
𝕏xabout 5 hours ago

btw we list job postings for our AIE sponsors and @sentry has a very good one here. see them all at http://ai.engineer/jobs

very proud and grateful of the AIE Expo this year. this is the highest concentration of coding talent NYC will have had in basically ever. team did well.

swyx🔜 @aidotEngineer CODE 🗽: whoaaaah Sentry gathering some serious js talent recently
Link: https://x.com/swyx/status/1989173675736854636

View on X
SW
Shawn Wang
𝕏xabout 6 hours ago

seeing a lot of evidence that Gemini 3 is rolling out soon... i'm not saying anything specific here but i'd definitely tune in to the @aidotengineer CODE livestream next week if I were you (https:// ai .engineer / youtube) (also for @thekitze's banger first talk at AIE!)

kitze: just tried gemini 3.0 jesus fucking christ
Link: https://x.com/thekitze/status/1989111025271206053

View on X
SW
Shawn Wang
youtubeabout 7 hours ago

How Philips is scaling AI literacy across 70,000 employees

Watch on YouTube
SW
Shawn Wang
youtubeabout 8 hours ago

Notion’s rebuild for agentic AI: How GPT‑5 helped unlock autonomous workflows

Watch on YouTube
SW
Shawn Wang
youtubeabout 8 hours ago

From pilot to practice: How BBVA is scaling AI across the organization

Watch on YouTube
SW
Shawn Wang
𝕏xabout 9 hours ago

whoaaaah Sentry gathering some serious js talent recently

View on X
SW
Shawn Wang
youtubeabout 12 hours ago

ChatGPT Atlas and the next era of web browsing — the OpenAI Podcast Ep. 9

Watch on YouTube
SW
Shawn Wang
youtubeabout 17 hours ago

I Tried Vibe Coding Against a Coding Genius… The Result Is Insane

Watch on YouTube
SW
Shawn Wang
𝕏xabout 17 hours ago

as I talked to many many friends (thank you!) about the Priscilla x Zuck pod, I found that many of you non biologists feel intimidated by the science. Don't worry, the Biohub story is now primarily building a frontier AI lab, so I put this in more familiar terms to you in today's @latentspacepod writeup.

also included an extract of my favorite quote from Priscilla, where she articulates why exactly they're aiming at the Immune System next after solving the virtual cell. Makes sense that if we want to keep people healthy, we should deeply understand nature's first party healthcare system in order to fix/supplement it!

Latent.Space: Priscilla Chan and Mark Zuckerberg co-founded the Chan Zuckerberg Initiative (CZI) in 2015, committing 99% of their Meta shares to advance science, education, and opportunity. As a pediatrician and CEO of Meta respectively, they've built CZI into one of the most ambitious
Link: https://x.com/latentspacepod/status/1986487301397176813

View on X
SW
Shawn Wang
youtubeabout 19 hours ago

LangChain Academy New Course: LangSmith Essentials

Watch on YouTube
SW
Shawn Wang
youtubeabout 20 hours ago

To-Do List Middleware (Python)

Watch on YouTube
SW
Shawn Wang
youtubeabout 20 hours ago

Why Most AI Agents Fail — and How a Simple Todo List Fixes It

Watch on YouTube
SW
Shawn Wang
youtubeabout 20 hours ago

Execute code with sandboxes for Deep Agents

Watch on YouTube
SW
Shawn Wang
𝕏x1 day ago

ok lots of the most cracked ai ux people at @sync_conf today but i’m going to be thinking about this demo for a long time

heres @_adamwiggins_ , @threepointone , me et al grilling Steve about his new collaborative multiagents on Tldraw!!

Steve Ruiz: An interesting realization as we’re working on multi-agent collaboration fairies project: we’re kind of rediscovering “old ai” as I understood it in videogames, ie entities with a programmatic state machine responding to events, except with “new ai” handling the rest
Link: https://x.com/steveruizok/status/1986164318032044338

View on X
SW
Shawn Wang
youtube1 day ago

I Found a $10K MRR App Idea With 400,000 Built-In Customers

Watch on YouTube
SW
Shawn Wang
youtube1 day ago

Can AI program a robot dog?

Watch on YouTube
SW
Shawn Wang
youtube1 day ago

I Built A Movie Poster Faceswap App In 20 Minutes (And Shipped It To The App Store)

Watch on YouTube
SW
Shawn Wang
youtube2 days ago

Add a Human-in-the-Loop to Your LangChain Agent (Next.js + TypeScript Tutorial)

Watch on YouTube
SW
Shawn Wang
youtube2 days ago

Tool Call Limit Middleware (Python)

Watch on YouTube
SW
Shawn Wang
youtube2 days ago

How Agents Use Context Engineering

Watch on YouTube
SW
Shawn Wang
𝕏x2 days ago

yes Aether is a biglab model, no its not a llama

one thing i learned from the inside is that all code ide model strings are combinations of models combined with prompts and toolcalls for the model. so no you wont feel like it is “raw” matching what youd find in the normal chat.

abandoned blogpost: The Model Selector is a Lie

swyx: {highly anticipated model} is coming sooner than all of you anons have been speculating. can try all 3 sizes now in @windsurf Next and see if you can figure it out :)
Link: https://x.com/swyx/status/1988425979216437653

View on X
SW
Shawn Wang
𝕏x3 days ago

## Lists outside, Detail inside.

one fun discovery i've had using Cursor 2.0 is that the Agents view and the Editor view are not actually that different if you happen to be #rightsidebar gang

I've been #rightsidebar since @shanselman first talked about it many moons ago and just realized my natural coding layout looks a bit like this now.

I encourage people to think about AI UX in terms of literal screen real estate dedicated to agents. with Cursor 1.0 it was ~25%, with Cursor 2.0 it's now ~50-100% depending how you use it. I like being able to inspect my filesystem (which also doubles as logs) so I'm at 50%. but its interesting to see how LHS = Agent and RHS = Manual and have the karpathy "autonomy slider" visually manifest as the VSCode pane splitters.

swyx: first try of the new Cursor Composer model (btw I'm still a DAU of Cursor! @smol_ai is entirely a Cursor vibecode) one impressive example - Composer 1 finished 2 rounds of human feedback and debugging with me and got me what I wanted, while Sonnet 4.5 was still working on its
Link: https://x.com/swyx/status/1983585407368609909

View on X
SW
Shawn Wang
𝕏x3 days ago

one notable problem about this dream of "continuous learning" models / "self evolving" agents is that the business model of centralized ai right now is quite antithetical to this

try telling your GTM people you can't hype customers about a new version of the product because the product versions itself

try telling your agent engineers they need to improve the agent, but no they can't access real user data to do so

the transition to Level 4 probably involves going from "global best" to local maxima and that is going to be a very hard reset for a lot of the way AI strategy is set up today for most agent labs

prinz: Google has released a new "Introduction to Agents" guide, which discusses a "self-evolving" agentic system (Level 4). "At this level, an agentic system can identify gaps in its own capabilities and create new tools or even new agents to fill them." https://www.kaggle.com/whitepaper-introduction-to-agents
Link: https://x.com/deredleritt3r/status/1988244609504334283

View on X
SW
Shawn Wang
𝕏x3 days ago

ok one of the things that i've always wanted at AIE is coming to pass, after the Great @dylan522p v @jefrankle debate of 2024: the Great MCP debate!

@vtahowe and @ianlivingstone are taking on all challengers - if you are a knowledgeable MCP skeptic, come do a live debate next week at AIE CODE! sign up/let Allie know you are interested!

Allie Howe: Some say MCP is
>obsolete
>a security nightmare
>creates unnecessary abstraction
Others say MCP provides
>standardization
>a security boundary at the network layer
>federated data integration
Which is it? Time to host a debate, live at @aidotengineer CODE next week!
Link: https://x.com/vtahowe/status/1988050009233723890

View on X
SW
Shawn Wang
𝕏x3 days ago

RT Surge AI Everyone's acting like models are ready to replace humans in work settings. We put that to the test by creating an entire company and having 9 models act as a customer service agent handling 150 tickets and requests of increasing complexity. Verdict: without common sense, models are nowhere near ready. 👇 https://surgehq.ai/blog/rl-envs-real-world

View on X
SW
Shawn Wang
youtube3 days ago

How to Build AI Apps in 86 minutes (Complete Guide)

Watch on YouTube
SW
Shawn Wang
𝕏x3 days ago

RT Latent.Space "You know for the next period, we really wanna make science the main focus of what we're doing and specifically, the Biohub is really gonna be like the main focus of our philanthropy, and it's just something that we're very excited about. When we started 10 years ago, we had this idea like, 'Okay, I bring experience as a physician, Mark's an engineer and he builds things, and we have an opportunity to give back resources to make an impact on this world.'"

Priscilla and Mark elaborate on their future mission of their philanthropy and how they can leverage their experiences to do something special. @czi @officialbiohub

Latent.Space: Priscilla Chan and Mark Zuckerberg co-founded the Chan Zuckerberg Initiative (CZI) in 2015, committing 99% of their Meta shares to advance science, education, and opportunity. As a pediatrician and CEO of Meta respectively, they've built CZI into one of the most ambitious
Link: https://x.com/latentspacepod/status/1986487301397176813

View on X
SW
Shawn Wang
𝕏x3 days ago

RT Allie Howe Some say MCP is
>obsolete
>a security nightmare
>creates unnecessary abstraction
Others say MCP provides
>standardization
>a security boundary at the network layer
>federated data integration
Which is it? Time to host a debate, live at @aidotengineer CODE next week!

View on X
SW
Shawn Wang
youtube4 days ago

Glif AI: The $10 App That Replaces a Full Creative Team

Watch on YouTube
SW
Shawn Wang
𝕏x4 days ago

RT SMB Attorney Wow... Warren Buffett says goodbye in his final annual letter today (full copy in the comments below). As he signed off, the following were final words of advice: "One perhaps self-serving observation. I’m happy to say I feel better about the second half of my life than the first. My advice: Don’t beat yourself up over past mistakes – learn at least a little from them and move on. It is never too late to improve. Get the right heroes and copy them. You can start with Tom Murphy; he was the best. Remember Alfred Nobel, later of Nobel Prize fame, who – reportedly – read his own obituary that was mistakenly printed when his brother died and a newspaper got mixed up. He was horrified at what he read and realized he should change his behavior. Don’t count on a newsroom mix-up: Decide what you would like your obituary to say and live the life to deserve it. Greatness does not come about through accumulating great amounts of money, great amounts of publicity or great power in government. When you help someone in any of thousands of ways, you help the world. Kindness is costless but also priceless. Whether you are religious or not, it’s hard to beat The Golden Rule as a guide to behavior. I write this as one who has been thoughtless countless times and made many mistakes but also became very lucky in learning from some wonderful friends how to behave better (still a long way from perfect, however). Keep in mind that the cleaning lady is as much a human being as the Chairman."

View on X
SW
Shawn Wang
youtube4 days ago

Cursor 2.0 Tutorial for Beginners (Full Course)

Watch on YouTube
SW
Shawn Wang
𝕏x4 days ago

RT Akshay Kothari Notion’s Q3 was our finest yet. Growth is accelerating at scale. We used to sell software; now we sell work itself. That’s a profound shift, and the numbers show it. I ran 50+ demos this past quarter, refining our story for customers each time. I shared the latest version internally last week and thought: why not share it with all of you too? It’s an 18-minute watch; feedback welcome!

View on X
SW
Shawn Wang
youtube4 days ago

Build Hour: Agent RFT

Watch on YouTube
SW
Shawn Wang
𝕏x4 days ago

Ahead of AIE CODE presented for the first time by @GoogleDeepMind, I'm happy to release this special weekend chat with our Day 2 emcee @jedborovik, product lead of @Julesagent: https://www.youtube.com/watch?v=emWgP_fr04k&lc=Ugy2OGAfghkKIgO_bIl4AaABAg which was very fun because Jed turned the tables on me and challenged me on what I think needs to happen next for the hyper competitive landscape of Coding Agents. see you soon in New York!

View on X
SW
Shawn Wang
𝕏x5 days ago

i think this is beautiful - doing ViT from raw pixels means you need to jointly train everything - this poor model must independently solve MNIST, and THEN/ALSO learn to be a perfect calculator in its weights. then keep going.... only constrained by the data you give it.

it's why @percyliang's concept of "foundation models" in 2021 was so disruptive/sacrilegious in the Google vs OpenAI sprint to the GPT: instead of 1000 different small models all specialized in their tasks, concentrate all that budget/data/resources in one supermodel that has the capacity to model 1000 tasks; along the way you get 1) transfer learning, 2) capabilities you never explicitly trained for, 3) emergent abilities that only unlock at a given param/depth/data exposure rate.

varchasvi: @karpathy said we should delete tokenizers. So I did, and it worked! I read @karpathy 's post on OCR where he talks about models that read text straight from pixels instead of using a tokenizer (and how having a tokenizer is a problem), and I wanted to build one such model from
Link: https://x.com/varchasvee_/status/1986811191474401773

View on X
SW
Shawn Wang
𝕏x6 days ago

RT Latent.Space Congratulations @alexgshaw @Mike_A_Merrill on the exciting and impressive launch! Shoutouts to @lschmidt3 , @andykonwinski, @LaudeInstitute for supporting this amazing work!

Alex Shaw: Today, we’re announcing the next chapter of Terminal-Bench with two releases: 1. Harbor, a new package for running sandboxed agent rollouts at scale 2. Terminal-Bench 2.0, a harder version of Terminal-Bench with increased verification
Link: https://x.com/alexgshaw/status/1986911106108211461

View on X
SW
Shawn Wang
𝕏x7 days ago

RT Alex Shaw Today, we’re announcing the next chapter of Terminal-Bench with two releases: 1. Harbor, a new package for running sandboxed agent rollouts at scale 2. Terminal-Bench 2.0, a harder version of Terminal-Bench with increased verification

View on X
SW
Shawn Wang
𝕏x7 days ago

"use brain"

Aiden Bai: also who the hell made this diagram. so based
Link: https://x.com/aidenybai/status/1985819403825664233

View on X
SW
Shawn Wang
𝕏x7 days ago

RT Gabe Greenberg we've been cooking @g2i_co very excited to party with @swyx on this! @MichelleBakels / @Beccalytics are organizing @ReactMiamiConf and @AIEMiami the same week!!!

AI Engineer: Miami: The world's leading AI Engineering conference is coming to Miami! AI Engineer: Miami, April 20–21. Two days. One track. A community of engineers and founders on the frontier of AI technology. Organized by @g2i_co, supported by @aiDotEngineer. Join us! https://www.ai.engineer/miami
Link: https://x.com/AIEMiami/status/1984305102920785977

View on X
SW
Shawn Wang
youtube8 days ago

How I Built This Entire App in 35 Minutes (With No Code)

Watch on YouTube
SW
Shawn Wang
📝blog8 days ago

The Impossible Triangle of LLM Infra

another talk I am giving at Mastra's TypeScript AI conf today https://docs.google.com/presentation/d/1NnQ3H5Bki3vWRRJdVXoCFJ5dsNKH9QrC-eEQ2Z8olck/edit?usp=sharing

1 min read · Shawn Wang
Read full article
SW
Shawn Wang
youtube10 days ago

Automatic code reviews with OpenAI Codex

Watch on YouTube
SW
Shawn Wang
youtube11 days ago

The Best Vibe Coding Tools in 2026

Watch on YouTube
SW
Shawn Wang
youtube12 days ago

Go from a Cursor, Replit, or Bolt Site to the App Store (in One Prompt)

Watch on YouTube
SW
Shawn Wang
youtube14 days ago

The 7 Essential Prompts of App Design (No Code)

Watch on YouTube
SW
Shawn Wang
youtube14 days ago

Monster Manor by Sora 2

Watch on YouTube
SW
Shawn Wang
youtube15 days ago

Introducing: Sora 2 Character Cameos

Watch on YouTube
SW
Shawn Wang
youtube15 days ago

Claude Code updates: When to use Haiku 4.5, Claude Code on web, and more.

Watch on YouTube
SW
Shawn Wang
youtube16 days ago

The Easiest Way to Build Mobile Apps Ever (No Coding Needed)

Watch on YouTube
SW
Shawn Wang
youtube16 days ago

Build Hour: AgentKit

Watch on YouTube
SW
Shawn Wang
youtube16 days ago

ENEOS Materials accelerates manufacturing productivity with ChatGPT Enterprise

Watch on YouTube
SW
Shawn Wang
youtube16 days ago

MIXI accelerates secure, organization-wide adoption of ChatGPT Enterprise

Watch on YouTube
SW
Shawn Wang
youtube16 days ago

Vercel's CEO Shares 5 AI Startup Ideas So Good You’ll Quit Your Job

Watch on YouTube
SW
Shawn Wang
youtube16 days ago

Sam, Jakub, and Wojciech on the future of OpenAI with audience Q&A

Watch on YouTube
SW
Shawn Wang
youtube17 days ago

Vibe Coder vs Pro Developer | Who Can Clone a $300M App Better?

Watch on YouTube
SW
Shawn Wang
youtube18 days ago

Can I Clone a $6M App Better Than a Pro Developer? (With No Code)

Watch on YouTube
SW
Shawn Wang
youtube18 days ago

Effect: the Good Parts, `use workflow`, and Vercel Domains — Dillon Mulroy

Watch on YouTube
SW
Shawn Wang
youtube18 days ago

Claude Skills Built Me an AI Agent Army (They Run Everything Now)

Watch on YouTube
SW
Shawn Wang
youtube18 days ago

Build beautiful frontends with OpenAI Codex

Watch on YouTube
SW
Shawn Wang
youtube18 days ago

How Claude is transforming financial services

Watch on YouTube
SW
Shawn Wang
youtube18 days ago

LLM Building Blocks & Transformer Alternatives

Watch on YouTube
SW
Shawn Wang
youtube22 days ago

Intro to scrolling tabs in ChatGPT Atlas

Watch on YouTube
SW
Shawn Wang
youtube22 days ago

Work smarter with your company knowledge in ChatGPT

Watch on YouTube
SW
Shawn Wang
youtube22 days ago

Claude now has memory

Watch on YouTube
SW
Shawn Wang
youtube23 days ago

The Complete AI Stack and Workflow for 100M+ Video Views

Watch on YouTube
SW
Shawn Wang
youtube24 days ago

Introducing ChatGPT Atlas

Watch on YouTube
SW
Shawn Wang
youtube25 days ago

Make $1M+ in 2026 with this Trend (IRL is back)

Watch on YouTube
SW
Shawn Wang
youtube25 days ago

Claude Code on the web

Watch on YouTube
SW
Shawn Wang
youtube25 days ago

Introducing Claude for Life Sciences

Watch on YouTube
SW
Shawn Wang
youtube25 days ago

Scaling enterprise AI: Fireside chat with Eli Lilly’s Diogo Rau and Dario Amodei

Watch on YouTube
SW
Shawn Wang
youtube25 days ago

How AbbVie accelerates drug discovery with Claude

Watch on YouTube
SW
Shawn Wang
📝blog25 days ago

The only Permanent Underclass are the ones who believe it is permanent

1 min read · Shawn Wang
Read full article
SW
Shawn Wang
youtube28 days ago

I Built an Entire App with OpenAI's Codex and 8 AI Agent Employees

Watch on YouTube
SW
Shawn Wang
youtube28 days ago

Space with ChatGPT

Watch on YouTube
SW
Shawn Wang
youtube28 days ago

Building more effective AI agents

Watch on YouTube
SW
Shawn Wang
youtube28 days ago

OpenAI Codex in your code editor

Watch on YouTube
SW
Shawn Wang
youtube29 days ago

Creating custom Skills with Claude

Watch on YouTube
SW
Shawn Wang
youtube29 days ago

Agent Skills: Specialized capabilities you can customize

Watch on YouTube
SW
Shawn Wang
youtube29 days ago

Connect Claude to Microsoft 365

Watch on YouTube
SW
Shawn Wang
youtube29 days ago

The 5 Levels of AI App Building (Vibe Coding Master Class)

Watch on YouTube
SW
Shawn Wang
youtube30 days ago

Genspark's Super AI Agent is INSANE

Watch on YouTube
SW
Shawn Wang
youtube30 days ago

Introducing Claude Haiku 4.5

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Using OpenAI Codex CLI with GPT-5-Codex

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Build Hour: Responses API

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

My AI Videos Hit 1M+ Views (Veo3 + Sora 2 Demo)

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

I Built a Fully Polished App in 30 Minutes (No Code)

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

OpenAI x Broadcom — The OpenAI Podcast Ep. 8

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Building with MCP and the Claude API

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

What Work Looks Like with ChatGPT | Write, Research, Code, Create

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

OpenAI's NEW AI Agent Builder Replaces n8n & Zapier

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Developer State Of The Union

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

A Conversation with Sam and Jony

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Built for SF by SF: AI Solutions Helping Our City Thrive

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Evals in Action: From Frontier Research to Production Applications

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Sora, ImageGen, and Codex: The Next Wave of Creative Production

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

AMA: Scaling AI Applications into the Enterprise

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Model Behavior: The Science of AI Style

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Measuring Agents With Interactive Evaluations

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Shipping with Codex

Watch on YouTube
SW
Shawn Wang
youtubeabout 1 month ago

Building with Open Models

Watch on YouTube