Discover real AI creators shaping the future. Track their latest blogs, X posts, YouTube videos, WeChat Official Account posts, and GitHub commits — all in one place.
lol heard a 2nd startup today that has made sales and evals based on this podcast. it's fun to be "just an interviewer", but i'm always mindful/humbled by the fact that we are playing with live ammo here. LS has directly and indirectly impacted a lot of people's careers and roadmaps. not something i ever thought about when starting this thing 3 years ago
🆕 The End of SWE-Bench Verified (2024-2026) https://latent.space/p/swe-bench-dead Today @OpenAIDevs is announcing the voluntary deprecation of SWE-Bench Verified! We're releasing a podcast + analysis in today's post. Saturation of SWE-Bench has been a community hot topic for over a year -
new CLI for langsmith (tracing, evals) AND new skills for teaching agents to use said CLI
🚀 Announcing LangSmith Skills + CLI 🚀 Agent improvements are increasingly driven by coding agents themselves. We're releasing LangSmith Skills alongside the LangSmith CLI to make coding agents experts at the agent engineering lifecycle. LangSmith Skills enable agents to
Activity on repository
hamelsmu pushed awesome-agent-skills
Had to make a DNB version of this
Activity on repository
hamelsmu forked hamelsmu/awesome-agent-skills from VoltAgent/awesome-agent-skills
We integrated the Codex harness into Prism (http://prism.openai.com) — this means you get skills, reasoning levels, and the raw tenacity of the Codex model in your LaTeX environment. Oh and we also built version mgmt into Prism, which was one of the top requests. See below thread from @vicapow for some great examples of the power of Codex inside Prism 👇
🧵1/ We've brought the most advanced AI to Prism by introducing Codex to Prism. Prism is already the best place for scientific writing to happen—and with Codex, now you can write, compute, analyze, and iterate all in one place.