The company behind ChatGPT is back with bombastic claim that their new o1 model is capable of so-called "complex reasoning." Ever-faithful, Alex and Emily tear it apart. Plus the flaws in a tech publication's new 'AI hype index,' and some palette-cleansing new regulation against data-scraping worker surveillance.
References:
OpenAI: Learning to reason with LLMs
- How reasoning works
- GPQA, a 'graduate-level' Q&A benchmark system
Fresh AI Hell:
MIT Technology Review's AI 'AI hype index'
CFPB Takes Action to Curb Unchecked Worker Surveillance
You can check out future livestreams on Twitch.
Our book, 'The AI Con,' comes out in May! Pre-order your copy now.
Subscribe to our newsletter via Buttondown.
Follow us!
Emily
- Bluesky: emilymbender.bsky.social
- Mastodon: dair-community.social/@EmilyMBender
Alex
- Bluesky: alexhanna.bsky.social
- Mastodon: dair-community.social/@alex
- Twitter: @alexhanna
Music by Toby Menon.
Artwork by Naomi Pleasure-Park.
Production by Christie Taylor.