GPT-4o just got outplayed — by a smaller model.

GPT-4o just got outplayed — by a smaller model.

Welcome back to Taledy AI, your go-to resource for staying up to date with the latest advancements in AI, one edition at a time.

This week? Things got spicy.

DeepSeek is teaching models how to think better than GPT-4o. OpenAI gave ChatGPT a memory upgrade. Meta just casually dropped a 2 trillion parameter tease. And NVIDIA? They've got a model that can switch reasoning on and off

Let's get into it ⬇️

Latest Video:

Discover how to use Manus to bulk-create YouTube videos

📈 The Latest in AI Signals

🚀 OpenAI Levels Up ChatGPT (Again)

ChatGPT now remembers everything. Literally.

🤔 What’s New:

  • Memory is persistent across chats
  • Better personalization, long-term context
  • You control what it remembers or forgets

🚫 Who Doesn't Get It Yet?

  • EEA, UK, Switzerland, Norway, Iceland, Liechtenstein

🔮 Why It Matters: We’re inching toward AI personal assistants that truly know you. The more you use it, the smarter it becomes. Creepy or cool? You decide.

Siri's Makeover Incoming: Apple is finally updating Siri for fall 2025. Expect smarter responses and deeper personalization.

📊 ByteDance Gets Logical: ByteDance introduces Seed-Thinking-v1.5 - a MoE model built for STEM reasoning. It's their first real leap into serious LLM territory.

🔥 HEADLINER: DeepSeek Reinvents Reinforcement

DeepSeek just dropped DeepSeek GRM — a generative reward model that critiques itself during inference. Think: "This AI doesn’t just answer. It provides explanations, evaluates its own responses, and adjusts its performance dynamically.

🔎 Method: SPCT (Self-Principled Critique Tuning)

  1. Rejective Fine-Tuning (RFT): Uses 1.07M instruction + 186K tough rejective samples
  2. Online RL via GRPO: Reward +1 if its answer matches ground truth, -1 if not. KL penalty? 0.08.
  3. Inference-Time Sampling: Get multiple self-critiques, then use a Meta-RM to filter out bad takes.

🔢 Results

  • RewardBench: From 86.0% → 90.4% with meta-filtering
  • PPE Preference: 64.7% → 67.2%
  • Overall: Single-pass 69.5% → 72.8% with 32-sample voting
  • Outperforms: Nemetron 4340BR, rivals GPT-4o

🔮 Why It Matters: This flips the script. Instead of scaling to 671B+ models, DeepSeek shows you can train smaller, smarter AIs that teach themselves. It’s like AI doing code reviews on its own brain.

🎉 New Model Watch: NVIDIA's Nemetron Ultra 253B

NVIDIA just launched a 253B parameter model that beats bigger beasts like DeepSeek R1 on most tasks,  and it runs on a single 8x H100 node.

🎯 Highlights:

  • Built on LLAMA 3.1
  • Uses Neural Architecture Search (NAS)
  • Reasoning On/Off toggle
    • Math500: 97% with reasoning
    • LiveCodeBench: 29% → 66.3%

🌊 Key Feature: Max sequence length of 131,072 tokens, long context window heaven.

🔮 Why It Matters: You don't need trillion-parameter monsters. Smart architecture + selective reasoning = efficient power. It's open-source with commercial use.

🚀 AI Tool of the Day: Fresh Picks

Taledy.com: Create viral short clips from YouTube videos or your own uploads.

👨‍🎨 Mind-Blowing AI: DreamActor-M1

From ByteDance comes DreamActor-M1: an AI that animates static images into full-body video sequences.

  • Smooth motion
  • Realistic expressions
  • Rivals Runway's Act-One

Perfect for content creators who want motion without mocap. 🎥

🚀 Rumor Radar

  • Meta teased a 2 TRILLION parameter "LLaMA 4 Behemoth" model
  • OpenAI’s GPT-4.1 + GPT-O3 expected to drop this week
  • DeepSeek R2 chatbot might already be in testing — SPCT-powered?
  • Google’s Gemini Live now has real-time camera + screen input AI

📢 From Taledy

MCPs are taking over the world. Are you familiar with them, or should I create a tutorial for that next time? Let me know in the comments or by replying to this email. 

📊 TL;DR Recap

  • DeepSeek’s SPCT beats GPT-4o in some benchmarks
  • ChatGPT memory is now permanent (unless you turn it off)
  • NVIDIA’s 253B model does more with less
  • Meta is back with open weight LLaMA-4s
  • Gemini Live = real-time screen + camera AI
  • ChatGPT is now the world’s most downloaded app
  • ByteDance and Appleare making bold AI moves

Stay curious, stay sharp.

— The Taledy Team 🧠

P.S. Think someone needs this newsletter? Share it with them!

Follow us on Twitter | Join the Taledy AI community | Subscribe on YouTube