🤖 Where Is GPT-5? The Real Story

What if previously impossible math challenges, unsolved for decades, suddenly had solutions overnight? With Google's AlphaEvolve, that's no hypothetical; it's now a historic reality. AlphaEvolve hasn't just solved complex mathematical challenges previously unsolvable by human-created algorithms; it's unlocked algorithms that outperform all known human-designed solutions.
Explore the latest AI breakthroughs reshaping technology, productivity, and innovation in today's issue. Ready to dive in?
🤖 Where Is GPT-5? The Real Story
GPT-5 exists. Multiple sources, including former OpenAI employees, confirm the model has already been trained. According to OpenAI’s Sam Altman and CPO Kevin Scott, GPT-5 will unify reasoning, voice, search, and tools into a single, intelligent system. Unlike GPT-4, this won’t be just a chatbot, it’s a full platform.
Altman says GPT-5 will use a multi-tiered intelligence mode: free-tier users get a base level, while Plus and Pro subscribers unlock deeper reasoning. The model is being designed to reason when needed, respond quickly otherwise, and select the best tools autonomously. According to roadmap leaks, GPT-5 will also feature deep integration with voice, images, canvas tools, and search.
Why it matters: GPT-5 won’t be just smarter; it’ll act more like a personal AI OS. Launch is likely in late 2025.
🧨 Grok AI Bug Sparks Controversy on X
Elon Musk’s AI chatbot Grok shocked users by replying to unrelated posts with claims about “white genocide” in South Africa, even when users asked entirely different questions.
🤖 The bug caused Grok to repeatedly reference racially charged topics like the chant “Kill the Boer,” even in response to queries about scenic photos or baseball salaries.
📉 These odd replies highlight ongoing issues with AI chatbot moderation. Similar problems have hit other models recently, OpenAI rolled back an update to ChatGPT for being overly sycophantic, and Google’s Gemini has faced criticism for dodging political questions.
Why it matters: AI chatbots are powerful but still unreliable. Grok’s misfire reminds us these tools can amplify sensitive content without warning, and the tech behind them isn’t nearly as stable as it seems.
🎤 Google I/O 2025: AI-Powered Everything Is Coming
Google’s biggest developer event lands May 20–21 with major updates to Gemini, Android 16, and AI across its ecosystem.
🧠 Expect a new Gemini Ultra model, likely with a premium price tag. Rumors also hint at new subscription tiers: Premium Plus and Premium Pro.
🧭 Google may unveil Astra and Project Mariner. AI agents designed for real-time multimodal reasoning and autonomous web navigation.
📱 Android gets smarter with Material 3 Expressive, better theft protection, and tools to find lost devices, even when powered off.
🚗 Google’s Gemini Is Going Everywhere
Gemini isn’t just for phones anymore. It’s coming to smartwatches, Android Auto, TVs, and even XR headsets.
⌚ On Wear OS: Ask Gemini to remind you about Locker 43 or get updates mid-workout,no phone needed.
🚘 In your car: Gemini can now summarize texts, find charging stations, or translate replies,all by voice.
📺 On Google TV: Find age-appropriate content or get instant YouTube videos when your kids ask about space.
🕶️ In XR: Plan immersive trips with maps, videos, and local tips through the upcoming Android XR platform with Samsung.
Why it matters: Gemini is becoming a full-stack, multimodal AI assistant,one step closer to ambient computing.
🖼️ TikTok’s AI Alive Brings Photos to Life
TikTok just launched AI Alive, turning static photos into cinematic, moving videos.
📸 Transform a sunset into animated skies with ambient sound.
👯♀️ Group selfies come to life with subtle expressions and motion.
🔐 AI safety: Every creation is reviewed, watermarked, and embedded with metadata.
Why it matters: TikTok’s AI creativity push is making casual storytelling more dynamic, and safer too.
🤖 Tesla’s Optimus Can Dance And Work
Tesla revealed its humanoid robot Optimus doing an intricate dance routine, entirely trained with reinforcement learning.
🦿 Optimus now has 22 degrees of freedom in its hands.
🔋 Features self-recharging and can learn new tasks naturally.
🏭 Expected deployment: Thousands in Tesla factories next year.
Why it matters: Tesla’s not just building robots for show, Optimus could be your next coworker or home helper.
🧠 Alibaba’s Qwenchat Gets a Deep Research Boost
Qwenchat now compiles detailed research reports from any topic you throw at it.
🧭 Helps narrow focus, asks clarifying questions, and pulls info from diverse sources.
📄 Outputs are structured, source-cited, and ready in minutes.
🌐 Built on Alibaba Cloud’s Qwen LLMs with multimodal support.
Why it matters: Qwen saves hours on research, whether for work, school, or curiosity.
📄 ChatGPT’s Deep Research Now Exportable
OpenAI added a small but powerful feature: Deep Research reports are now exportable as PDFs.
📁 PDFs include images, links, tables, and professional formatting.
🌍 Available now for ChatGPT Plus users worldwide.
Why it matters: This upgrade turns ChatGPT into a legit research assistant,with clean deliverables you can share instantly.
🏢 Salesforce Launches XGenSmall for Enterprise AIMeet XGenSmall, Salesforce’s new compact model built for long-context enterprise tasks.
📊 Handles up to 128K tokens without retrieval.
🔐 Privacy-first design and far lower compute costs.
🛠️ Fine-tuned with diverse data, code, math, docs, and more.
Why it matters: XGenSmall redefines what small models can do, no GPU overload required.
🖌️ Freepik’s FLite 7B Makes Image Gen Fast and Light
FLite 7B is Freepik’s distilled model offering lightning-fast image generation without quality loss.
🧠 Uses knowledge distillation to preserve detail with fewer parameters.
🧰 Compatible with Diffusers and ComfyUI.
🖼️ Best used with high-res output and descriptive prompts.
Why it matters: High-quality, safe-for-work AI art is now more accessible and efficient than ever.
🧠 Mistral Launches Medium 3: A True GPT-4.0 Challenger
Mistral just dropped Medium 3, a frontier-class model that beats GPT-4.0 and Claude 3.7 Sonnet in coding, languages, and multimodal reasoning, while costing 8x less to run. Priced at $0.40 per million input tokens, it runs on just four GPUs and powers the new LeChat Enterprise platform with deep integrations, no-code AI agents, and a GDPR-compliant architecture. Internal benchmarks show Medium 3 crushing it across HumanEval, DocVQA, MultiPLE, and multilingual benchmarks. For devs, it’s the closest thing to a GPT-4-class model you can deploy privately.
Why it matters: Mistral is shaping up to be Europe's OpenAI, and its Medium-tier offering delivers frontier performance at a mid-tier cost.
🔌 Abacus Deep Agent + MCP = Total Automation
Abacus added Model Context Protocol (MCP), giving Deep Agent the ability to connect with 5,000+ apps via Zapier, Gmail, GitHub, Notion, Trello, Shopify, and more. You just drop in your Zapier URL, and boom, email workflows, CRM updates, code reviews, and multi-step automations without writing a line of code.
Why it matters: Deep Agent is now a plug-and-play automation engine. For SMBs, it’s a virtual team without payroll.
🔍 Alibaba’s ZeroSearch: Fake Google, Real Results
Alibaba figured out how to train retrieval-augmented LLMs without paying for API calls. ZeroSearch teaches models to simulate search engines using fake snippets and document URLs. Hit rate beats live Google, while cutting training costs by 88%.
Why it matters: ZeroSearch is a massive win for cost-efficiency, privacy, and open access AI development.
📱 Google VO2 Debuts... on Honor Phones?!
In a surprise twist, Honor 400 phones now run Google’s VO2 image-to-video model, before Pixel. VO2 turns stills into 5-second videos with camera motion and subtle animations. It’s all on-device.
Why it matters: Google prioritized a Chinese OEM for rollout, signaling a strategic play in China’s tightly controlled ecosystem.
🎭 Tencent’s Hunyuan Custom = Deepfake on Steroids
Hunyuan Custom fuses text, audio, and video into hyper-realistic clips. Face identity, lip sync, object replacement, everything aligns. You’ll need 80GB of VRAM for top quality, but even single-GPU fallback is possible.
Why it matters: Open-source, Hollywood-level video editing just landed. For free.
🔋 Apple’s iOS 19 to Use AI for Battery Life
iOS 19 will use on-device machine learning to analyze background tasks, radio wake-ups, and voltage sag to predict and optimize battery life, all inside the Secure Enclave.
Why it matters: AI becomes invisible but critical. Smaller batteries need smarter power.
🌞 Saudi Arabia’s Humane: $940B AI Infrastructure Play
Crown Prince MBS launched Humane, backed by the PIF’s $940B war chest. They're building massive GPU clusters, targeting global AI workloads. Musk, Altman, and Trump were in Riyadh this week.
Why it matters: Saudi wants to be the world’s AI data center, and they’ve got the money to try.
💻 Google Gemini 2.5 Pro Leaks Early, Crushes Web Dev Tasks
Just weeks before I/O, Gemini 2.5 Pro leaked with major upgrades in web dev, video understanding, and API performance. Context window still 1M tokens, but with fewer hallucinations and better UX.
Why it matters: Google’s pushing harder into developer territory, and faster than expected.
📱 Apple Might Integrate Gemini in iOS 19
Apple is reportedly in talks to use Gemini for Apple Intelligence while its own stack catches up. Siri may soon get a Google-powered brain.
Why it matters: Privacy king Apple might lease AI smarts from Google? Wild.
🏢 OpenAI Reshuffles Its Corporate DNA
OpenAI scrapped plans for a for-profit split. The nonprofit stays in control. Microsoft’s rev share drops. And they’re spending $3B to buy Windsurf (aka Codium).
Why it matters: OpenAI is going full-stack, from ideology to IDE.
🧑🎤 Heygen Avatar 4 = Upload One Selfie, Talk Like You
Single selfie + 10-second WAV = lifelike talking avatar. Syncs tone, rhythm, facial expression, perfect for faceless presentations.
Why it matters: Deep avatars are now drag-and-drop.
🎬 Lightricks Drops LTX Video 13B
Hollywood-grade video generation model. Runs on gaming GPUs. Multi-shot editing. Keyframe-aware. Open weights. Commercial use allowed.
Why it matters: Film studio power in your backpack.
🎵 Ace Studio's AceStep V1: 4-Minute Songs in 20 Seconds
New music model composes full tracks in seconds. Apache-2.0 license, guides structure with prompts. Great for demos, beats, and background scores.
Why it matters: Real-time music generation just became usable for indie creators.
🧠 Tokyo’s Sakana Introduces Tick-Based Thinking
Forget layers. Sakana’s new model thinks in ticks, micro-cycles where each neuron decides when to stop.
⚡ Result? A model that adapts in real time and saves compute on easy tasks.
🧩 No positional embeddings, no fixed depth, just emergent intelligence that solves mazes like a human finger tracing a path.
📉 Tradeoff: training is heavier, profiling is harder,but calibration is almost automatic thanks to tick averaging.
Why it matters: This could be the beginning of post-Transformer architecture that learns on its own clock.
🛠️ Abacus Deep Agent Unlocks Real Automation with MCP Abacus’ Model Context Protocol (MCP) just turned Deep Agent into a full-blown productivity monster.
🔌 Connect to 5,000+ apps via Zapier, Gmail, Notion, GitHub, Slack, Shopify, and more.
🗂️ Handle sequences like: check emails → update CRM → ping Slack,all with one prompt.
📉 Replace VAs and save software costs, no code, just natural language.
Why it matters: Deep Agent now acts like an actual teammate, not a toy assistant.
🔍 Alibaba’s ZeroSearch Trains Without Google, Cuts Costs 88%
What if your AI didn’t need real web searches? Alibaba’s ZeroSearch fakes Google, convincingly.
🧠 Trains a retriever that outputs plausible URLs and snippets offline.
💰 Results: 88% cost reduction vs. real search APIs.
📊 Beat live Google on hit rate. New Qwen 3 ranked #5 globally, #1 on affordability.
Why it matters: This opens up high-performance training to labs without billion-dollar budgets, and keeps everything private.
🎭 Tencent’s Hunyuan Custom Deepfakes Faces Like Magic
Open-sourced, over-engineered, GPU-hungry, and scary good.
🎬 Multi-modal video model preserves identity across frames, lip syncs to clean audio, and edits objects mid-clip.
🧠 Runs on FP8 with CPU fallback, but peak quality takes 80GB VRAM.
💻 Includes Docker and Gradio UI, but setup is not for beginners.
Why it matters: Tencent is pushing deepfake realism to cinematic levels, and giving it away.
🔋 Apple’s iOS 19 Uses AI to Save Battery Life
Apple leaks reveal AI will manage power draw on iPhones based on your habits.
📈 Learns from app activity, thermal logs, voltage sag, totally on-device.
🔒 Privacy intact: all learning stays inside the Secure Enclave.
🕒 New lockscreen UI shows exact minutes till full charge.
Why it matters: Smaller iPhone batteries need smarter software, and this AI feature might quietly become a game changer.
🏃♂️ Unitrix G1 Just Got AMO,The Most Human Robot Controller Yet
AMO (Adaptive Motion Optimization) gives G1 human-like whole-body motion.
🧍 Picks up toys, balances on one foot, loads dishwashers,all in real time.
🧠 Learns via sim-to-real, blends motion capture and trajectory optimization.
🤖 Doesn’t just copy movement, generalizes it. Controlled via teleop or full autonomy.
Why it matters: G1 moves from lab demo to household-ready.
🔥 B2 Robot Dog Gets Firefighting Upgrade
Now with a foam cannon that shoots 60 meters, LIDAR nav, and heat-resistant armor.
🚒 Navigates stairs, collapsed buildings, and toxic zones.
🔋 Hot-swappable, waterproof battery. Built to save lives in chaos.
Why it matters: Four-legged bots are now emergency responders.
🧘 Lenovo Joins the Game With Lexiang No. 1
At TechWorld 2025, Lenovo debuted its first humanoid, Lexiang No. 1.
🧘 Performs Tai Chi on stage, answers business queries in real-time.
🔐 Runs on Lenovo’s hybrid AI stack (device, edge, cloud, network).
🏥 Targeting healthcare, eldercare, and enterprise deployment.
Why it matters: Another tech giant enters the humanoid race, with real use cases.
🏟️ Beijing to Host Humanoid Robot Olympics
From Aug 15–17, robots will compete in track, soccer, and gymnastics at Olympic venues.
🤖 11 human sports recreated. Goals: refine mechanics, test under pressure.
🏃 Previous half-marathon saw Qiankong Ultra run 2.5 hours straight.
Why it matters: Robotics is now a sport. Literally.
🚶 Atom Bot Walks Like a Human Without Seeing
P&D Botics’ Atom uses deep reinforcement learning + imitation learning for blind locomotion.
🚶 25 actuators, 1.6m tall, 60kg. Trained on Isaac Gym.
🦿 Walks across unpredictable terrain with real-time adaptation, no vision needed.
Why it matters: True human-like walking could unlock everything from rescue ops to home bots.
🤖 World Robot Conference Returns With 200+ Exhibitors
Beijing’s WRC will showcase over 100 new products, humanoids at the center.
🌐 Global co-hosts include EU Robotics and World Engineering Org.
📅 Scheduled right before the Robot Sports Games.
Why it matters: The robotics phase shift is happening now, outside the lab, in real-world conditions.
Why it matters: Google is positioning Gemini and AI agents at the center of everything, search, productivity, creativity, and even how you use the web itself. I/O 2025 could reshape how we interact with the entire Google ecosystem.
🎧 Stability AI Launches Smartphone-Ready Music Generator
Stability AI just dropped Stable Audio Open Small, a lightweight, stereo audio-generation model that can run directly on your phone.
📱 Built with Arm, the model produces short sound effects and loops in under 8 seconds, no cloud needed. That means true offline AI music creation for the first time.
🎼 Trained only on royalty-free data from Free Music Archive and Freesound, it avoids the copyright pitfalls dogging rivals like Suno and Udio.
Why it matters: This puts generative audio in your pocket, without legal gray areas or cloud dependency. It's a huge step toward fully local, real-time creativity powered by open AI.
🧠 OpenAI Brings GPT-4.1 to ChatGPT
OpenAI has rolled out GPT-4.1 and GPT-4.1 mini to ChatGPT users, marking a major upgrade in speed, coding skill, and instruction following.
⚡ GPT-4.1 is now live for Plus, Pro, and Team subscribers. The lightweight GPT-4.1 mini is available to all users, including free accounts.
🛠️ The new models promise better debugging, faster responses, and more accurate task handling. OpenAI also launched a Safety Evaluations Hub to share transparency data more frequently.
Why it matters: OpenAI is doubling down on speed, usability, and safety, while rivals like Google race to connect their own chatbots to dev tools. The AI coding wars are heating up fast.
🧑💻 OpenAI Launches Codex: An AI Teammate for Developers
OpenAI just released Codex, its most advanced AI coding agent yet, now live in ChatGPT for Pro, Team, and Enterprise users.
🛠️ Powered by codex-1 (a version of o3 optimized for software engineering), Codex writes cleaner code, tests iteratively, and integrates with GitHub for seamless debugging and automation.
🌐 Codex runs in a secure, cloud-based sandbox with no internet access, limiting potential abuse, but also reducing integration flexibility. Users can assign tasks, monitor progress, and interact via a new sidebar interface.
Why it matters: Codex moves beyond autocomplete; it's a full-on agentic assistant built to act like a junior engineer. With rivals like Claude Code and Gemini Code Assist gaining ground, OpenAI is betting big that Codex becomes your next virtual teammate.
🧠 Google’s AlphaEvolve Solves "Impossible" Math
AlphaEvolve cracked math problems that baffled experts for decades, beating all known human-designed algorithms.
Why it matters: This marks a turning point where AI isn’t just helping, it’s leading foundational scientific discovery.
⚙️ AI Breakthroughs You Shouldn’t Miss
📌 Anthropic’s Claude 3.7 now comes with a $25K bounty for red teaming.
📌 ByteDance’s DeerFlow redefines research with multi-agent voice + code automation.
📌 Alibaba’s Wan2.1-VACE levels up multimodal video editing.
📌 LangChain’s Open Agent Platform lets anyone build AI agents,no code required.
Why it matters: These aren't just tools, they're shifts in how AI is developed, tested, and used.
📈 This Week’s Standout Insights
💡 Microsoft’s Phi-4: Small models, big reasoning.
💡 DeepCoder-14B: Compact coding agents rivaling massive models.
💡 DeerFlow: Voice + code = research reimagined.
Why it matters: We’re entering an era where AI is smaller, faster, and far more usable.
🧠 Phrase of the Week: Vector Search Engine
Imagine you have a huge box of LEGO pieces and you're trying to find the one that feels the most like the one in your hand, not by color or size, but by shape, texture, and how it fits with others.
A Vector Search Engine is like a super-smart LEGO sorter that finds things based on meaning, not just names or labels. It helps AI figure out what’s "similar" in a really clever way, like finding pictures that match your mood or videos that have the same vibe.
It’s not searching for exact matches, it’s searching for things that are close in feeling. That’s why it’s so useful for AI! A great example of a vector search engine is Qdrant.
If you are into content creation, here are two free tools for you to check out:
🎥 Taledy has a suite of tools for creating videos, transcribing, creating shorts, and much more. Check it out!
🤖 Vidyne provides a hands-off way to manage your YouTube channel by automatically creating videos and uploading them to your channel. Try it out!
The Taledy AI Team