    IGeeKphone China Phone, Tablet PC, VR, RC Drone News, Reviews

    Xiaomi Releases MiMo-V2.5-Pro-UltraSpeed: Generation Is About 10 Times Faster, Exceeding 1,000 Tokens per Second

    By Brady Cotton, June 9, 2026

    Igeekphone News, June 9: Xiaomi, working with TileRT, has officially launched MiMo-V2.5-Pro-UltraSpeed. According to the announcement, this is the first time a trillion-parameter model has reached a text generation speed of 1,000 tokens per second on a single standard node of eight general-purpose GPUs.

    The peak rate can reach 1,200 tokens per second. No custom dedicated chips are needed at any stage, which significantly lowers the barrier to deploying ultra-fast AI inference.
    A limited-time API service launches alongside this version. It is priced at three times the original MiMo-V2.5-Pro, but generation is approximately 10 times faster, so the extra speed outweighs the extra cost.
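
The price and speed figures above can be put side by side with a little arithmetic. This is a rough sketch, not official pricing math: the baseline rate of 100 tokens per second is only inferred from "approximately 10 times", and the 6,000-token response length is a hypothetical example.

```python
# Figures taken from the article.
ULTRA_TOKENS_PER_SEC = 1000.0  # sustained rate reported for UltraSpeed
SPEEDUP = 10.0                 # "approximately 10 times" faster
PRICE_MULTIPLIER = 3.0         # UltraSpeed price relative to MiMo-V2.5-Pro


def generation_seconds(num_tokens: int, tokens_per_sec: float) -> float:
    """Time needed to generate num_tokens at a steady rate."""
    return num_tokens / tokens_per_sec


# Assumption: the baseline rate is inferred as 1000 / 10 = 100 tokens/s.
baseline_tokens_per_sec = ULTRA_TOKENS_PER_SEC / SPEEDUP

num_tokens = 6000  # hypothetical response length
base_time = generation_seconds(num_tokens, baseline_tokens_per_sec)
ultra_time = generation_seconds(num_tokens, ULTRA_TOKENS_PER_SEC)

# How much speed is gained for each multiple of price paid.
speed_per_price = SPEEDUP / PRICE_MULTIPLIER

print(f"baseline: {base_time:.0f} s, UltraSpeed: {ultra_time:.0f} s")
print(f"speed gained per unit of price: {speed_per_price:.2f}x")
```

Under these assumptions the same response takes 6 seconds instead of 60, and each multiple of price buys roughly 3.3 times the speed.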

    Because high-speed inference resources are limited, the service is temporarily available on a subscription basis. The trial period runs from June 9 until 23:59 Beijing Time on June 23. The platform will prioritize reviewing enterprises and professional developers with real business needs. Ordinary users are free to try the chat function on a dedicated webpage.

    Each account can join the queue up to 10 times per day, and a single session lasts at most 30 minutes. A session that stays idle for 5 minutes is disconnected automatically to keep resource allocation fair.
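
The three limits above can be written down as a small set of rules. This is only an illustrative sketch: the names `Session`, `can_join_queue`, and `should_disconnect` are hypothetical, and treating each limit as reached exactly at the stated value is an assumption, since the article does not describe how the service enforces them.

```python
from dataclasses import dataclass

# Limits stated in the article.
DAILY_QUEUE_LIMIT = 10            # queue entries per account per day
MAX_SESSION_SECONDS = 30 * 60     # maximum length of one session
IDLE_TIMEOUT_SECONDS = 5 * 60     # idle time before disconnection


@dataclass
class Session:
    """Hypothetical session record; times are seconds on any common clock."""
    started_at: float
    last_active_at: float


def can_join_queue(joins_today: int) -> bool:
    """True while the account still has queue entries left today."""
    return joins_today < DAILY_QUEUE_LIMIT


def should_disconnect(session: Session, now: float) -> bool:
    """True once the session is too long or has been idle too long."""
    too_long = now - session.started_at >= MAX_SESSION_SECONDS
    idle = now - session.last_active_at >= IDLE_TIMEOUT_SECONDS
    return too_long or idle


session = Session(started_at=0.0, last_active_at=100.0)
print(should_disconnect(session, now=399.0))  # idle for 299 s
print(should_disconnect(session, now=400.0))  # idle for 300 s
```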

    This performance leap comes from co-designing the model and the inference system. It rests on three core techniques:

    The first is FP4 quantization. Because the model uses a mixture-of-experts (MoE) architecture, only the expert layers, which hold most of the parameters, are quantized to FP4, in a way the announcement describes as lossless. The remaining modules keep their original precision. This reduces memory usage and eases memory-bandwidth pressure while leaving the model's overall capability largely unchanged.
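
To make the idea of quantizing only the expert weights concrete, here is a minimal sketch in plain Python. It is not Xiaomi's implementation. It assumes the E2M1 flavour of FP4 (the article does not say which format is used), one scale per block of weights, and a hypothetical naming convention in which expert tensors contain "expert" in their name.

```python
# Magnitudes representable in E2M1 FP4 (assumed format).
FP4_GRID = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]


def quantize_block_fp4(block):
    """Return (scale, codes): one shared scale plus signed FP4 grid values."""
    max_abs = max(abs(x) for x in block)
    # Map the largest magnitude in the block onto the largest FP4 value.
    scale = max_abs / FP4_GRID[-1] if max_abs > 0 else 1.0
    codes = []
    for x in block:
        # Snap the scaled magnitude to the nearest representable value.
        magnitude = min(FP4_GRID, key=lambda g: abs(abs(x) / scale - g))
        codes.append(magnitude if x >= 0 else -magnitude)
    return scale, codes


def quantize_experts_only(params, block_size=4):
    """Round expert weights through FP4 and leave every other tensor as-is."""
    result = {}
    for name, weights in params.items():
        if "expert" not in name:  # hypothetical naming convention
            result[name] = list(weights)
            continue
        restored = []
        for start in range(0, len(weights), block_size):
            scale, codes = quantize_block_fp4(weights[start:start + block_size])
            restored.extend(code * scale for code in codes)
        result[name] = restored
    return result


params = {
    "attention.wq": [2.6, -6.0, 0.2, 1.4],
    "expert0.w1": [2.6, -6.0, 0.2, 1.4],
}
print(quantize_experts_only(params))
```

In this toy example the attention weights pass through untouched, while the expert weights become `[3.0, -6.0, 0.0, 1.5]`. Individual values do shift, so any claim of being lossless presumably refers to model quality rather than to each weight.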

    The second is DFlash, a block-parallel speculative decoding method. Instead of decoding strictly one token at a time, it predicts an entire block of text at once. In scenarios such as code and mathematical reasoning, an average of 6 to 7 tokens are confirmed per round, which significantly improves decoding efficiency.
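
The general draft-then-verify loop behind speculative decoding can be sketched as follows. This is a generic greedy-verification toy, not DFlash itself, whose internals the article does not describe. The two "models" are hypothetical stand-ins: the target simply counts upward, and the drafter guesses eight tokens at a time but gets every multiple of 7 wrong.

```python
from typing import Callable, List, Tuple

Token = int


def speculative_step(prefix: List[Token], draft_block: List[Token],
                     target_next: Callable[[List[Token]], Token]) -> List[Token]:
    """Return the tokens confirmed in one round.

    A real system checks the whole block in one parallel pass of the
    target model; this loop only emulates the accept/reject rule.
    """
    confirmed: List[Token] = []
    for token in draft_block:
        expected = target_next(prefix + confirmed)
        if token != expected:
            confirmed.append(expected)  # the target's correction ends the round
            return confirmed
        confirmed.append(token)
    # The whole block matched, so the target adds one extra token.
    confirmed.append(target_next(prefix + confirmed))
    return confirmed


def generate(prompt: List[Token],
             draft_block_fn: Callable[[List[Token]], List[Token]],
             target_next: Callable[[List[Token]], Token],
             max_new_tokens: int) -> Tuple[List[Token], int]:
    """Generate tokens round by round and count the rounds used."""
    out = list(prompt)
    rounds = 0
    while len(out) - len(prompt) < max_new_tokens:
        out.extend(speculative_step(out, draft_block_fn(out), target_next))
        rounds += 1
    return out[:len(prompt) + max_new_tokens], rounds


def target_next(seq: List[Token]) -> Token:
    """Toy target model: always continues the count."""
    return seq[-1] + 1


def draft_block(seq: List[Token], block_size: int = 8) -> List[Token]:
    """Toy drafter: counts upward but emits -1 for multiples of 7."""
    guesses = range(seq[-1] + 1, seq[-1] + 1 + block_size)
    return [-1 if g % 7 == 0 else g for g in guesses]


tokens, rounds = generate([0], draft_block, target_next, max_new_tokens=20)
print(tokens, rounds)
```

Here 20 new tokens are produced in 3 rounds instead of 20 sequential steps. The output is identical to what the target model alone would generate, because every drafted token is checked against it before being kept.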

    The third is the TileRT inference system, which restructures how work is executed on the GPU. It uses persistent kernels and heterogeneous pipelines to eliminate the delay caused by switching between operators, so the hardware's compute capacity stays busy throughout.

    The very fast inference is also reshaping where AI can be applied. The high speed makes it practical for a model to reason along several paths in parallel and correct its own errors, which improves the quality of logical reasoning. It greatly reduces waiting and lag in code generation, unlocking the productivity of programming agents. It also allows trillion-parameter models to be deployed in real-time decision-making scenarios with millisecond-level latency, such as high-frequency quantitative trading, real-time anti-fraud, and medical image analysis.
