Close Menu
    What's Hot

    AI Is Now Part of Coach and Kate Spade Designers’ Workflow

    February 6, 2026

    Bitcoin Logs $3.2B In Loss-Taking, Beats Luna And FTX-Era Shock

    February 6, 2026

    Michael Ovitz Praised Jeffrey Epstein and Planned Meetings Files Show

    February 6, 2026
    Facebook X (Twitter) Instagram
    Hot Paths
    • Home
    • News
    • Politics
    • Money
    • Personal Finance
    • Business
    • Economy
    • Investing
    • Markets
      • Stocks
      • Futures & Commodities
      • Crypto
      • Forex
    • Technology
    Facebook X (Twitter) Instagram
    Hot Paths
    Home»Money»Transformers Probably Won’t Make AI As Smart As Humans. Others Might.
    Money

    Transformers Probably Won’t Make AI As Smart As Humans. Others Might.

    Press RoomBy Press RoomDecember 5, 2023No Comments4 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email
    • ChatGPT changed the conversation about AI.
    • But the tech powering it has limitations and may struggle to make AI that is as smart as humans.
    • Researchers are now looking at alternatives. 

    Loading Something is loading.

    Thanks for signing up!

    Access your favorite topics in a personalized feed while you’re on the go.

    Bull

    The groundbreaking work of a bunch of Googlers in 2017 introduced the world to transformers — neural networks that power popular AI products today.

    They power the large-language model, or LLM, beneath OpenAI’s ChatGPT, the chatbot whose explosion onto the scene last year prompted Bill Gates to declare “the age of AI has begun.”

    The mission for some AI entrepreneurs now is to realize a sci-fi vision and create artificial general intelligence (AGI): AI that appears as intelligent as a human.

    But while transformers can power ChatGPT, a preprint paper published by Google researchers last month suggests they might not be able to make the human-like abstractions, extrapolations, and predictions that would imply we’re at AGI.

    ChatGPT merely responds to users’ prompts with text using the data a human has trained it on. In its earliest public form, the chatbot had no knowledge of events beyond September, 2021, which it had to acknowledge every time someone asked abut more recent topics.

    Testing transformers’ ability to move beyond the data, the Google researchers described “degradation” of their “generalization for even simple extrapolation tasks.”

    This has raised the question of whether human-like AI is even possible. Another is whether different technologies may get us there.

    Some researchers are testing alternatives to figure that out, with another new paper suggesting that there might be a better model waiting in the wings.

    Research submitted to open-access repository ArXiv on December 1 by Albert Gu, assistant professor at the machine-learning department of Carnegie Mellon and Tri Dao, chief scientist at Together AI, introduces a model called Mamba.

    Quadratic attention has been indispensable for information-dense modalities such as language… until now.

    Announcing Mamba: a new SSM arch. that has linear-time scaling, ultra long context, and most importantly–outperforms Transformers everywhere we’ve tried.

    With @tri_dao 1/ pic.twitter.com/vXumZqJsdb

    — Albert Gu (@_albertgu) December 4, 2023

    Mamba is a state-space model, or SSM, and, according to Gu and Dao, it seems capable of beating transformers on performance in a bunch of tasks.

    A caveat: Research submitted to ArXiv is moderated but not necessarily peer-reviewed. This means the public gets to see research faster, but it isn’t necessarily reliable.

    Like LLMs, SSMs are capable of language modeling, the process through which chatbots like ChatGPT function. But SSMs do this with mathematical models of different “states” that users’ prompts can take.

    Gu and Dao’s research states: “Mamba achieves state-of-the-art performance across several modalities such as language, audio, and genomics.”

    On language modeling, Mamba “outperforms transformers of the same size and matches transformers twice its size, both in pretraining and downstream evaluation,” Gu and Dao noted.

    Writing on X, Dao also noted how a feature particular to SSMs means Mamba is able to generate language responses five times faster than a transformer.

    Our scan implementation is *30x faster* than basic PyTorch/JAX, and orders of magnitude faster than quadratic FlashAttention when sequence lengths get long.

    And because of the fixed-size recurrent state (no KV cache!) – Mamba can do LM inference 5x faster than a Transformer.
    6/ pic.twitter.com/llc1eZFHLt

    — Tri Dao (@tri_dao) December 4, 2023

    In response, Dr Jim Fan, a research scientist at software company Nvidia, wrote on X that he’s “always excited by new attempts to dethrone transformers. We need more of these.”

    He gave “kudos” to Dao and Gu “for pushing on alternative sequence architectures for many years now.”

    ChatGPT was a landmark cultural event that sparked an AI boom. But its technology looks unlikely to lead the industry to its promised land of human-like intelligence.

    But if repeated testing confirms Mamba does consistently outperform transformers, it could inch the industry closer.

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Press Room

    Related Posts

    AI Is Now Part of Coach and Kate Spade Designers’ Workflow

    February 6, 2026

    Michael Ovitz Praised Jeffrey Epstein and Planned Meetings Files Show

    February 6, 2026

    Anthropic and OpenAI Release Dueling AI Models on the Same Day

    February 6, 2026
    Leave A Reply Cancel Reply

    LATEST NEWS

    AI Is Now Part of Coach and Kate Spade Designers’ Workflow

    February 6, 2026

    Bitcoin Logs $3.2B In Loss-Taking, Beats Luna And FTX-Era Shock

    February 6, 2026

    Michael Ovitz Praised Jeffrey Epstein and Planned Meetings Files Show

    February 6, 2026

    Bitwise Files S-1 With SEC to Launch Uniswap-Focused ETF

    February 6, 2026
    POPULAR
    Business

    The Business of Formula One

    May 27, 2023
    Business

    Weddings and divorce: the scourge of investment returns

    May 27, 2023
    Business

    How F1 found a secret fuel to accelerate media rights growth

    May 27, 2023
    Advertisement
    Load WordPress Sites in as fast as 37ms!

    Archives

    • February 2026
    • January 2026
    • December 2025
    • November 2025
    • October 2025
    • September 2025
    • August 2025
    • July 2025
    • June 2025
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • May 2023

    Categories

    • Business
    • Crypto
    • Economy
    • Forex
    • Futures & Commodities
    • Investing
    • Market Data
    • Money
    • News
    • Personal Finance
    • Politics
    • Stocks
    • Technology

    Your source for the serious news. This demo is crafted specifically to exhibit the use of the theme as a news site. Visit our main page for more demos.

    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • Home
    • Buy Now
    © 2026 ThemeSphere. Designed by ThemeSphere.

    Type above and press Enter to search. Press Esc to cancel.