The AI Race Just Got Smarter: Why the Hierarchical Reasoning Model Is a Game-Changer

Scientists have just developed a new AI model inspired by the human brain, and it's already making waves. Early tests show that this breakthrough AI is outperforming even advanced LLMs like ChatGPT when it comes to complex reasoning tasks.

The Hierarchical Reasoning Model (HRM) is designed to mimic how the human brain processes complex information, and it has managed to outperform some of the leading LLMs on a benchmark that's known for being extremely tough to beat.

Scientists have created a new type of AI model that approaches reasoning in a completely different way compared to most large language models (LLMs) like ChatGPT. Thanks to this new approach, the model delivers significantly better performance on several key benchmarks.

The new reasoning AI, known as the Hierarchical Reasoning Model (HRM) (https://pmc.ncbi.nlm.nih.gov/articles/PMC11665873/), is inspired by how the human brain processes information, integrating data across different time scales, from milliseconds to minutes.

According to scientists at Sapient, an AI company based in Singapore, this model not only delivers better performance but also works more efficiently. That's because it needs fewer parameters and less training data compared to traditional models.

According to the study uploaded on June 26 to the arXiv (https://arxiv.org/abs/2506.21734) preprint database (still awaiting peer review), HRM uses just 27 million parameters and was trained on only 1,000 samples. In contrast, most advanced LLMs rely on billions, even trillions, of parameters. For comparison, while the exact number isn't public, estimates suggest that the newly released GPT-5 could have anywhere between 3 trillion and 5 trillion parameters.
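To put those parameter counts in proportion, the short Python sketch below divides the estimated GPT-5 range by HRM's reported size. The figures are the ones quoted in this article; the GPT-5 numbers are unconfirmed public estimates, and the constant and function names are illustrative only.

```python
# Parameter counts as quoted in the article. The GPT-5 range is an
# unconfirmed public estimate, not an official figure.
HRM_PARAMS = 27_000_000
GPT5_ESTIMATE_LOW = 3_000_000_000_000
GPT5_ESTIMATE_HIGH = 5_000_000_000_000


def size_ratio(large: int, small: int) -> float:
    """Return how many times larger `large` is than `small`."""
    return large / small


low = size_ratio(GPT5_ESTIMATE_LOW, HRM_PARAMS)
high = size_ratio(GPT5_ESTIMATE_HIGH, HRM_PARAMS)
print(f"Estimated GPT-5 / HRM size ratio: {low:,.0f}x to {high:,.0f}x")
```

If the estimates are in the right ballpark, HRM would be roughly five orders of magnitude smaller than GPT-5.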

A new way of thinking for AI

Scientists have developed a revolutionary AI model designed to think more like the human brain, and it's already beating some of the world's most advanced language models, including ChatGPT, in complex reasoning tests.

The new system, called the Hierarchical Reasoning Model (HRM), is inspired by how the brain processes and integrates information across different time scales, from milliseconds to minutes. Unlike traditional large language models (LLMs) that depend on brute-force computation, HRM focuses on smarter, structured reasoning.

Researchers at Sapient, an AI company based in Singapore, say HRM not only performs better but also works more efficiently. Unlike models with massive architectures, HRM uses just 27 million parameters and was trained on only 1,000 samples, a fraction of what modern LLMs need. For comparison, today's cutting-edge models, like GPT-5, are estimated to have between 3 trillion and 5 trillion parameters.

When tested on the ARC-AGI benchmark (an extremely challenging test designed to measure how close AI is to achieving artificial general intelligence, or AGI), HRM outscored several leading LLMs.

ARC-AGI-1: HRM scored 40.3%
(vs. OpenAI's o3-mini-high at 34.5%, Claude 3.7 at 21.2%, and DeepSeek R1 at 15.8%)

ARC-AGI-2: HRM achieved 5%
(compared to 3% for o3-mini-high, 1.3% for DeepSeek R1, and just 0.9% for Claude 3.7)
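For a quick sense of the gaps, the Python sketch below computes HRM's lead over the best-scoring rival on each benchmark. The scores are the ones reported above; the helper name `margin_over_best_rival` is hypothetical and exists only for this illustration.

```python
# Scores (percent) exactly as reported in the article.
SCORES = {
    "ARC-AGI-1": {"HRM": 40.3, "o3-mini-high": 34.5, "Claude 3.7": 21.2, "DeepSeek R1": 15.8},
    "ARC-AGI-2": {"HRM": 5.0, "o3-mini-high": 3.0, "DeepSeek R1": 1.3, "Claude 3.7": 0.9},
}


def margin_over_best_rival(results: dict[str, float], model: str = "HRM") -> tuple[str, float]:
    """Return the strongest other model and `model`'s lead in percentage points."""
    rivals = {name: score for name, score in results.items() if name != model}
    best = max(rivals, key=rivals.get)
    return best, round(results[model] - rivals[best], 1)


for benchmark, results in SCORES.items():
    rival, margin = margin_over_best_rival(results)
    print(f"{benchmark}: HRM leads {rival} by {margin} percentage points")
```

On these figures the lead is 5.8 percentage points on ARC-AGI-1 and 2.0 on ARC-AGI-2, both over o3-mini-high.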

Most LLMs, including ChatGPT, rely on chain-of-thought (CoT) reasoning, which breaks complex problems into smaller, natural-language steps. While this works well, HRM takes a different, brain-inspired approach, allowing it to process information hierarchically and solve difficult reasoning tasks with fewer resources.
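To give a feel for what "hierarchical" processing across time scales can look like, here is a toy Python sketch. It is not Sapient's implementation. It loosely follows the preprint's description of a slow high-level module paired with a fast low-level one, and every function name, weight, and step count below is a made-up assumption for illustration.

```python
import math


def fast_update(low: float, high: float, x: float) -> float:
    """Hypothetical low-level step: runs every tick, using the input and the slow context."""
    return math.tanh(0.5 * low + 0.3 * high + x)


def slow_update(high: float, low: float) -> float:
    """Hypothetical high-level step: runs once per cycle, absorbing the fast state."""
    return math.tanh(0.8 * high + 0.5 * low)


def run(x: float, cycles: int = 3, fast_steps: int = 4) -> tuple[float, float, int, int]:
    """Nest a fast inner loop inside a slow outer loop and count the updates of each."""
    low, high = 0.0, 0.0
    fast_count = slow_count = 0
    for _ in range(cycles):
        for _ in range(fast_steps):
            low = fast_update(low, high, x)  # fast time scale
            fast_count += 1
        high = slow_update(high, low)  # slow time scale
        slow_count += 1
    return low, high, fast_count, slow_count


low, high, n_fast, n_slow = run(x=0.2)
print(f"fast updates: {n_fast}, slow updates: {n_slow}")
```

The point of the nesting is that the slow state changes only after the fast state has had several steps to settle, so the two levels operate on different time scales without any intermediate steps being written out in natural language.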

Experts believe this breakthrough could mark a significant leap toward human-like AI reasoning, and possibly bring us closer to AGI than ever before.
