Oak Ridge National Laboratory’s Frontier Supercomputer Now Fastest
AMD-Powered Frontier Supercomputer Breaks the Exascale Barrier
Hey Guys,
The is Data Science Learning Center Premium,
I like following the Supercomputer AiSupremacy race, though I don’t consider it really about artificial intelligence, more like pure-play computing.
If you want to join 3,500 other free subscribers about A.I you can though. It’s my flag-ship Newsletter and where on Substack I tend to be the most active in any given week.
This supercomputer race tends to change hands pretty quickly and often, but it’s all in good sport. The progress made is pretty incredible.
Check out the famous Top500 list here.
Quintillion calculations a second Barrier Broken
Today, Oak Ridge National Laboratory’s Frontier supercomputer was crowned fastest on the planet in the semiannual Top500 list. Frontier more than doubled the speed of the last titleholder, Japan’s Fugaku supercomputer, and is the first to officially clock speeds over a quintillion calculations a second—a milestone computing has pursued for 14 years.
True to the AiSupremacy stories I like to cover, this is America and China in race to the top of cutting-edge supercomputing why of course.
How can we visualize how powerful this supercomputer is? There’s an easy way:
Imagine giving all 7.9 billion people on the planet a pencil and a list of simple arithmetic or multiplication problems. Now, ask everyone to solve one problem per second for four and half years.
Frontier can do the same work in a second, and keep it up indefinitely. A thousand years’ worth of arithmetic by everyone on Earth would take Frontier just a little under four minutes.
You can read the original press release here. (I like to always read the source).
The one min promo video for this is actually a bit amusing.
With 1.1 exaflops of performance. The system is the first to achieve an unprecedented level of computing performance known as exascale, a threshold of a quintillion calculations per second.
Frontier features a theoretical peak performance of 2 exaflops, or two quintillion calculations per second, making it ten times more powerful than ORNL’s Summit system. The system leverages ORNL’s extensive expertise in accelerated computing and will enable scientists to develop critically needed technologies for the country’s energy, economic and national security, helping researchers address problems of national importance that were impossible to solve just five years ago.
In the top 500, here are positions 5-10.
The Age of Exascale
The number of floating-point operations, or simple mathematical problems, a computer solves per second is denoted FLOP/s or colloquially “flops.” Progress is tracked in multiples of a thousand: A thousand flops equals a kiloflop, a million flops equals a megaflop, and so on.
The ASCI Red supercomputer was the first to record speeds of a trillion flops, or a teraflop, in 1997. Suffice to say we’ve come a long way since then.
Still Huge Rooms
It’s true today’s supercomputers are far faster than older machines, but they still take up whole rooms, with rows of cabinets bristling with wires and chips. Frontier, in particular, is a liquid-cooled system by HPE Cray running 8.73 million AMD processing cores. In addition to being the fastest in the world, it’s also the second most efficient—outdone only by a test system made up of one of its cabinets—with a rating of 52.23 gigaflops/watt.
So, What’s the Big Deal?
Most supercomputers are funded, built, and operated by government agencies. They’re used by scientists to model physical systems, like the climate or structure of the universe, but also by the military for nuclear weapons research.
Supercomputers are now tailor-made to run the latest algorithms in artificial intelligence too. Indeed, a few years ago, Top500 added a new lower precision benchmark to measure supercomputing speed on AI applications. By that mark, Fugaku eclipsed an exaflop way back in 2020. The intersection of supercomputing and A.I. has yet to fully be scaled and explored.
The next steps for Frontier include continued testing and validation of the system, which remains on track for final acceptance and early science access later in 2022 and open for full science at the beginning of 2023.
China holds two top-ten spots with its Sunway TaihuLight from the National Research Center of Parallel Computer Engineering & Technology (NRCPC) and Tianhe-2A built by China's National University of Defense Technology (NUDT). Engadget notes however that China is rumored to already have no less than two exascale systems (according to the Linmark benchmark) on new Sunway Oceanlite and Tianhe-3 systems. China doesn’t disclose everything.
Frontier equates to 68 million instructions per second for each of the 86 billion neurons in the brain, highlighting the sheer computational horsepower. It appears this system will compete for the AI leadership position with newly-announced AI-focused supercomputers powered by Nvidia's Arm-based Grace CPU Superchips.
A Win for AMD as well
AMD's EPYC is now in 94 of the Top500 supercomputers in the world, marking a steady increase over the 73 systems listed in November 2021, and the 49 listed in June 2021. AMD also appears in more than half of the new systems on the list this year.
Frontier, a HPE Cray EX supercomputer, also claimed the number one spot on the Green500 list, which rates energy use and efficiency by commercially available supercomputing systems, with 62.68 gigaflops performance per watt.
However, in terms of power efficiency, AMD reigns supreme in the latest Green500 list — the company powers the four most efficient systems in the world, and also has eight of the top ten and 17 of the top 20 spots.
Supercomputers Powering AI?
As very large machine learning algorithms have emerged in recent years, private companies have begun to build their own machines alongside governments. Microsoft and OpenAI made headlines in 2020 with a machine they claimed was fifth fastest in the world. In January, Meta said its upcoming RSC supercomputer would be fastest at AI in the world at 5 exaflops. (It appears they’ll now need a few more chips to match Frontier). Then there’s Tesla as well.
Dojo, its supercomputer designed entirely in-house. Dojo is a supercomputer by virtue of its complexity and speed but differs from other supercomputers in quite a few ways. It’s not yet a Supercomputer since it’s not fully built out yet.
With scalability of models and super-computers A.I. will be able to do new things for sure. Frontier and other private supercomputers will allow machine learning algorithms to further push the limits. Today’s most advanced algorithms boast hundreds of billions of parameters—or internal connections—but upcoming algorithms will likely grow into the trillions.
More relevantly, rumors were flying last year that China had as many as two exascale supercomputers operating in secret. Researchers published some details on the machines in papers late last year, but they have yet to be officially benchmarked by Top500. It’s widely suspected China is ahead in this race as of 2021 or 2022.
By solving calculations up to 50 times faster than today’s top supercomputers—exceeding a quintillion, or 1018, calculations per second—Frontier will enable researchers to deliver breakthroughs in scientific discovery, energy assurance, economic competitiveness, and national security.
Exciting times in AI, supercomputing and data science.
Thanks for reading and supporting the channel!