Close Menu
  • Home
  • Aerospace & Defense
    • Automation & Process Control
      • Automotive & Transportation
  • Banking & Finance
    • Chemicals & Materials
    • Consumer Goods & Services
  • Economy
    • Electronics & Semiconductor
  • Energy & Resources
    • Food & Beverage
    • Hospitality & Tourism
    • Information Technology
  • Agriculture
What's Hot

Former UN special rapporteur Richard Falk interrogated for several hours in Canada | Israeli-Palestinian conflict News

US immigration crackdown continues with arrests in Charlotte, North Carolina | Donald Trump News

Renewable energy is reshaping the global economy – new report

Facebook X (Twitter) Instagram
USA Business Watch – Insightful News on Economy, Finance, Politics & Industry
  • Home
  • Aerospace & Defense
    • Automation & Process Control
      • Automotive & Transportation
  • Banking & Finance
    • Chemicals & Materials
    • Consumer Goods & Services
  • Economy
    • Electronics & Semiconductor
  • Energy & Resources
    • Food & Beverage
    • Hospitality & Tourism
    • Information Technology
  • Agriculture
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
USA Business Watch – Insightful News on Economy, Finance, Politics & Industry
Home » NPU core improves inference performance by over 60%
Electronics & Semiconductor

NPU core improves inference performance by over 60%

ThefuturedatainsightsBy ThefuturedatainsightsJuly 8, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Core neural processing unit technology to improve CHATGPT inference performance by over 60%

Oaken’s quantization algorithm consisting of three components: (a) threshold-based online offline hybrid quantization, (b) group-shift quantization, and (c) dense density and sparse encoding. Credit: Proceedings of the 52nd Annual International Symposium on Computer Architecture (2025). doi:10.1145/3695053.3731019

Modern generation AI models such as Openai’s ChatGPT-4 and Google’s Gemini 2.5 require not only high memory bandwidth but also large memory capacity. This is why generate AI cloud operating companies like Microsoft and Google buy hundreds of thousands of Nvidia GPUs.

As a solution to address the core challenges of building such high-performance AI infrastructure, Korean researchers have successfully developed NPU (neural machining unit) core technology that improves the inference performance of generated AI models by an average of over 60%, while consuming approximately 44% less power than modern GPUs.

Professor Jongse Park, a research team from Kaist School of Computing, has collaborated with HyperAccel Inc. to develop high-performance, low-power NPU core technologies specializing in generator AI clouds like ChatGpt.

The techniques proposed by the research team were presented by the Ph.D. Students Minsu Kim and Dr. Seongmin Hong, PhD, from HyperAccel Inc., are co-first authors at the 2025 International Symposium on Computer Architecture (ISCA 2025), held in Tokyo from June 21-25.

The key objective of this study is to improve the performance of large-scale generation AI services by minimizing loss of accuracy and solving memory bottleneck problems while reducing the weight of the inference process. This research has been highly praised for its integrated design of AI semiconductors and AI system software, which are key components of the AI ​​infrastructure.

While existing GPU-based AI infrastructures require multiple GPU devices to meet high bandwidth and capacity demands, this technology allows for the same level of AI infrastructure configuration with fewer NPU devices via KV cache quantization. KV caches account for most memory usage, and quantization significantly reduces the cost of building a generated AI cloud.

Core neural processing unit technology to improve CHATGPT inference performance by over 60%

Overall Oken Accelerator Architecture. Credit: Proceedings of the 52nd Annual International Symposium on Computer Architecture (2025). doi:10.1145/3695053.3731019

The researchers designed it to integrate with memory interfaces without modifying the operational logic of existing NPU architectures. This hardware architecture not only implements the proposed quantization algorithm, but also employs page-level memory management techniques to efficiently utilize limited memory bandwidth and capacity, and introduces new encoding techniques optimized for quantized KV caches.

Furthermore, when building an NPU-based AI cloud with superior cost and power efficiency compared to modern GPUs, the high performance and low power nature of NPUs is expected to significantly reduce operational costs.

Professor Jongse Park said, “Through collaboration with HyperAccel Inc., this research has discovered solutions for the optical strength algorithms of generated AI inference and successfully developed core NPU technology that can solve memory problems.

“This technology demonstrates the potential to implement high-performance, low-power infrastructure specialized for generating AI, and is expected to play an important role not only in AI cloud data centers, but also in AI transformation (AX) environments, represented by dynamically viable AI such as agent AI.”

Details: Minsu Kim et al, Oaken: Fast and efficient LLM using the minutes of the Computer Architecture (2025) on Online Offline Hybrid KV Cache Quantization, 52nd Annual International Symposium. doi:10.1145/3695053.3731019

Provided by Korea Institute of Advanced Science and Technology (KAIST)

Quote: AI Cloud Infrastructure Faster and Eco-Friendly: NPU Core will improve 60% (2025, July 7) above 60% (2025, July 7) from https://techxplore.com/news/2025-07-Ai-cloud-infrastuture-faster-greener.htmll on July 8, 2025.

This document is subject to copyright. Apart from fair transactions for private research or research purposes, there is no part that is reproduced without written permission. Content is provided with information only.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous Article“There’s nothing more to give”: Farmers stand firm against our trade pressures
Next Article Research shows that manipulated wood is more resistant to microorganisms than plastic
Thefuturedatainsights
  • Website

Related Posts

Renewable energy is reshaping the global economy – new report

November 16, 2025

Newsom touts California’s record increase in battery energy at UN climate change conference

November 15, 2025

Super strong and lightweight metal composite material can withstand extreme heat.

November 15, 2025
Leave A Reply Cancel Reply

Latest Posts

NFU warns as UK considers cattle feed additives to reduce methane

Unions sound alarm after wave of GPS attacks on NI farms

NI farmers warned to act as BVD rules tightened on 1 December

Northern Ireland braces for significant loss of veterinary medicine packs by 2026

Latest Posts

Boeing defense workers strike votes on new contract

November 13, 2025

Firefly Aerospace (FLY) Q3 2025 Earnings

November 12, 2025

Flight cancellations have eased and the end of the shutdown is in sight

November 12, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Former UN special rapporteur Richard Falk interrogated for several hours in Canada | Israeli-Palestinian conflict News
  • US immigration crackdown continues with arrests in Charlotte, North Carolina | Donald Trump News
  • Renewable energy is reshaping the global economy – new report
  • JP Morgan doesn’t want to pay Frank founder Charlie Jarvis’ legal costs
  • Mexican protests inspired by Gen Z movement draw older government critics | Mexican protest news

Recent Comments

No comments to show.

Welcome to USA Business Watch – your trusted source for real-time insights, in-depth analysis, and industry trends across the American and global business landscape.

At USABusinessWatch.com, we aim to inform decision-makers, professionals, entrepreneurs, and curious minds with credible news and expert commentary across key sectors that shape the economy and society.

Facebook X (Twitter) Instagram Pinterest YouTube

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Archives

  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • March 2022
  • January 2021

Categories

  • Aerospace & Defense
  • Agriculture
  • Automation & Process Control
  • Automotive & Transportation
  • Banking & Finance
  • Chemicals & Materials
  • Consumer Goods & Services
  • Economy
  • Economy
  • Electronics & Semiconductor
  • Energy & Resources
  • Food & Beverage
  • Hospitality & Tourism
  • Information Technology
  • Political
Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 usabusinesswatch. Designed by usabusinesswatch.

Type above and press Enter to search. Press Esc to cancel.