USA Business Watch – Insightful News on Economy, Finance, Politics & Industry

Approach improves how new skills can be taught to large language models

By Bussiness Insights, July 7, 2025

Researchers have developed a technique that significantly improves the performance of large language models without increasing the computational power required to fine-tune them. The researchers demonstrated that the technique outperforms previous techniques on tasks including commonsense reasoning, arithmetic reasoning, instruction following, code generation, and visual recognition.

Large language models are artificial intelligence systems that are pretrained on huge datasets. After pretraining, these models predict which words should follow each other in order to respond to user queries. However, the non-specific nature of pretraining means that there is ample room for improvement when user queries focus on a particular topic, such as when a user asks the model to answer a math question or write computer code.
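To make the idea of predicting which word should follow concrete, here is a minimal sketch in Python. It assumes a model has already assigned a score to each candidate word; the scores and the tiny vocabulary are invented for illustration and do not come from any real model.

```python
import math

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores.values())  # subtract the max for numerical stability
    exps = {word: math.exp(s - peak) for word, s in scores.items()}
    total = sum(exps.values())
    return {word: e / total for word, e in exps.items()}

def predict_next_word(scores):
    """Greedy choice: return the candidate with the highest probability."""
    probs = softmax(scores)
    return max(probs, key=probs.get)

# Hypothetical scores a model might assign after the prompt "2 + 2 =".
toy_scores = {"4": 3.1, "5": 1.2, "four": 2.4, "banana": -2.0}
next_word = predict_next_word(toy_scores)
```

Fine-tuning, in these terms, adjusts the model so that the scores it produces for a specialized task (math, code) rank the right continuation higher.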

“To improve a model’s ability to perform more specific tasks, you need to fine-tune the model,” said Tianfu Wu, co-author of a paper on the work and an associate professor of computer engineering at North Carolina State University.

“However, these models are so large that it is not feasible to retrain the entire model. Instead, we need to determine the smallest number of changes needed to improve the model’s performance. We have developed a technique called WeGeFT (pronounced wee-gift), which represents a significant advance in fine-tuning these large models.”

The major breakthrough in fine-tuning these large models was a technique called LoRA, which was introduced in 2022. LoRA works by using mathematical tools to identify a small subset of key parameters that are most likely to improve a model’s performance on a particular task.
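In its published form, LoRA (low-rank adaptation) leaves the pretrained weight matrix W frozen and trains only two small matrices, B and A, whose product is added to W. The sketch below illustrates that idea in plain Python with toy sizes; the dimensions, the rank, and the helper names are assumptions chosen for illustration, and no training loop is shown.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def add(a, b):
    """Add two matrices of the same shape element by element."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def random_matrix(rows, cols, rng, scale=0.1):
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]

d_out, d_in, rank = 64, 64, 4          # toy sizes, not taken from the article
rng = random.Random(0)

W = random_matrix(d_out, d_in, rng)    # pretrained weight, kept frozen
A = random_matrix(rank, d_in, rng)     # trainable, random initialization
B = [[0.0] * rank for _ in range(d_out)]  # trainable, zero initialization

def adapted_weight(W, B, A):
    """Effective weight used at inference: W + B @ A."""
    return add(W, matmul(B, A))

full_params = d_out * d_in             # parameters touched by full fine-tuning
lora_params = rank * (d_out + d_in)    # parameters trained by the low-rank update
```

Because B starts at zero, the adapted weight equals W before any training step, so fine-tuning begins from the pretrained model's behavior. With these toy sizes the low-rank update trains 512 values instead of 4,096, and the gap widens as the matrices grow.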

There have been many attempts to improve on LoRA, but Wu and his collaborators found that these previous efforts either required significantly more computing power to improve performance or used the same amount of computing power without improving performance.

“WeGeFT builds on LoRA, but incorporates additional mathematical tools that allow us to determine which of the key parameters the model is already familiar with and which parameters the model would need to ‘learn,’” says Wu. “By placing more weight on the truly novel parameters, we can improve model performance compared to LoRA without incorporating significant new computational demands.”
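The article does not spell out the mathematics behind this. One way to picture an update that depends on what the model already knows is to generate the update from the pretrained weights themselves, for example as W multiplied by two small trainable matrices. The sketch below shows only that reading; the formula, the names phi and psi, and the sizes are assumptions for illustration, not the authors' implementation.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def random_matrix(rows, cols, rng, scale=0.1):
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]

d_out, d_in, rank = 64, 64, 4            # toy sizes, assumed for illustration
rng = random.Random(0)

W = random_matrix(d_out, d_in, rng)      # pretrained weight, kept frozen
phi = random_matrix(d_in, rank, rng)     # hypothetical trainable projection
psi = [[0.0] * d_in for _ in range(rank)]  # hypothetical trainable projection, zero init

def generated_update(W, phi, psi):
    """Assumed form of a weight-generated update: delta_W = W @ phi @ psi."""
    return matmul(matmul(W, phi), psi)

delta_W = generated_update(W, phi, psi)
generator_params = rank * (d_in + d_in)  # values trained in this sketch
```

In this reading the trainable matrices depend only on the input dimension, so the number of trained values (512 here) matches the low-rank budget of a rank-4 LoRA update on the same 64-by-64 matrix, while the update itself is tied to the pretrained weights.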

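In proof-of-concept testing, the researchers found that WeGeFT performed as well as or better than LoRA and its many variants across a variety of downstream tasks, including commonsense reasoning, arithmetic reasoning, instruction following, code generation, and visual recognition.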

“I think this is a valuable step forward,” Wu says. “WeGeFT could also be used to identify elements of the model that are responsible for harmful outputs, with the goal of improving AI alignment and performing ‘surgery’ to improve model safety and outputs.”

The paper, “WeGeFT: Weight-Generative Fine-Tuning for Multi-Faceted Efficient Adaptation of Large Models,” will be presented July 17 at the International Conference on Machine Learning in Vancouver, Canada. Chinmay Savadikar, a Ph.D. student at NC State, is a co-author of the paper, as is Xi Song, an independent researcher.

More information: “WeGeFT: Weight-Generative Fine-Tuning for Multi-Faceted Efficient Adaptation of Large Models,” Chinmay Savadikar and Tianfu Wu, North Carolina State University; Xi Song, independent researcher. Presented at the International Conference on Machine Learning, Vancouver, Canada, July 13-19. icml.cc/virtual/2025/poster/45660

Provided by North Carolina State University

Citation: Approach improves how new skills are taught to large language models (2025, July 7), retrieved July 7, 2025 from https://techxplore.com/news/2025-07-Approach-skills-taught–large-language.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without written permission. The content is provided for information purposes only.


