USA Business Watch – Insightful News on Economy, Finance, Politics & Industry
Electronics & Semiconductor

AI Vision Language Model provides video descriptions for blind users

By Bussiness Insights, July 1, 2025

Blind and low-vision users request descriptions of videos on YouDescribe, but only 7% of the requests have been fulfilled. AI is speeding up the process. Credit: Matthew Modoono/Northeastern University

For people who are blind or have low vision, audio descriptions of the action in movies and TV shows are essential to understanding what is happening. Networks and streaming services hire professionals to create audio descriptions, but that is not the case for the billions of videos on YouTube and TikTok.

That doesn’t mean people don’t want access to that content.

Using AI vision language models (VLMs), researchers at Northeastern University are making audio descriptions available for user-generated videos through a crowdsourced platform called YouDescribe. As in a library, blind and low-vision users can request descriptions of videos, then rate them and contribute to them.

“I understand that a 20-second dance video on TikTok might not get a professional description,” says Lana Do, who earned her master’s degree in computer science from Northeastern’s Silicon Valley campus in May. “But blind and low-vision people might want to see that dance video as well.”

In fact, the 2020 video for the song “Dynamite” by the Korean boy band BTS sits at the top of YouDescribe’s wish list, waiting to be described. The platform has 3,000 volunteer describers, but the wish list is so long that they can’t keep up. Only 7% of the requested videos on the wish list have an audio description, Do says.

Do works in the lab of Ilmi Yoon, a teaching professor of computer science on the Silicon Valley campus. Yoon joined the YouDescribe team in 2018 to develop the machine learning elements of the platform.

This year, Do added new features to speed up YouDescribe’s human-in-the-loop workflow. New VLM technology provides better-quality descriptions, and a new Infobot tool lets users request more information about a particular video frame. Low-vision users can even fix mistakes in the descriptions through a collaborative editing interface, Do says.

As a result, video content is described better and becomes available more quickly. AI-generated drafts reduce the burden on human describers, and users can easily engage in the process through ratings and comments, she says.

“They could say they were watching a documentary set in the woods and heard a flapping sound that was not described,” she says.

Do and her colleagues recently presented a paper at the Symposium on Human-Computer Interaction for Work in Amsterdam on AI’s potential to accelerate the development of audio descriptions. AI does an incredibly good job of describing human facial expressions and movements, Yoon says. In one video, the AI agent describes the steps a chef takes while making cheese rolls.

But there are some consistent weaknesses, she says. AI is not good at reading facial expressions in manga. And overall, humans are better at picking out the most important details in a scene, which is a key skill for creating useful descriptions.

“It’s very labor-intensive,” says Yoon.

Graduate students in her lab compare the AI’s first drafts with what human describers create.

“Then we measure the gap so that we can train our AI to do a better job,” she says. “Blind users don’t want to be distracted by too much verbal description. It’s an editorial art to verbalize the most important information in a concise way.”
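The article does not say how the lab measures that gap. As a purely illustrative sketch, the Python below compares an AI draft with a human-written description using simple word overlap. The function names, the two metrics and the example sentences are all assumptions made for illustration, not the YouDescribe team's method.

```python
import re


def tokens(text: str) -> set[str]:
    """Lowercase the text and return its set of distinct words."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def description_gap(ai_draft: str, human_reference: str) -> dict[str, float]:
    """Hypothetical gap metrics between an AI draft and a human description.

    coverage: share of the human describer's words that the AI draft also uses.
    extra:    share of the AI draft's words that the human describer left out,
              a rough proxy for unnecessary verbosity.
    """
    ai, human = tokens(ai_draft), tokens(human_reference)
    shared = ai & human
    coverage = len(shared) / len(human) if human else 1.0
    extra = len(ai - human) / len(ai) if ai else 0.0
    return {"coverage": coverage, "extra": extra}


# Invented example sentences, loosely based on the cooking video mentioned above.
gap = description_gap(
    "A chef rolls dough and adds cheese while smiling at the camera",
    "A chef rolls dough and adds cheese",
)
```

In this invented example the draft covers everything the human wrote but adds several extra words, which is the kind of wordiness Yoon says blind users find distracting. A real evaluation would likely need semantic comparison and human ratings rather than raw word overlap.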

YouDescribe was launched in 2013 by the San Francisco-based Smith-Kettlewell Eye Research Institute to train sighted volunteers to create audio descriptions. Focusing on YouTube and TikTok videos, the platform offers tutorials on recording and timing narration so that user-generated video content becomes accessible.

Provided by Northeastern University

This story has been republished courtesy of Northeastern Global News news.northeastern.edu.

Citation: AI Vision Language Model Provides Video Descriptions for Blind Users (2025, June 30). Retrieved July 1, 2025 from https://techxplore.com/news/2025-06-Ai-vision-language-video-descriptions.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without written permission. The content is provided for information purposes only.


