Software to generate description of videos
By
siliconindia news bureau
Maryland: According to the researchers from the University of Maryland and the University of Pennsylvania, a computer program has been developed to understand the video footage and to describe it in words. The software reports the action rather than the dialogue and functions a sort of video-to-text system, reports the Ethiopian Review.

At the IEEE Computer Society's Conference on Computer Vision and Pattern Recognition, in Miami Beach, the researchers described the software's ability to draw links among human actions and to understand causal relationships. The software essentially produces a flowchart full of 'AND and OR' junctions, which account for all the possibilities or story lines, that it has experienced. Once the system settles on the most likely story line for a play, it then gives a linguistic description of the actions in the video. Abhinav Gupta, a Doctoral candidate at the University of Maryland who was a member of the research team, notes that the system's phrasings sound is like those of Frankenstein's monster. It uses only nouns, verbs and prepositions.
To illustrate how the system works, the researchers showed how it analyzed footage of Major League Baseball games. During a learning period, the system watched games that had already been tagged with human-generated captions describing who the players were and what they were doing (pitcher: pitch; batter: no swing; batter: swing-miss; batter: swing-hit-run; fielder: run-catch-throw). Like the average human fan, the system learned the mechanics of the game by watching. It created a set of hypotheses about the relationships among the actions using the narratives and measured the correctness of each using the training videos.
Gupta admits that it is quite likely the system would foul up a description of a play that had not previously appeared on videos it had analyzed. But he notes that such an instance would immediately push the system back into learning mode. It would add that new wrinkle - say, a wild pitch or a balk - to its flowchart and instantly recognize it the next time it happens.
Describing the significance of the system, Gupta says the system would also be useful for video surveillance of places like airports and traffic intersections. Incidents can be tagged for later review by law enforcement as "unusual/suspicious" and brought to the attention of security personnel, all without relying on humans to keep their eyes on video screens and remain attentive.

At the IEEE Computer Society's Conference on Computer Vision and Pattern Recognition, in Miami Beach, the researchers described the software's ability to draw links among human actions and to understand causal relationships. The software essentially produces a flowchart full of 'AND and OR' junctions, which account for all the possibilities or story lines, that it has experienced. Once the system settles on the most likely story line for a play, it then gives a linguistic description of the actions in the video. Abhinav Gupta, a Doctoral candidate at the University of Maryland who was a member of the research team, notes that the system's phrasings sound is like those of Frankenstein's monster. It uses only nouns, verbs and prepositions.
To illustrate how the system works, the researchers showed how it analyzed footage of Major League Baseball games. During a learning period, the system watched games that had already been tagged with human-generated captions describing who the players were and what they were doing (pitcher: pitch; batter: no swing; batter: swing-miss; batter: swing-hit-run; fielder: run-catch-throw). Like the average human fan, the system learned the mechanics of the game by watching. It created a set of hypotheses about the relationships among the actions using the narratives and measured the correctness of each using the training videos.
Gupta admits that it is quite likely the system would foul up a description of a play that had not previously appeared on videos it had analyzed. But he notes that such an instance would immediately push the system back into learning mode. It would add that new wrinkle - say, a wild pitch or a balk - to its flowchart and instantly recognize it the next time it happens.
Describing the significance of the system, Gupta says the system would also be useful for video surveillance of places like airports and traffic intersections. Incidents can be tagged for later review by law enforcement as "unusual/suspicious" and brought to the attention of security personnel, all without relying on humans to keep their eyes on video screens and remain attentive.
Reader's comments(3)
1: i wish i could also do big research like that
on latest technology. good work done.
congrats!
Posted by: deepak jain - 27 Jul, 2009
2: Good gentleman goahed with more and more.
Posted by: prem kumar - 27 Jul, 2009
3: Feeling proud of the Indian researcher
AAbhinav Gupta for such an intelligent
software.
Posted by: Samuel - 27 Jul, 2009
Disclaimer
Messages posted on this Web site under the `Comments' area are solely the opinions of those who have posted them and do not necessarily reflect the opinions of Infoconnect Web Technologies India Pvt Ltd or its site www.siliconindia.com. Gossip, mud slinging and malicious attacks on individuals and organizations are strictly prohibited. Infoconnect Web Technologies India Pvt Ltd can not be held responsible for errors or omissions in content, nor for the authenticity of the user/company name or email addresses associated with posted messages. Infoconnect Web Technologies India Pvt Ltd reserves the right to edit or remove messages containing inappropriate language or any other material that could be construed as libelous, potentially libelous,
or otherwise offensive or inappropriate.Infoconnect Web Technologies India Pvt Ltd do not endorse the products and services or any other offerings mentioned in these messages.
- Windows mobile OS 7 might not have multitasking
- New video games pop up for Winter Olympics
- Satyam plans to expand in Latin America
- London Mayor shuts down business offices in India
- 58 percent of laid-off employees are back in jobs
- Wireless networks can help forecast calamities
- Digital meters of autos can be tampered
- Indian IT firms hiring local talent in U.S.
- China shuts hackers training website
- Tata Infrastructure Fund tops MF scheme last week
- 14 year old NRI is Australia's brainiest student
- Auto driver turns web entrepreneur
- Students make electric appliances operable via sms
- Change management: BSNL employees to PM
- An IITian lights 10,000 homes in rural Bihar
- Indian entrepreneurs sign deals like housewives
- Indians are over committed, reluctant to differ
- Mahindra Satyam recalls 2000 virtual pool staff
- Biggies back with bonuses, startups suffer
- Over 15,000 NRI resumes lying with Indian HR firms






