BetaNews.Com

Microsoft Research makes great strides in automatic image captioning

Published: November 20, 2014, 6:53pm CET by Mark Wilson

Microsoft Research is home to all manner of interesting projects and experiments, and one of the latest that the team is keen to share news about is automatic image captioning. There are no prizes for guessing what this is -- it's very much what it says on the tin -- and the technology has now reached a stage where the automatically generated captions for an image are at least as good as those thought of by people.

A team of just 12 worked on the project, and the results are pretty impressive. The system analyzes an image and identifies its key components. Once objects and their characteristics have been determined, they are evaluated in relation to each other to help decide what is important and what can be ignored.

A series of possible image descriptions is then created, and these are ranked according to how much sense they make in natural language, and whether importance has been attributed to the correct components. A post on the Machine Learning Blog gives you the opportunity to see if you can determine which image captions were written by hand and which came from a machine. You might be surprised at just how difficult it is to distinguish between the two!
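The ranking step described above can be illustrated with a toy sketch. This is not Microsoft Research's implementation: the `Detection` class, the bigram-based `fluency` score, the confidence-weighted `coverage` score and the `weight` parameter are all assumptions made up for illustration. It simply combines "does this read like natural language?" with "does it mention the components that matter?" and sorts the candidates accordingly.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """A word the (hypothetical) image analyzer associated with the picture."""
    word: str
    confidence: float  # assumed detector score in [0, 1]


def coverage(caption, detections):
    """Share of total detector confidence covered by words in the caption."""
    words = set(caption.lower().split())
    total = sum(d.confidence for d in detections)
    if total == 0:
        return 0.0
    hit = sum(d.confidence for d in detections if d.word in words)
    return hit / total


def fluency(caption, common_bigrams):
    """Toy stand-in for a language model: share of known adjacent word pairs."""
    words = caption.lower().split()
    pairs = list(zip(words, words[1:]))
    if not pairs:
        return 0.0
    return sum(1 for p in pairs if p in common_bigrams) / len(pairs)


def rank_captions(candidates, detections, common_bigrams, weight=0.5):
    """Sort candidate captions from best to worst by a blended score."""
    def score(caption):
        return (weight * fluency(caption, common_bigrams)
                + (1 - weight) * coverage(caption, detections))
    return sorted(candidates, key=score, reverse=True)


# Illustrative data only; none of these values come from the article.
detections = [Detection("dog", 0.9), Detection("frisbee", 0.8), Detection("grass", 0.3)]
common_bigrams = {("a", "dog"), ("dog", "catching"), ("catching", "a"), ("a", "frisbee")}
candidates = ["grass dog a", "a dog catching a frisbee"]
ranked = rank_captions(candidates, detections, common_bigrams)
```

Here the fluent caption that mentions the two most confident detections ends up first in `ranked`. A real system would presumably use a trained language model and a learned notion of importance rather than these hand-rolled scores.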

There are lots of potential applications for this technology, but two of the most obvious are voice control and accessibility options. At the moment it is possible to conduct web searches for images by controlling a computer with your voice, but this is reliant on images having been appropriately tagged and captioned already. If software can be used to automatically caption images on the fly, voice-activated image searches can cast a wider net.

But perhaps a more interesting use of the technology would be to make computers in general, and the internet specifically, more accessible to people with sight problems. Text-to-speech is great for hearing what has been written on a particular page, but what about the accompanying imagery? With automatic captioning, pictures that have been added to an article could be described by text-to-speech software regardless of whether a descriptive caption had been added by the author.
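As a rough sketch of how that fallback might look, the snippet below walks a page with Python's standard `html.parser` and builds the list of strings a screen reader could speak. The `captioner` callable is a hypothetical stand-in for an automatic captioning service, and the "Image: " prefix is an arbitrary choice; neither comes from the article.

```python
from html.parser import HTMLParser


class ImageDescriber(HTMLParser):
    """Collects page text, plus a description for every <img> element."""

    def __init__(self, captioner):
        super().__init__()
        self.captioner = captioner  # hypothetical: maps an image URL to a caption
        self.spoken = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attr = dict(attrs)
        # Prefer the author's alt text; fall back to a generated caption.
        alt = (attr.get("alt") or "").strip()
        if not alt:
            alt = self.captioner(attr.get("src") or "")
        self.spoken.append("Image: " + alt)

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.spoken.append(text)


def text_for_speech(html, captioner):
    """Return the strings to read aloud, in document order."""
    parser = ImageDescriber(captioner)
    parser.feed(html)
    parser.close()
    return parser.spoken


# Illustrative page and captioner only.
page = '<p>Hello</p><img src="a.jpg"><img src="b.jpg" alt="A cat">'
spoken = text_for_speech(page, lambda src: "caption for " + src)
```

The author's own `alt` text is kept when present, so generated captions only fill the gaps.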

Photo credit: Zmiter / Shutterstock
