Skip to main content

IBM Multimedia Analysis and Retrieval System

A desktop system for automatic indexing, classification and searching of digital images and video collections.

Date Posted: February 28, 2005

alphaworks tab navigation


 

Update: May 21, 2009
New version fixes some problems with processing large collections and appending or reprocessing collections, as well as an updated set of classifiers.

 

What is IBM Multimedia Analysis and Retrieval System?

IBM Multimedia Analysis and Retrieval System (IMARS) is a powerful system that can be used to automatically index, classify, and search large collections of digital images and videos. IMARS works by applying computer-based algorithms that analyze visual features of the images and videos, and subsequently allows them to be automatically organized and searched based on their visual content. In addition to search and browse features, IMARS also:

How does it work?

IMARS is comprised of the IMARS extraction tool and the IMARS search tool. The IMARS extraction tool takes a collection of images and videos from the user, and produces indexes based on mathematical analyses of each piece of content. These indexes organize the results of the analyses for the IMARS search tool.

IMARS Extraction tool

The IMARS extraction functionality is enabled by two main categories of computer algorithms that work together to bridge the “semantic gap” for images and videos:

IMARS Search tool

The IMARS search tool provides a graphical interface which allows the user to search, browse and navigate the collection based on the values produced by the analyses performed by the IMARS extraction tool.

The IMARS search tool presents the results of a query in different formats, among which the user can decide and switch according to his preferences. One consists of mosaic overview images that provide a simple at-a-glance summary of each of the main categories extracted. Another is a word-based representation.

The tool also allows drilling-down for more details, for example, to provide a sorted list of matches for each semantic category, or to provide the full set of extracted semantics for each image or video key-frame.

About the technology author(s)

This tool was developed by the IBM T. J. Watson Research Center Multimedia Research team: John R. Smith, Apostol (Paul) Natsev, Jelena Tešić Lexing Xie, and Rong Yan.

Trademarks






Related technologies