App Store Logo

Still Life

Rally in the Valley excites fans

Rally in the Valley excites fans

November 6, 2009

Students capture fall at University Park

Students capture fall at University Park

November 5, 2009

Penn State Greeks strut their Broadway stuff

Penn State Greeks strut their Broadway stuff

November 1, 2009

THON 5K draws thousands

THON 5K draws thousands

November 1, 2009

Jazz masters wow audience

Jazz masters wow audience

October 28, 2009

Arboretum boardwalk and overlook chosen as 2010 senior class gift

Arboretum boardwalk and overlook chosen as 2010 senior class gift

October 27, 2009

Outreach mission brings jazz legends to high school musicians

Outreach mission brings jazz legends to high school musicians

October 27, 2009

Penn State Altoona celebrates 70th anniversary

Penn State Altoona celebrates 70th anniversary

October 27, 2009

Campus Night Out

Campus Night Out

October 22, 2009

Photography students play with light, shadow

Photography students play with light, shadow

October 20, 2009

Homecoming 2009

Homecoming 2009

October 17, 2009

Weather not a factor in Homecoming enthusiasm

Weather not a factor in Homecoming enthusiasm

October 16, 2009

Featured Video

2009 State of the University Address

2009 State of the University Address

Penn State Solar Decathlon 2009, part two: Natural Fusion goes to Washington

Penn State Solar Decathlon 2009, part two: Natural Fusion goes to Washington

Natural Fusion, Penn State's Solar Decathlon Team 2009

Natural Fusion, Penn State's Solar Decathlon Team 2009

Behind the scenes with the stadium concessions team

Behind the scenes with the stadium concessions team

Penn State's creamery, from the cow to the cone

Penn State's creamery, from the cow to the cone

Beaver Stadium Behind the Scenes and On the Air

Beaver Stadium Behind the Scenes and On the Air

Beaver Stadium Behind the Scenes: Video Board

Beaver Stadium Behind the Scenes: Video Board

Video gives students sneak peek at new campus location

Video gives students sneak peek at new campus location

Historic Old Main Bell removed from tower for restoration and display

Historic Old Main Bell removed from tower for restoration and display

Researchers teach computers how to name images by 'thinking'

Wednesday, November 1, 2006

University Park, Pa. -- Penn State researchers have "taught" computers how to interpret images using a vocabulary of up to 330 English words, so that a computer can describe a photograph of two polo players, for instance, as "sport," "people," "horse," "polo."

The new system, which can automatically annotate entire online collections of photographs as they are uploaded, means significant time-savings for the millions of Internet users who now manually tag or identify their images. It also facilitates retrieval of images through the use of search terms, said James Wang, associate professor in the Penn State College of Information Sciences and Technology, and one of the technology's two inventors.

The system is described in a paper, "Real-Time Computerized Annotation of Pictures," given at the recent ACM Multimedia 2006 conference in Santa Barbara, Calif., and authored by Jia Li, associate professor, Department of Statistics, and Wang. Penn State has filed a provisional patent application on the invention.

Major search engines currently rely upon uploaded tags of text to describe images. While many collections are annotated, many are not. The result: Images without text tags are not accessible to Web searchers. Because it provides text tags, the ALIPR system -- Automatic Linguistic Indexing of Pictures-Real Time -- makes those images visible to Web users.

ALIPR does this by analyzing the pixel content of images and comparing that against a stored knowledge base of the pixel content of tens of thousands of image examples. The computer then suggests a list of 15 possible annotations or words for the image.

"By inputting tens of thousands of images, we have trained computers to recognize certain objects and concepts and automatically annotate those new or unseen images," Wang said. "More than half the time, the computer's first tag out of the top 15 tags is correct."

In addition, for 98 percent of images tested, the system has provided at least one correct annotation in the top 15 selected words. The system, which completes the annotation in about 1.4 seconds, also can be applied to other domains such as art collections, satellite imaging and pathology slides, Wang said.

The new system builds on the authors' previous invention, ALIP, which also analyzes image content. But unlike ALIP which characterized images by incorporating computational-intensive spatial modeling, ALIPR characterizes images by modeling distributions of color and texture.

The researchers acknowledge computers trained with their algorithms have difficulties when photos are fuzzy or have low contrast or resolution; when objects are shown only partially; and when the angle used by the photographer presents an image in a way that is different than how the computer was trained on the object. Adding more training images as well as improving the training process may reduce these limitations -- future areas of research.

A demonstration of the ALIPR system can be found at http://www.alipr.com online.

In a companion paper also presented at the ACM conference, the researchers describe another of their systems-one that can use annotations in a retrieval process. This new system leverages annotations from different sources, human and computer. The researchers, who have built a prototype of the system, are working on testing it in real-world situations. That paper, "Toward Bridging the Annotation-Retrieval Gap in Image Search by a Generative Modeling Approach," was authored by Ritendra Datta and Weina Ge, Ph.D. students in computer science and engineering; Li; and Wang.

"Our approach aims at making all pictures on the Internet visible to the users of search engines," Wang said.

Research on both systems was supported by the National Science Foundation.

Contact
Newswires you might enjoy