IST researchers classify Web searches

Wednesday, April 2, 2008

University Park, Pa. — Although millions of people use Web search engines, Penn State researchers show that — by using relatively simple methods — most queries submitted can be classified into one of three categories.

Jim Jansen, assistant professor in Penn State's College of Information Sciences and Technology, worked with IST undergraduate Danielle Booth, as well as Amanda Spink of the Queensland University of Technology, to find that Web search engine users are doing primarily informational, navigational or transactional searching.

Informational searching involves looking for a specific fact or topic; navigational searching seeks to locate a specific Web site; and transactional searching looks for information related to buying a particular product or service.

The research was the first published work of its kind done using actual searching data, with the aim of real-time classification. Researchers analyzed more than 1.5 million queries from hundreds of thousands of search engine users. Findings showed that about 80 percent of queries are informational and about 10 percent each are navigational and transactional.

Jansen and his colleagues arrived at those results by selecting random samples of records and analyzing query length, the order of the query in the session and the search results. These fields helped the team develop an algorithm that classified the searches with a 74-percent accuracy rate.
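
To make the idea of a computationally light classifier concrete, the following is a minimal Python sketch of a rule-based approach. It is not the researchers' algorithm: the cue lists, the thresholds and the names Query and classify are all hypothetical and chosen only for illustration. It uses two of the fields mentioned above (query length and the order of the query in the session) and leaves out the search results field entirely.

```python
from dataclasses import dataclass

# Hypothetical cue lists, chosen for illustration only; not taken from the study.
NAVIGATIONAL_CUES = ("www.", ".com", ".org", ".edu", "http")
TRANSACTIONAL_CUES = ("buy", "download", "order", "price", "coupon")


@dataclass
class Query:
    text: str
    position_in_session: int  # 1 means the first query of the session


def classify(query: Query) -> str:
    """Assign one of the three intent categories using simple assumed rules."""
    lowered = query.text.lower()
    terms = lowered.split()

    # Assumed rule: a short, URL-like query at the start of a session
    # is treated as an attempt to reach a specific site.
    looks_like_site = any(cue in lowered for cue in NAVIGATIONAL_CUES)
    if len(terms) <= 2 and looks_like_site and query.position_in_session == 1:
        return "navigational"

    # Assumed rule: a purchase-related or download-related term
    # signals transactional intent.
    if any(term in TRANSACTIONAL_CUES for term in terms):
        return "transactional"

    # Everything else falls into the largest category.
    return "informational"


if __name__ == "__main__":
    for sample in (
        Query("www.psu.edu", 1),
        Query("buy running shoes", 3),
        Query("history of jazz", 2),
    ):
        print(sample.text, "->", classify(sample))
```

Because informational is the fallback category, a sketch like this labels most queries informational by default, which is consistent with that category being the largest in the reported findings. Any accuracy figure for rules of this kind would have to be measured against labeled data.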

"Other results have classified comparatively much smaller sets of queries, usually manually," Jansen said. "This research aimed to classify queries automatically. Our findings have broad implications for search engines and e-commerce if they can classify the user intent of queries in real time. This is why we wanted a computationally undemanding algorithm. It proves the 80/20 rule that 80 percent of the cases can be achieved with these clear-cut methods."

The paper "Determining the informational, navigational and transactional intent of Web queries" will appear in the May 2008 issue of Information Processing & Management. The article is currently available online.

Jansen said he plans to continue this research using a more complex algorithm that will hopefully yield a 90-percent accuracy rate using similar searching criteria.
