Search Engines: Diversity and Controversy

The Sixth Search Engine Meeting

April 9-10, 2001, Boston, Massachusetts

 

This series of meetings originated in Albuquerque, New Mexico in 1995. That inaugural meeting (part of an ASIDIC series) was transplanted to Bath in England (1996 and 1997) and then to Boston, Massachusetts (1998, 1999 and 2000). The Search Engines Meetings bring together commercial search engine developers, academics and corporate professionals to learn from each other. Infonortics, sponsor of meetings post-1995 with Ev Brenner, will continue the same success in Boston in 2001. Early booking is recommended; these meetings get crowded.

Book  now for the 2001 Search Engines Meeting (secure server)

See also details of how to exhibit at this meeting

 

The program

Last updated 6 April 2001

Sunday April 8, 2001: three half-day workshops


DAY I : Monday April 9, 2001

Search Engines Today

08.00-08.55 -- Breakfast

Day One Keynote - 09.05-09.50

David Seuss (Northern Light, Massachusetts)
Search Strategies for Large Enterprises

The Web is bringing profound changes in the electronic information available to individuals. Everyone uses Web search engines in their business, personal and investing lives. As a result, everyone has become his or her own research librarian and expects to seek and use electronic information on each and every task. Returns to individuals and to enterprises from having information resources and effective search capability are soaring, and search has become a central focus of many new business-critical applications.
Unfortunately, the evolving new class of search applications requires technology, library skills, content relationships and operating infrastructure that few companies have in their IT departments.

_________________________________________________________

Panel: Search Engine Developments and Trends since April 2000 - 09.50-10.50
Greg Notess (Montana State University-Bozeman) - http:/searchengineshowdown.com/
Chris Sherman (About.com, California) - http://websearch.about.com/
Avi Rappoport (Searchtools.com) - www.searchtools.com/

Two noted commentators on the search engine scene present their analyses of what we have achieved over the past twelve months, and where we are heading next

Networking break 10.50-11.15

Lou Rosenfeld (Argus Center for Information Architecture, Michigan) 11.15-11.50
We've Got it Backwards: an information architect's viewpoint for broadening searching and narrowing domains

The search engine community has made gargantuan efforts to provide improved access to broad, complicated collections of information that are used by highly heterogeneous audiences. These are difficult challenges and, not surprisingly, lead to less than ideal information retrieval performance.
The solution may not lie in using better search algorithms, but in re-examining the problem itself. First, users typically do not care about search engines and how they work; they just care about getting the information they want. Second, the more broadly we define search spaces, the more difficult it is for conventional search solutions to succeed. So we should apply the holistic finding approach to specialized problems where content is more homogeneous, less voluminous, and where users and their information needs are better understood.

Bob Travis (Altavista, California) 11.50-12.25
The Need Behind the Query: Web Search vs Classic Information Retrieval

Classic IR is predicated on users searching for needed information. But the intent behind a web search is often not informational -- it might be navigational (show me the url of the site I want to reach) or transactional (show me sites where I can perform a certain transaction, eg shop, download a file, or find a map). This presentation dissects some of the differences between classic IR and web search and shows how search engines evolved to deal with web-specific needs.

Chris Cardinal (Hummingbird, Canada) 12.25-1.00
The TREC and commercial search companies

Commercial participation in TREC has been almost non-existent. This year Hummingbird participated for the first time, submitting runs in the web and query tracks using the Fulcrum search engine. Hummingbird will talk about how it approached TREC and the value derived from this experience.

Lunch: 1.00-2.15

Mark Hansen and Elizabeth Shriver (Bell Labs, Lucent Technologies, New Jersey) 2.15-2.50
The Information Hidden When Web Searching

We propose techniques for both personalizing and improving the effectiveness of searching by focusing on the presentation and ranking of search results. We display groups of URLs, where the URLs in a group are "related". We determine the relationship by applying several types of model-based clustering to the key words of the query as well as the web pages visited during the search session and the timing of each session.
We trained a system with the search sessions from a number of months of Bell Labs proxy logs, and verified the usefulness of our clustering scheme with a few months of new search session data. In most cases, we report savings both in terms of the number of URLs visited as well as the time required for a search.

Panel: Content Architecture (Part I) 2.50-4.00

Chairman
: Susan Feldman, IDC, Massachusetts
Is search terrible, as the headlines proclaimed last fall, or is it the materials that search engines are crunching? What can we do to improve the situation? These sessions explore technologies that can improve search. We look at techniques that help structure unstructured text such as categorization, metatags, XML, entity extractions, and authority files. We examine some of the interaction techniques that will guide searchers to the information they are seeking. What is still a dream, and what is now available on the market? Each speaker will explore one facet of how to make search better.

Sue Feldman (IDC, Massachusetts) -- Introduction: What is content architecture?
Mark Bennett (Searchbutton, California) -- XML
Richard Boulderstone (Looksmart, California) -- metadata
Eytan Ruppin (Zapper Technologies, Israel) -- putting search and categorization in context

Networking Break: 4.00-4.25

Panel: Content Architecture (Part II) 4.25-5.45

Daniel Lulich, (Rulespace, Oregon) -- automatic classification and its visualization
Tom Wilde (FAST, MA) -- scaling and content aggregation platforms
Horst Koerner (Seruba, Germany) -- cross language ontologies and topic maps - knowledge engineering
Ashok Chandra, (Verity, California) -- transforming the web into social networks

5.45 Conference Mixer Cocktail sponsored by Northern Light


DAY II : Tuesday April 10, 2001

The New Frontier

07.30-08.25 -- Breakfast

Day Two Keynote - 8.30-9.10

Eric Brewer (Inktomi Corporation, California)
The Next Evolution: A New Infrastructure for Connecting Content with Users

Dr Eric Brewer, co-founder and chief scientist of Inktomi, will discuss how an emerging infrastructure for content providers connecting to search engines will enable users to access higher quality results when searching and browsing the Web. He will also examine the new business and technical arrangements that are being developed to enable content providers to make their best and most current content readily available to consumers.

________________________________________________________________________________________________

Bernard Normier (LexiQuest, New York) 9.10-9.45
From String Indexing to Semantic Indexing. The example of Lexisez, an NLP-based question / answering system

Question Answering systems are a new application in the general framework of information retrieval. Several strategies are possible, from question/answers databases, to answer extraction from plain text. On the search engine part of the system, the indexing is not controlled, and the query is expanded to a Boolean query using NLP analysis of the query and semantic expansion. On the question/answer database, the index is the result of a linguistic analysis. Two levels of indexing are used: classical part of speech level, and a more sophisticated semantic level. At the semantic level, the indexes are no more words, but "concepts", that is words are disambiguated and a given meaning of the word is selected. When not possible, the sentence is over-indexed by several meanings. At this level too, the index is not a flat list of descriptors, but a structured representation of the basic form. A sophisticated matching algorithm is then used, which improves in large proportions both precision and recall.

David A Evans (Clairvoyance, Pennsylvania) 9.45-10.20
The 'Affect' Dimensions of Search

Search technology and functionality have traditionally focused on the problem of matching queries to documents, principally by comparing the content of the query to the content of documents. Many internet-based applications do not involve literal queries but do require a 'match' between the content of a text and some other information object. An example of such an application is the matching of a banner ad or offer to a user who is browsing through a web page with certain content; or the suggestion to a user of a book, or video, or CD based on the content or description of another book, etc, that the user may be reading. Especially in such contexts, the superficial similarity of texts may be a poor guide to appropriate "matches". Another dimension -- affect (or emotion) -- provides a much better measure of the features that matter in making good suggestions. This talk will describe one approach to the identification, measurement, and use of the multiple dimensions of affect in texts to make internet search and related applications more effective.

Networking break: 10.20-10.40

Stephen E. Arnold (AIT, Kentucky) 10.40-11.15
Vertical Search Engines

Automated systems for indexing the Web are useful. The surge in topic or market specific Web sites has created a need for more restricted collections of Web pointers and indexes to high value content closely related to the focus of a Web site. The vertical collection functionality of four search engines will be described and illustrated. The engines included in the presentation are:

· Zbase, a new automated service that builds a Yahoo! style directory
· EoExchange, an engine that has been tuned to create vertical content collections
· AnswerSleuth, a combination of human editorial and automated content indexing
· Mohomine, an engine that creates vertical collections from Web sources as well as selected content created by employees of an organization.

The "best fit" for each of these vertical search engine services will be identified. General pricing parameters for each service indicate the direction that vertical search services are moving in the internet / intranet markets.

John Snyder, Webtop.com, Cambridge, England 11.15-11.50
Beyond the Keyboard: Speech Enabled Search

Speech recognition is increasingly entering our consumer lives as a call direction technology and dictation tool. How can the latest technology be harnessed for speaking real natural language searches? Dividing the Web into zones of content assists the delivery of untrained "speaking search engines" to a new generation of mobile web users.

Matthew Koll (Virginia) 11.50-12.25
Massively Distributed Search: Still a Good New Idea

The invisible web continues to elude the spiders. Coverage and precision of web searches are not as good as we would like. Timeliness matters. Cost of service matters. Business imperatives impact the supply side. P2P gives a push -- but is it just another push? When will massively distributed search happen?

Lunch: 12.25-1.55

Craig Silverstein, (Google, California) 2.00-2.35
Beyond Search Engines: The Future History of Information Retrieval

The pace of advanced search technology is accelerating. What does 'search' mean in this brave new world? This presentation discusses Google's view of one way in which search could develop. With search sites becoming the destination of first result for Internet users, there will be more emphasis on quality, from 'search' to 'find'. User interfaces must be able to simplify query inputs, synthesize results, and return only the most relevant information. The presentation will also provide a perspective on what search features will likely have the biggest impact on the search industry, and why.

Gregory T Grefenstette (Xerox Research Centre Europe, France) 2.35-3.10
Multilinguality in the Web

The Web is and will remain multilingual. We will show how language is automatically identifiable by search engines, how one can estimate the volume of a language accessible through a search engine, and what language-specific resources are needed to do simple, but good indexing.

Networking break 3.10-3.30

Panel: The Secularization of Search - 3.30-4.45

Chairman: David A. Evans (Clairvoyance, Pennsylvania)
William R. Hersh (Oregon Health Sciences University (Oregon)
Joshua Arai and Keiichi Kitagawa (Justsystem Corporation, Japan)

Before 1992, there were probably fewer than one million people worldwide -- mostly professionals in library or information science, or scientists and physicians --who had ever conducted an automated text search. By 1995, there were probably 100 million -- "ordinary" people who had searched the internet via their browsers and a search engine. With search now unambiguously in the hands of non-professionals and with the use of search now well integrated into regular business and personal use of the internet, we might well ask whether a "received view" of search is emerging. This panel will explore this question by examining several cases from around the world in which search as a discipline is being taught in schools. In each case, we will attempt to answer a series of questions: Why is search perceived to be important? What aspects of search are in focus? How is the methodology taught? What is the expected outcome and how is it measured?

 

END OF 2001 MEETING

The 2001 Search Engines Meeting
Diversity and Controversy
Boston, Massachusetts, April 9-10, 2001



Book  now for the 2001 Search Engines Meeting (secure server)


General Meeting Information

Fee

The meeting fee is US $695 and is payable in advance. The fee includes the two day meeting, two breakfasts, two lunches, the meeting cocktail, and all breaks. Please note meeting fees are payable in respect of all reservations for this event unless a cancellation has been received under the terms below. For those working at an academic institution, there is a limited number of places at a special academic rate of US $495. The fee for the half-day pre-conference workshops on April 8 is $290 per workshop. The workshops are held in parallel, so you will only be able to attend one per person.

Multiple bookings

Organisations will receive a discount of 15% on all bookings if more than two bookings from the same organisation are received. Bookings for multiple attendees must be received at the same time and be invoiced on the same invoice. This discount is not in addition to the academic or early booking rates (standard rate only applies).

Cancellations and substitutions

Notice of cancellations must be received at least ten working days prior to the event (ie, by March 26); a cancellation, refund and handling fee of $80 will be charged. Cancellations notified after this date cannot be accepted for refund, and the full meeting fee will stand. Substitutions may be made at any time.

Hotel accommodation

Hotel accommodation is not included in the meeting price. The meeting hotel, the prestigious Fairmont Copley Plaza, has a limited block of rooms available for attendees at this meeting. The special meeting rate is US $219.00 (excluding tax). Those booking should mention "Search Engine Meeting" to be given this special rate. Once this room block is fully occupied, normal hotel rates may apply.

Meeting registration and registration times

Registrations may be made at the meeting registration desk at the Fairmont Copley Plaza on Sunday April 8 from 4 pm until 6.30 pm, or on Monday April 9 from 8 am. Workshop registrations are Sunday April 8 from midday.

The meeting sessions begin on Monday April 9 at 8.45 am. The meeting ends Tuesday April 10 at approximately 5.15 pm.

The meeting mixer cocktail will take place the evening of Monday April 9 at approximately 6 pm.

Meeting venue

The meeting is at the Fairmont Copley Plaza Hotel, 138 St James Avenue, Boston, MA 02116, USA.
Telephone is +1 617 267 5300. Fax is +1 617 375 9648. Toll-free reservation phone within USA is: 1-800-527-4727.


 Book  now for the 2001 Search Engines Meeting (secure server)

contact@infonortics.com
Our postal address: 15 Market Place, Tetbury, Glos. GL8 8DD, UK
Our telephone number : + 44 1666 505 772
US fax number: 401 696 5627
ROW fax number : +44 1666 505 774

Return to Infonortics home page