This page last changed 04 July 2007

Sitges, Spain

Tutorial: Wednesday 24 October 2007 (afternoon)

Text Mining: Placebo or Scopolamine for Patent Analysis,
Pharma Research and Business Intelligence?

Stephen E. Arnold
President
Arnold Information Technology

A recent report pegged the text mining market at a healthy $1.8 billion in 2007. An astounding figure when you calculate that the five publicly-traded "leaders" in text mining generate less than $300 million per year from licences directly related to unstructured text.

The interest in the use of algorithmic processes to extract actionable information and facts from terabytes of unstructured content continues to rise. Can algorithms alone identify the names of people, places and products without error? Can these systems handle content in a single language or across multiple languages? Can text mining systems process double byte unstructured content such as Chinese and Korean laboratory reports? Are the breakthroughs in CPUs up to tasks imposed by iterative algorithms cycling through email, Word documents, PDF files, PowerPoints and semi-structured Web content?

This tutorial explores the reality of today's leading text mining systems. The systems that deliver useful results are often quite different from the Alice-in-Wonderland descriptions in vendors' marketing collateral.

What you will learn

What the tutorial covers

The tutorial is divided into four 45 minute segments:

Part 1: Background and Benefits of Text Mining

Part 2: Profiles of 20 Vendors. For each vendor, you will learn:

Part 3: The Pitfalls and How to Avoid Them

Part 4: Getting a Fast Start

Tutorial Format

The structure of the tutorial will be 30-minute lectures by the presenter. Each lecture will be followed by a discussion period between Mr Arnold and those in attendance.

Tutorial Materials

The materials used in the session and referenced will be made available to registered attendees for a period of 10 days following the tutorial at an ftp site. Registered attendees may use these
materials within their organisations, but any other use of the data or the information requires the written permission of the presenter.

Who should attend

This tutorial will provide actionable information to:

Companies Profiled

Companies in blue are treated in more detail in this tutorial.

Tutorial Logistics

The tutorial will take place at the ICIC meeting hotel, the Hotel Meliá Sitges. The tutorial starts at 14:00 on Wednesday 24 October and will end around 17:30. Prior registration is required. Note that the tutorial is not included in the registration fee for the ICIC meeting. A separate workshop registration fee of €215 per person will be payable.


general event details

tutorial order form