Information extraction and XML tagging process
URL of the document
http://www.
New Article
Information Extraction System
XML Generator
XML data
Automat cally Extract ng and Tagg ng Bus ness Informat on for E-Bus ness Systems 0
Copyright ?© 2007, Idea Group Inc. Copying or distributing in print or electronic forms without written permission
of Idea Group Inc. is prohibited.
first be taught to parse the sentences, and then taught which words or phrases are
synonyms. Also, just as children learn to recognize which sentences in a paragraph
are the topic or key sentences, computers must also be taught how to recognize which
sentences in a text are paramount versus which are simply expository. Once these
key sentences are found, the computer programs will extract the vital information
from them for inclusion in templates or databases.
There are two major approaches to building information extraction systems: the
knowledge engineering approach and the automatic training approach (Appelt &
Israel, 1999). In the knowledge engineering approach, knowledge engineers employ
their own understanding of natural language, along with the domain expertise they
extract from subject matter experts, to build rules which allow computer programs
to extract information from text documents.
Pages:
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255