title and date, followed by each company's financial details, such as company name,
earnings, revenue information, and so forth. A sample input file is shown in Figure
8. Figure 9 shows the user interface page while Figure 10 shows results that the
XML processor sent back to the browser in XML format.
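To make the shape of such a result concrete, here is a minimal Python sketch of how extracted records might be serialized to XML. The element names (`report`, `title`, `date`, `company`, `name`, `earnings`, `revenue`), the function `records_to_xml`, and the sample values are all hypothetical; the actual schema returned by the XML processor is the one shown in Figure 10 and may differ.

```python
import xml.etree.ElementTree as ET

def records_to_xml(title, date, companies):
    """Serialize a title, a date, and a list of company records to an XML string.

    All tag names are illustrative assumptions, not the system's real schema.
    """
    root = ET.Element("report")
    ET.SubElement(root, "title").text = title
    ET.SubElement(root, "date").text = date
    for company in companies:
        node = ET.SubElement(root, "company")
        # One child element per extracted financial detail.
        for field in ("name", "earnings", "revenue"):
            ET.SubElement(node, field).text = company.get(field, "")
    return ET.tostring(root, encoding="unicode")

# Placeholder values for illustration only.
xml_text = records_to_xml(
    "Sample earnings report",
    "2005-01-01",
    [{"name": "ExampleCo", "earnings": "1.2", "revenue": "10.5"}],
)
print(xml_text)
```

Keeping one `company` element per firm mirrors the input layout described above, where the title and date come first and each company's details follow.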
Future Trends
Information extraction from natural language will become increasingly important
as the number of documents on the Web continues to explode, because that growth
makes timely manual processing ever less feasible as a means of seeking competitive
advantage in business. Automated extraction will nevertheless remain a difficult
task and, in fact, one that cannot be achieved perfectly.
In addition to the manual, pattern-based rule-creation techniques discussed in this
article, some researchers are using machine learning algorithms to teach computers
to recognize the meanings of new texts from the known meanings of texts that humans
have already deciphered. We plan to hybridize our own technique with machine
learning algorithms to see whether they incrementally improve the recall and
precision of FIRST.
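For readers unfamiliar with the two measures, the following Python sketch shows the standard definitions: precision is the fraction of extracted facts that are correct, and recall is the fraction of correct facts that were extracted. The function name `precision_recall`, the triple representation, and the sample data are assumptions made for illustration; they are not results or code from FIRST.

```python
def precision_recall(extracted, gold):
    """Return (precision, recall) of extracted facts against a gold standard."""
    extracted, gold = set(extracted), set(gold)
    true_positives = len(extracted & gold)
    # Guard against division by zero when either set is empty.
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall

# Hypothetical (company, field, value) triples; not real data.
gold = {
    ("ExampleCo", "revenue", "10.5"),
    ("ExampleCo", "earnings", "1.2"),
    ("SampleInc", "revenue", "5.0"),
    ("SampleInc", "earnings", "0.4"),
}
extracted = {
    ("ExampleCo", "revenue", "10.5"),
    ("ExampleCo", "earnings", "1.9"),  # wrong value, counts against precision
    ("SampleInc", "revenue", "5.0"),
}

p, r = precision_recall(extracted, gold)
print(f"precision={p:.2f} recall={r:.2f}")
```

In this toy case two of the three extracted facts are correct and two of the four gold facts are found, so a hybrid technique would count as an improvement only if it raised one measure without unduly lowering the other.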