Text Algorithms

简介:
手边有这本书很方便。这是第一本书,系统地介绍了有关文本的实际和理论算法问题的工作结果,直到字符串学的最新发展。这本关于文本处理的算法和数据结构设计的急需书籍强调理论基础和实际应用。它既可以作为算法设计课程的教科书,尤其是与文本处理相关的课程,也可以作为计算机科学专业人员的参考。这项工作采用了一种独特的方法,比其他更一般的书籍更深入地探讨其主题。它包含经典算法和该主题的最新研究成果。这本书是第一个包含各种文本算法集合的文本,其中许多算法都是新的,并且是第一次出现在这里。其他算法虽然以声誉闻名,但从未在期刊文献中发表过。两个如此重要的算法是Karp,Miller和Rosenberg的算法以及Weiner的算法。在这里,他们一起展示了第一次。本书的核心是后缀树和子词图的材料,这些数据结构的应用,时空最佳字符串匹配的新方法以及文本压缩。还涵盖了文本问题的基本并行算法。所有这些算法的应用都针对涉及数据检索系统,自然语言处理,基因组研究,数据压缩软件和文本处理工具的问题。从理论的角度来看。这本书是开发高效算法的典范金矿,为创建处理序列的实用软件提供了必要的基础。作者方法的关键点是开发一种用于呈现文本算法的方法,以便可以完全理解它们。在整个过程中,本书强调了算法的效率,认为其有用性的本质取决于它。这是特别重要的,因为这里描述的算法将在 “大科学” 领域中找到应用,如分子序列分析,其中数据的爆炸性增长已经给当前一代软件带来了问题。最后,凭借其理论背景的发展,该书可以被视为分析和生产文本处理算法的数学基础。
英文简介:
This much-needed book on the design of algorithms and data structures for text processing emphasizes both theoretical foundations and practical applications. It is intended to serve both as a textbook for courses on algorithm design, especially those related to text processing, and as a reference for computer science professionals. The work takes a unique approach, one that goes more deeply into its topic than other more general books. It contains both classical algorithms and recent results of research on the subject. The book is the first text to contain a collection of a wide range of text algorithms, many of them quite new and appearing here for the first time. Other algorithms, while known by reputation, have never been published in the journal literature. Two such important algorithms are those of Karp, Miller and Rosenberg, and that of Weiner. Here they are presented together for the fist time.
The core of the book is the material on suffix trees and subword graphs, applications of these data structures, new approaches to time-space optimal string-matching, and text compression. Also covered are basic parallel algorithms for text problems. Applications of all these algorithms are given for problems involving data retrieval systems, treatment of natural languages, investigation of genomes, data compression software, and text processing tools.
From the theoretical point of view, the book is a goldmine of paradigms for the development of efficient algorithms, providing the necessary foundation to creating practical software dealing with sequences. A crucial point in the authors' approach is the development of a methodology for presenting text algorithms so they can be fully understood. Throughout, the book emphasizes the efficiency of algorithms, holding that the essence of their usefulness depends on it. This is especially important since the algorithms described here will find application in "Big Science" areas like molecular sequence analysis where the explosive growth of data has caused problems for the current generation of software. Finally, with its development of theoretical background, the book can be considered as a mathematical foundation for the analysis and production of text processing algorithms.
- 书名
- Text Algorithms
- 译名
- 文本算法
- 语言
- 英语
- 年份
- 1997
- 页数
- 396页
- 大小
- 2.39 MB
- 标签
- 算法
- 下载
Text Algorithms.pdf
- 密码
- 65536
最后更新:2025-04-12 23:57:48