Wir verwenden Cookies und Analyse-Tools, um die Nutzerfreundlichkeit der Internet-Seite zu verbessern und für Marketingzwecke. Wenn Sie fortfahren, diese Seite zu verwenden, nehmen wir an, dass Sie damit einverstanden sind. Zur Datenschutzerklärung.
Unsupervised Learning of Natural Languages
Details
Many types of sequential symbolic data possess
structure that is (i) hierarchical, and (ii) context-
sensitive. Natural-language text or transcribed
speech are prime examples of such data: a corpus of
language consists of sentences, defined over a
finite lexicon of symbols such as words. Linguists
traditionally analyze the sentences into recursively
structured phrasal constituents; at the same time, a
distributional analysis of partially aligned
sentential contexts reveals in the lexicon clusters
that are said to correspond to various syntactic
categories (such as nouns or verbs). Such structure,
however, is not limited to the natural languages:
recurring motifs are found, on a level of
description that is common to all life on earth, in
the base sequences of DNA that constitute the
genome. In this book, I address the problem of
extracting patterns from natural sequential data and
inferring the underlying rules that govern their
production. This is relevant to both linguistics and
bioinformatics, two fields that investigate
sequential symbolic data that are hierarchical and
context sensitive.
Autorentext
Dr. Zach SOLAN is a researcher in the field of Natural Language Processing (NLP). He holds a PhD in physics from Tel Aviv University. This book is based on Dr. Solan's interdisciplinary dissertation, supervised by Prof. Shimon Edelman from Cornell University, Prof Eytan Ruppin and David Horn from Tel-Aviv University.
Klappentext
Many types of sequential symbolic data possess structure that is (i) hierarchical, and (ii) context-sensitive. Natural-language text or transcribed speech are prime examples of such data: a corpus of language consists of sentences, defined over a finite lexicon of symbols such as words. Linguists traditionally analyze the sentences into recursively structured phrasal constituents; at the same time, a distributional analysis of partially aligned sentential contexts reveals in the lexicon clusters that are said to correspond to various syntactic categories (such as nouns or verbs). Such structure, however, is not limited to the natural languages: recurring motifs are found, on a level of description that is common to all life on earth, in the base sequences of DNA that constitute the genome. In this book, I address the problem of extracting patterns from natural sequential data and inferring the underlying rules that govern their production. This is relevant to both linguistics and bioinformatics, two fields that investigate sequential symbolic data that are hierarchical and context sensitive.
Weitere Informationen
- Allgemeine Informationen
- GTIN 09783639145083
- Sprache Englisch
- Größe H220mm x B220mm
- Jahr 2009
- EAN 9783639145083
- Format Kartonierter Einband (Kt)
- ISBN 978-3-639-14508-3
- Titel Unsupervised Learning of Natural Languages
- Autor Zach Solan
- Untertitel Revealing the structures behind human languages, proteins, DNA and other sequential data
- Herausgeber VDM Verlag
- Anzahl Seiten 128
- Genre Wirtschaft