电子书 > 科学 > 一般 > W. Bruce Croft & John Lafferty: Language Modeling for Information Retrieval (PDF)

W. Bruce Croft & John Lafferty
Language Modeling for Information Retrieval [PDF ebook]

支持

的封面 W. Bruce Croft & John Lafferty: Language Modeling for Information Retrieval (PDF)

A statisticallanguage model, or more simply a language model, is a prob- abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthesize artificial text, and discriminative techniques to classify text into predefined cat- egories. The first statisticallanguage modeler was Claude Shannon. In exploring the application of his newly founded theory of information to human language, Shannon considered language as a statistical source, and measured how we H simple n-gram models predicted or, equivalently, compressed natural text. To do this, he estimated the entropy of English through experiments with human subjects, and also estimated the cross-entropy of the n-gram models on natural 1 text. The ability of language models to be quantitatively evaluated in tbis way is one of their important virtues. Of course, estimating the true entropy of language is an elusive goal, aiming at many moving targets, since language is so varied and evolves so quickly. Yet fifty years after Shannon’s study, language models remain, by all measures, far from the Shannon entropy li Init in terms of their predictive power. However, tbis has not kept them from being useful for a variety of text processing tasks, and moreover can be viewed as encouragement that there is still great room for improvement in statisticallanguage modeling.

€114.91

购买此电子书可免费获赠一本！

语言英语 ● 格式 PDF ● ISBN 9789401701716 ● 编辑 W. Bruce Croft & John Lafferty ● 出版者 Springer Netherlands ● 发布时间 2013 ● 下载 3 时 ● 货币 EUR ● ID 4707306 ● 复制保护 Adobe DRM

需要具备DRM功能的电子书阅读器

来自同一作者的更多电子书 / 编辑

W. Bruce Croft John Lafferty

89,533 此类电子书

的封面 Ronald Bruce St John: Libya

Ronald Bruce St John: Libya

Retaining the conceptual framework of the first edition through emphasis on the dual themes of continuity and change, the second edition of Libya is revised and updated to include discussion of key d …

的封面 Devendra Panigrahi: India''s Partition

Devendra Panigrahi: India”s Partition

Based on new source material available in both England and India, India”s Partition examines the partition in the context of the retreat of the British Empire. The freeing of India from British rule …

的封面 Jim Mann: Beijing Jeep

Jim Mann: Beijing Jeep

When China opened its doors to the West in the late 1970s, Western businesses jumped at the chance to sell their products to the most populous nation in the world. Boardrooms everywhere buzzed with e …

的封面 Rami Ginat: Egypt''s Incomplete Revolution

Rami Ginat: Egypt”s Incomplete Revolution

The importance of Lutfi al-Khuli and the intellectual circle associated with the Nasserist regime is examined here. Rami Ginat looks at al-Khuli”s contribution to the short-lived yet formidable succ …

的封面 Sumi Madhok: Rethinking Agency

Sumi Madhok: Rethinking Agency

This book proposes a new theoretical framework for agency thinking by examining the ethical, discursive and practical engagements of a group of women development workers in north-west India with deve …

的封面 H.H. RAJA RAJGAN: Travels China, Japan & Java

H.H. RAJA RAJGAN: Travels China, Japan & Java

First published in 2006. This unique perspective on China, Japan, and Java is written by His Highness the Indian Raja-I-Rajgan Jagatjit Singh of Kapurthala as a record of his brief visit to the Far E …

的封面 Peter Olsthoorn: Military Ethics and Virtues

Peter Olsthoorn: Military Ethics and Virtues

This book examines the role of military virtues in today”s armed forces. Although long-established military virtues, such as honor, courage and loyalty, are what most armed forces today still use as …

的封面 Freda Matchett: Krsna: Lord or Avatara?

Freda Matchett: Krsna: Lord or Avatara?

This is a study of three Sanskrit texts, the Harivamsa, the Visnupurana, and the Bhagavatabelonging to the puranic genre, the chief source of knowledge of the origins of popular Hinduism. It treats t …

的封面 Winnand Callewaert: The Hagiographies of Anantadas

Winnand Callewaert: The Hagiographies of Anantadas

Anantadas is the first ”biographer” who, around 1600, wrote about the most popular bhakti poets of the 15th and 16th centuries in Northern India. This critical study of these manuscripts yields a b …

的封面 Rachel Kraus: Gendered Bodies and Leisure

Rachel Kraus: Gendered Bodies and Leisure

With its roots in Middle Eastern and North African dance, belly dance is a popular leisure activity in the West with women (and some men) of all ages and body types pursing the activity for diverse r …