Low resource social media text mining / Shriphani Palakodety, Ashiqur R. KhudaBukhsh, Guha Jayachandran.
2021
QA76.9.D343
Formats
| Format | |
|---|---|
| BibTeX | |
| MARCXML | |
| TextMARC | |
| MARC | |
| DublinCore | |
| EndNote | |
| NLM | |
| RefWorks | |
| RIS |
Cite
Citation
Linked e-resources
Linked Resource
Concurrent users
Unlimited
Authorized users
Authorized users
Document Delivery Supplied
Can lend chapters, not whole ebooks
Details
Title
Low resource social media text mining / Shriphani Palakodety, Ashiqur R. KhudaBukhsh, Guha Jayachandran.
ISBN
9789811656255 (electronic bk.)
9811656258 (electronic bk.)
9789811656248 (print)
981165624X
9811656258 (electronic bk.)
9789811656248 (print)
981165624X
Published
Singapore : Springer, 2021.
Language
English
Description
1 online resource (xi, 60 pages) : illustrations (some color).
Item Number
10.1007/978-981-16-5625-5 doi
Call Number
QA76.9.D343
Dewey Decimal Classification
006.3/12
Summary
This book focuses on methods that are unsupervised or require minimal supervision--vital in the low-resource domain. Over the past few years, rapid growth in Internet access across the globe has resulted in an explosion in user-generated text content in social media platforms. This effect is significantly pronounced in linguistically diverse areas of the world like South Asia, where over 400 million people regularly access social media platforms. YouTube, Facebook, and Twitter report a monthly active user base in excess of 200 million from this region. Natural language processing (NLP) research and publicly available resources such as models and corpora prioritize Web content authored primarily by a Western user base. Such content is authored in English by a user base fluent in the language and can be processed by a broad range of off-the-shelf NLP tools. In contrast, text from linguistically diverse regions features high levels of multilinguality, code-switching, and varied language skill levels. Resources like corpora and models are also scarce. Due to these factors, newer methods are needed to process such text. This book is designed for NLP practitioners well versed in recent advances in the field but unfamiliar with the landscape of low-resource multilingual NLP. The contents of this book introduce the various challenges associated with social media content, quantify these issues, and provide solutions and intuition. When possible, the methods discussed are evaluated on real-world social media data sets to emphasize their robustness to the noisy nature of the social media environment. On completion of the book, the reader will be well-versed with the complexity of text-mining in multilingual, low-resource environments; will be aware of a broad set of off-the-shelf tools that can be applied to various problems; and will be able to conduct sophisticated analyses of such text.
Bibliography, etc. Note
Includes bibliographical references.
Access Note
Access limited to authorized users.
Digital File Characteristics
text file
PDF
Source of Description
Online resource; title from PDF title page (SpringerLink, viewed October 5, 2021).
Series
SpringerBriefs in computer science.
Available in Other Form
Print version: 9789811656248
Linked Resources
Record Appears in
Table of Contents
Chapter 1: Introduction and outline
Chapter 2: Natural Language Processing Preliminary
Chapter 3: Low-Resource Multilingual Social Media Text and Challenges
Chapter4: Robust Language Identification
Chapter 5: Semantic Sampling
Chapter6: Unsupervised Machine Translation.
Chapter 2: Natural Language Processing Preliminary
Chapter 3: Low-Resource Multilingual Social Media Text and Challenges
Chapter4: Robust Language Identification
Chapter 5: Semantic Sampling
Chapter6: Unsupervised Machine Translation.