This book provides a principled data-driven framework that progressively constructs, enriches, and applies taxonomies without leveraging massive human annotated data. Traditionally, people construct domain-specific taxonomies by extensive manual curations, which is time-consuming and costly. In today’s information era, people are inundated with the vast amounts of text data. Despite their usefulness, people haven’t yet exploited the full power of taxonomies due to the heavy curation needed for creating and maintaining them. To bridge this gap, the authors discuss automated taxonomy discovery and exploration, with an emphasis on label-efficient machine learning methods and their real-world usages. Taxonomy organizes entities and concepts in a hierarchy way. It is ubiquitous in our daily life, ranging from product taxonomies used by online retailers, topic taxonomies deployed by news outlets and social media, as well as scientific taxonomies deployed by digital libraries across various domains. When properly analyzed, these taxonomies can play a vital role for science, engineering, business intelligence, policy design, e-commerce, and more. Intuitive examples are used throughout enabling readers to grasp concepts more easily.
Les mer
It is ubiquitous in our daily life, ranging from product taxonomies used by online retailers, topic taxonomies deployed by news outlets and social media, as well as scientific taxonomies deployed by digital libraries across various domains.
Les mer
Introduction.- Concept Set Expansion.- Taxonomy Construction.- Taxonomy Enrichment.- Taxonomy-Guided Classification.- Conclusions.
This book provides a principled data-driven framework that progressively constructs, enriches, and applies taxonomies without leveraging massive human annotated data. Traditionally, people construct domain-specific taxonomies by extensive manual curations, which is time-consuming and costly. In today’s information era, people are inundated with the vast amounts of text data. Despite their usefulness, people haven’t yet exploited the full power of taxonomies due to the heavy curation needed for creating and maintaining them. To bridge this gap, the authors discuss automated taxonomy discovery and exploration, with an emphasis on label-efficient machine learning methods and their real-world usages. Taxonomy organizes entities and concepts in a hierarchy way. It is ubiquitous in our daily life, ranging from product taxonomies used by online retailers, topic taxonomies deployed by news outlets and social media, as well as scientific taxonomies deployed by digital libraries across various domains. When properly analyzed, these taxonomies can play a vital role for science, engineering, business intelligence, policy design, ecommerce, and more. Intuitive examples are used throughout enabling readers to grasp concepts more easily. In addition, this book:Discusses the process of creating, maintaining, and applying taxonomies via simple, easy-to-understand examplesProvides a systematic review of the current research frontier of each task and discusses their real-world applications Includes supporting materials containing links to commonly used evaluation datasets and a code repository of representative algorithms
Les mer
Discusses the process of creating, maintaining, and applying taxonomies via simple, easy-to-understand examples Provides a systematic review of the current research frontier of each task and discusses their real-world applications Includes supporting materials containing links to commonly used evaluation datasets and a code repository of representative algorithms
Les mer

Produktdetaljer

ISBN
9783031114045
Publisert
2022-09-29
Utgiver
Vendor
Springer International Publishing AG
Høyde
240 mm
Bredde
168 mm
Aldersnivå
Professional/practitioner, P, 06
Språk
Product language
Engelsk
Format
Product format
Innbundet

Biographical note

Jiaming Shen, Ph.D., is a Research Scientist at Google Research working on data mining and natural language processing. His research aims to develop automated methods for mining knowledge from text data without excessive human annotations.  He completed his Ph.D. from the University of Illinois at Urbana-Champaign and a B.S. degree from Shanghai Jiao Tong University. His research has been awarded several fellowships and scholarships, including a Brian Totty Graduate Fellowship and a Yunni & Maxine Pao Memorial Fellowship.
Jiawei Han, Ph.D. is a Michael Aiken Chair Professor at the University of Illinois at Urbana-Champaign. His research areas encompass data mining, text mining, data warehousing, and information network analysis, with over 800 research publications. He is a Fellow of both ACM and the IEEE and has received numerous prominent awards, including the ACM SIGKDD Innovation Award (2004) and the IEEE Computer Society W. Wallace McDowell Award (2009).