Loading...
「ツール」は右上に移動しました。
29いいね 1667回再生

Text Entailment approach for Zero-shot Text Classification (Research Paper Walkthrough)

#zeroshot #textclassification #nlp
⏩ Abstract: Zero-shot text classification (0Shot-TC) is a challenging NLU problem to which little attention has been paid by the research community. 0Shot-TC aims to associate an appropriate label with a piece of text, irrespective of the text domain and the aspect (e.g., topic, emotion, event, etc.) described by the label. And there are only a few articles studying 0Shot-TC, all focusing only on topical categorization which, we argue, is just the tip of the iceberg in 0Shot-TC. In addition, the chaotic experiments in literature make no uniform comparison, which blurs the progress. This work benchmarks the 0Shot-TC problem by providing unified datasets, standardized evaluations, and state-of-the-art baselines. Our contributions include: i) The datasets we provide facilitate studying 0Shot-TC relative to conceptually different and diverse aspects: the “topic” aspect includes “sports” and “politics” as labels; the “emotion” aspect includes “joy” and “anger”; the “situation” aspect includes “medical assistance” and “water shortage”. ii) We extend the existing evaluation setup (label-partially-unseen) – given a dataset, train on some labels, test on all labels – to include a more challenging yet realistic evaluation label-fully-unseen 0Shot-TC (Chang et al., 2008), aiming at classifying text snippets without seeing task specific training data at all. iii) We unify the 0Shot-TC of diverse aspects within a textual entailment formulation and study it this way.

⏩ OUTLINE:
0:00 - Understanding Zero-shot Text Classification with example
01:45 - An entailment approach for zero-shot text classification
03:11 - Text entailment task
04:01 - Converting labels into hypothesis with example
07:32 - Results

⏩ Paper Title: Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach
⏩ Paper: aclanthology.org/D19-1404/
⏩ Author: Wenpeng Yin, Jamaal Hay, Dan Roth
⏩ Organisation: Cognitive Computation Group, Department of Computer and Information Science, University of Pennsylvania

Enjoy reading articles? then consider subscribing to Medium membership, it just 5$ a month for unlimited access to all free/paid content. Subscribe now - prakhar-mishra.medium.com/membership

*********************************************
If you want to support me financially which totally optional and voluntary :) ❤️
You can consider buying me chai ( because i don't drink coffee :) ) at www.buymeacoffee.com/TechvizCoffee
*********************************************

⏩ IMPORTANT LINKS
Zero-shot NLP Playlist:    • Zero-Shot Crosslingual Sentence Simpl...  
Watch some popular research papers in NLP:    • Simple Unsupervised Keyphrase Extract...  
Fill-in-the-blanks using Language Models like GPT:    • Enabling Language Models to Fill in t...  
BERT Goes Shopping: Comparing Distributional Models for Product Representations:    • BERT Goes Shopping: Comparing Distrib...  

*********************************************
⏩ Youtube -    / @techvizthedatascienceguy  
⏩ LinkedIn - linkedin.com/in/prakhar21
⏩ Medium - medium.com/@prakhar.mishra
⏩ GitHub - github.com/prakhar21
*********************************************

⏩ Please feel free to share out the content and subscribe to my channel -    / @techvizthedatascienceguy  

Tools I use for making videos :)
⏩ iPad - tinyurl.com/y39p6pwc
⏩ Apple Pencil - tinyurl.com/y5rk8txn
⏩ GoodNotes - tinyurl.com/y627cfsa

#techviz #datascienceguy #classification #naturallanguageprocessing #transformers

About Me:
I am Prakhar Mishra and this channel is my passion project. I am currently pursuing my MS (by research) in Data Science. I have an industry work-ex of 3 years in the field of Data Science and Machine Learning with a particular focus on Natural Language Processing (NLP).