text annotation
30 papers with code • 0 benchmarks • 3 datasets
Benchmarks
These leaderboards are used to track progress in text annotation
Most implemented papers
TeamTat: a collaborative text annotation tool
Manually annotated data is key to developing text-mining and information-extraction algorithms.
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Further, we propose a tri-modal model that jointly processes raw audio, video, and text captions from videos to learn a multi-modal semantic embedding space useful for text-video retrieval.
From Zero to Hero: Human-In-The-Loop Entity Linking in Low Resource Domains
Entity linking (EL) is concerned with disambiguating entity mentions in a text against knowledge bases (KB).
DoTAT: A Domain-oriented Text Annotation Tool
Secondly, the tool provides annotation of event, nested event, and nested entity, which are frequently required in domain-related text structuring tasks.
Fine-grained Image Captioning with CLIP Reward
Toward more descriptive and distinctive caption generation, we propose using CLIP, a multimodal encoder trained on huge image-text pairs from web, to calculate multimodal similarity and use it as a reward function.
LViT: Language meets Vision Transformer in Medical Image Segmentation
In our LViT model, medical text annotation is incorporated to compensate for the quality deficiency in image data.
SciAnnotate: A Tool for Integrating Weak Labeling Sources for Sequence Labeling
Compared to frequently used text annotation tools, our annotation tool allows for the development of weak labels in addition to providing a manual annotation experience.
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response
Timely and effective response to humanitarian crises requires quick and accurate analysis of large amounts of text data - a process that can highly benefit from expert-assisted NLP systems trained on validated and annotated data in the humanitarian response domain.
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Voice conversion (VC) can be achieved by first extracting source content information and target speaker information, and then reconstructing waveform with these information.
Measuring Annotator Agreement Generally across Complex Structured, Multi-object, and Free-text Annotation Tasks
When annotators label data, a key metric for quality assurance is inter-annotator agreement (IAA): the extent to which annotators agree on their labels.