…This corpus has several outstanding characteristics: hundreds of hours of aligned audio from a diverse set of readers about a diverse set of topics in a well-researched textual genre licensed under a free license (CC BY-SA 4.0) Annotations can be mapped back to the original html phoneme-level alignments
1 PAPER • 1 BENCHMARK
license: apache-2.0 tags: human-feedback size_categories: 100K<n<1M pretty_name: OpenAssistant Conversations OpenAssistant Conversations Dataset (OASST1) Dataset Description Homepage: https://www.open-assistant.io
15 PAPERS • NO BENCHMARKS YET