
Resource: PoeTree, poetry corpora in 11 languages

PoeTree is a standardized collection of poetry corpora comprising over 330,000 poems in 11 languages (Czech, English, French, German, Hungarian, Italian, Norwegian, Portuguese, Russian, Slovenian, Spanish). Each corpus has been deduplicated, annotated according to Universal Dependencies, supplemented with additional metadata, and converted into a unified JSON structure.
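
To make the deduplication step more concrete, here is a minimal sketch of how exact and near-exact duplicates could be dropped from JSON poem records. It is not PoeTree's actual procedure or schema: the field names `id`, `lang`, and `text`, the helper names, and the normalize-then-hash approach are all assumptions chosen for illustration.

```python
import hashlib
import json
import re


def normalize(text: str) -> str:
    """Lowercase and collapse punctuation/whitespace so trivial variants match."""
    return re.sub(r"\W+", " ", text.lower()).strip()


def deduplicate(poems: list[dict]) -> list[dict]:
    """Keep the first poem seen for each distinct normalized text."""
    seen = set()
    unique = []
    for poem in poems:
        # "text" is a hypothetical field name, not PoeTree's documented schema.
        key = hashlib.sha256(normalize(poem["text"]).encode("utf-8")).hexdigest()
        if key not in seen:
            seen.add(key)
            unique.append(poem)
    return unique


# Hypothetical records standing in for poems in a unified JSON structure.
sample = json.loads("""
[
  {"id": "a1", "lang": "en", "text": "Tyger Tyger, burning bright"},
  {"id": "a2", "lang": "en", "text": "tyger  tyger burning bright!"},
  {"id": "b1", "lang": "cs", "text": "Byl pozdní večer, první máj"}
]
""")

unique = deduplicate(sample)
print(json.dumps(unique, ensure_ascii=False, indent=2))
```

Hashing the normalized text only catches copies that differ in case, spacing, or punctuation. Detecting reprints with spelling variants or edited lines would need a fuzzier similarity measure.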
