Foreebank: Hand-annotated constituency trees with grammatical error corrections for 1,000 English and 1,000 French Norton Forum sentences (with Rasoul Kaljahi and Johann Roturier). Paper
SymForum: An English/French Norton Forum dataset for machine translation quality estimation (with Rasoul Kaljahi and Johann Roturier). Paper
Universal Dependencies for Irish: 1,000 Irish sentences annotated in the Universal Dependencies scheme (with Teresa Lynn).
GenERRate: A tool for automatically inserting a grammatical error into a sentence. Paper. GitHub Repository (thanks to Nicholas Dronen)
BNC Test Set: PTB-style hand-annotated phrase structure trees for 1,000 BNC sentences. Paper