Basic Information
Biography
Education
In high school, I took classes in German, Spanish, and Latin, and dabbled in Greek and Russian on my own. Interest in Tolkien and constructed languages led me to linguistics.
As an undergraduate, I studied classics and linguistics at Indiana University, completing the B.A. in 1983. I majored in Greek, and also took classes in Latin, German, French, and Hebrew. Our field methods course was on Soninke (a Mande language), and I got research funding to keep working with the consultant on my own during the summer [1]. My senior thesis was on comparative mythology.
I received my doctorate in linguistics from MIT in 1987. My dissertation [6] proposed the "DP-hypothesis," which treats functional elements uniformly as syntactic heads.
Bellcore (1987-1993)
As a student, I also began working on parsing [3,4,7], which led first to a summer internship, and then to a full-time position at Bell Communications Research (Bellcore). I was interested in emulating human parsing, and my approach was to factor parse trees into "chunks and dependencies." A branch of that work was the connection between syntactic structure and prosody.
At Bellcore, I began studying stochastic models. I had the good fortune to collaborate with Kevin Mark and Michael Miller one summer [22]. My contributions at the time were entirely linguistic rather than mathematical, but I did absorb the idea of random fields from them.
Tübingen, Germany (1993-1997)
At Tübingen, my work on chunk parsing culminated in the parser called Cass. Its major advantage was speed: in contrast to standard chart parsers, which ran at 1-10 words per second, Cass processed 10,000 words per second, allowing one to parse large corpora rapidly. What was missing in Cass was the dependencies part of "chunks and dependencies," and I began working on induction methods to acquire them, using Cass itself to bootstrap them from corpora, in collaboration with Mats Rooth and Marc Light.
I also spent time studying random fields and used them to formulate a probabilistic version of attribute-value grammars.
AT&T Laboratories (1997-2002)
I continued working on bootstrapping at AT&T Labs. I became especially interested in semisupervised learning and boosting. I also revived my earlier work on prosody.
My main projects, though, involved building systems:
Ionaut, built in collaboration with Michael Collins and Amit Singhal, combined web search with entity recognition and question answering.
Mage was a spoken dialogue system for phone-based email access in which I integrated several technologies developed by other groups at the Labs (speech recognition, speech synthesis, and telephony control) and added natural language processing and dialogue management.
PreTTS, built in collaboration with Don Hindle, was a system for parsing and preprocessing complex email messages in order to drive speech synthesis and "read" them comprehensibly.
University of Michigan (since 2002)
Since coming to the University of Michigan, my major projects have been:
Information extraction, especially in the biomedical domain.
Writing a book on semisupervised learning [62].
Language digitization, which is to say, language documentation and description that supports automated processing across languages.
Papers (62 total)
Graham Neubig, Shruti Rijhwani, Alexis Palmer, Jordan MacKenzie, Hilaria Cruz, Xinjian Li, Matthew Lee, Aditi Chaudhary, Luke Gessler, Steven Abney, Shirley Anugrah Hayati, Antonios Anastasopoulos, Olga Zamaraeva, Emily Prud'hommeaux, Jennette Child, Sara Child, Rebecca Knowles, Sarah Moeller, Jeffrey Micher, Yiyuan Li, Sydney Zink, Mengzhou Xia, Roshan S. Sharma, Patrick Littell
SLTU/CCURL, LREC, pp. 342-351 (2020)
Citations: 5
Machine Translation, no. 1 (2014): 61-63
Linguistic Issues in Language Technology (2011)
Steven P. Abney, S. Kurohashi, S. Bangalore, Irene Langkilde-Geary, C. Brew, Mirella Lapata, Sharon A. Caraballo, C. Leacock, Bob Carpenter, B. Levin, Stanley F. Chen, D. Litman, Kenneth Ward Church, I. Mani, Michael Collins, Christopher Manning, Ann A. Copestake, D. Marcu, M. Crocker, E. Marsi, P. Deane, Diana McCarthy, Mona T. Diab, I. D. Melamed, M. Dras, J. Minett, Jason Eisner, Robert C. Moore, E. Fosler-Lussier, Thomas Morton, George Foster, H. Ney, R. Frank, G. Ngai, Jianfeng Gao, Kemal Oflazer, Claire Gardent, Massimo Poesio, Tanja Gaustad van Zaanen, Judita Preiss, D. Gildea, Ehud Reiter, Andrew R. Golding, P. Resnik, Joshua Goodman, Roni Rosenfeld, G. Grefenstette, Frank Schilder, Mohammad Haji-Abdolhosseini, Lenhart K. Schubert, P. Heeman, Advaith Siddharthan, D. Higgins, R. Sproat, J. Hockenmaier, M. Strube, H. Horacek, M. Swerts, D. Inkpen, Simone Teufel, Martin Jansche, Kees van Deemter, Mark Johnson, Ye-Yi Wang, Frank Keller, B. Webber, A. Kilgarriff, J. Wiebe, Kevin Knight, Florian Wolf
Computational Linguistics (2010): 579-579
Semisupervised Learning for Computational Linguistics, pp. 1-+ (2008)
Citations: 187
Author Statistics
#Papers: 60
#Citation: 12614
H-Index: 28
G-Index: 47
Sociability: 5
Diversity: 1
Activity: 0