Trends in Parsing Technology
Text, Speech and Language Technology
VOLUME 43
Series Editors
Nancy Ide, Vassar College, New York
Jean Véronis, Université de Provence and CNRS, France
Editorial Board
Harald Baayen, Max Planck Institute for Psycholinguistics, The Netherlands
Kenneth W. Church, Microsoft Research Labs, Redmond WA, USA
Judith Klavans, Columbia University, New York, USA
David T. Barnard, University of Regina, Canada
Dan Tufis, Romanian Academy of Sciences, Romania
Joaquim Llisterri, Universitat Autonoma de Barcelona, Spain
Stig Johansson, University of Oslo, Norway
Joseph Mariani, LIMSI-CNRS, France
For further volumes:
http://www.springer.com/series/6636
Trends in Parsing
Technology
Dependency Parsing, Domain
Adaptation, and Deep Parsing
Edited by
Harry Bunt
Tilburg University, The Netherlands
Paola Merlo
University of Geneva, Switzerland
and
Joakim Nivre
Växjö University and Uppsala University, Sweden
Editors
Harry Bunt
Tilburg University
Tilburg Center for Cognition and Communication (TiCC) and
Dept. of Communication and Information Sciences
Warandelaan 2
5000 LE Tilburg
Netherlands
[email protected]

Paola Merlo
Université de Genève
Dépt. Linguistique
rue de Candolle 2
1211 Genève
Switzerland
[email protected]

Joakim Nivre
Växjö University
Uppsala University
Pimpstensvägen 16
75267 Uppsala
Sweden
joakim.nivre@lingfil.uu.se
ISSN 1386-291X
ISBN 978-90-481-9351-6 e-ISBN 978-90-481-9352-3
DOI 10.1007/978-90-481-9352-3
Springer Dordrecht Heidelberg London New York
Library of Congress Control Number: 2010936679
© Springer Science+Business Media B.V. 2010
No part of this work may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, microfilming, recording or otherwise, without written permission from the Publisher, with the exception of any material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work.
Printed on acid-free paper
Springer is part of Springer Science+Business Media (www.springer.com)
Contents
1 Current Trends in Parsing Technology
Paola Merlo, Harry Bunt, and Joakim Nivre
2 Single Malt or Blended? A Study in Multilingual Parser Optimization
Johan Hall, Jens Nilsson, and Joakim Nivre
3 A Latent Variable Model for Generative Dependency Parsing
Ivan Titov and James Henderson
4 Dependency Parsing and Domain Adaptation with Data-Driven LR Models and Parser Ensembles
Kenji Sagae and Jun-ichi Tsujii
5 Dependency Parsing Using Global Features
Tetsuji Nakagawa
6 Dependency Parsing with Second-Order Feature Maps and Annotated Semantic Information
Massimiliano Ciaramita and Giuseppe Attardi
7 Strictly Lexicalised Dependency Parsing
Qin Iris Wang, Dale Schuurmans, and Dekang Lin
8 Favor Short Dependencies: Parsing with Soft and Hard Constraints on Dependency Length
Jason Eisner and Noah A. Smith
9 Corrective Dependency Parsing
Keith Hall and Václav Novák
10 Inducing Lexicalised PCFGs with Latent Heads
Detlef Prescher
11 Self-Trained Bilexical Preferences to Improve Disambiguation Accuracy
Gertjan van Noord
12 Are Very Large Context-Free Grammars Tractable?
Pierre Boullier and Benoît Sagot
13 Efficiency in Unification-Based N-Best Parsing
Yi Zhang, Stephan Oepen, and John Carroll
14 HPSG Parsing with a Supertagger
Takashi Ninomiya, Takuya Matsuzaki, Yusuke Miyao, Yoshimasa Tsuruoka, and Jun-ichi Tsujii
15 Evaluating the Impact of Re-training a Lexical Disambiguation Model on Domain Adaptation of an HPSG Parser
Tadayoshi Hara, Yusuke Miyao, and Jun-ichi Tsujii
16 Semi-supervised Training of a Statistical Parser from Unlabeled Partially-Bracketed Data
Rebecca Watson, Ted Briscoe, and John Carroll
Index
Contributors
Giuseppe Attardi, Università di Pisa, I-56127, Pisa, Italy, [email protected]
Pierre Boullier, INRIA-Rocquencourt, Domaine de Voluceau, 78153 Le Chesnay Cedex, France, [email protected]
Ted Briscoe, Computer Laboratory, University of Cambridge, Cambridge, UK, [email protected]
Harry Bunt, Tilburg University, Tilburg Center for Cognition and Communication (TiCC) and Department of Communication and Information Sciences, Tilburg, The Netherlands, [email protected]
John Carroll, Department of Informatics, University of Sussex, UK, [email protected]
Massimiliano Ciaramita, Yahoo! Research, S-08018, Barcelona, Catalonia, Spain, [email protected]
Jason Eisner, Johns Hopkins University, Baltimore, MD, USA, [email protected]
Johan Hall, Växjö University, Växjö, Sweden, [email protected]
Keith Hall, Google Research, Zurich, Switzerland, [email protected]
Tadayoshi Hara, Department of Computer Science, Faculty of Information Science and Technology, University of Tokyo, Tokyo 113-0033, Japan, [email protected]
James Henderson, Department of Computer Science, University of Geneva, Geneva, Switzerland, [email protected]
Dekang Lin, Google Inc., Mountain View, CA 94043, USA, [email protected]
Takuya Matsuzaki, Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan, [email protected]
Paola Merlo, University of Geneva, Geneva, Switzerland, [email protected]
Yusuke Miyao, Department of Computer Science, Faculty of Information Science and Technology, University of Tokyo, Tokyo, 113-0033, Japan, [email protected]
Tetsuji Nakagawa, Knowledge Creating Communication Research Center, National Institute of Information and Communications Technology, Kyoto 619-0289, Japan, [email protected]
Jens Nilsson, Växjö University, Växjö, Sweden, [email protected]
Takashi Ninomiya, Graduate School of Science and Engineering, Ehime University, 3 Bunkyo-cho, Matsuyama, Ehime, Japan, [email protected]
Joakim Nivre, Uppsala University, Uppsala, Sweden, joakim.nivre@lingfil.uu.se
Václav Novák, Charles University, Prague, Czech Republic, [email protected]
Stephan Oepen, Department of Informatics, University of Oslo, Oslo, Norway, oe@ifi.uio.no
Detlef Prescher, 76307 Karlsbad, Czech Republic, [email protected]
Kenji Sagae, Institute for Creative Technologies, University of Southern California, Marina del Rey, CA 90292, USA, [email protected]
Benoît Sagot, INRIA-Rocquencourt, Domaine de Voluceau, 78153 Le Chesnay Cedex, France, [email protected]
Dale Schuurmans, Department of Computing Science, University of Alberta, Edmonton, AB, Canada T6G 2E8, [email protected]
Noah A. Smith, Carnegie Mellon University, Pittsburgh, PA, USA, [email protected]
Ivan Titov, Cluster of Excellence, MMC, Saarland University, Saarbrücken, Germany, [email protected]
Jun-ichi Tsujii, Department of Computer Science, Faculty of Information Science and Technology, University of Tokyo, Tokyo, 113-0033, Japan; School of Computer Science, University of Manchester, Manchester, UK; National Center for Text Mining (NaCTeM), [email protected]
Yoshimasa Tsuruoka, School of Information Science, Japan Advanced Institute of Science and Technology (JAIST), 1-1 Asahidai, Nomi, Ishikawa, Japan, [email protected]
Gertjan van Noord, Faculty of Arts, University of Groningen, 9700 AS Groningen, The Netherlands, [email protected]
Qin Iris Wang, Yahoo! Inc., Santa Clara, CA 95054, USA, [email protected]
Rebecca Watson, Computer Laboratory, University of Cambridge, Cambridge, UK, [email protected]
Yi Zhang, Language Technology Laboratory, Department of Computational Linguistics, Saarland University, DFKI GmbH, Saarbrücken, Germany, [email protected]
Description: Parsing technology is a central area of research in the automatic processing of human language. It is concerned with the decomposition of complex structures into their constituent parts, in particular with the methods, the tools and the software to parse automatically. Parsers are used in many appli