Anders Søgaard

Associate Professor in Language Technology. Head of MA in IT & Cognition.

Born 1981 in Odense. Wikipedia: [Link].

2003: BA in Linguistics (University of Copenhagen). 2004: MA in Computational Linguistics (CBS). 2007: PhD in Language Technology (University of Copenhagen).

Previous positions: Senior researcher (University of Potsdam), Postdoctoral researcher (CBS).

Research interests: natural language processing, machine learning, philosophy of science.


News: Dirk Hovy and Barbara Plank and I will teach a COLING 2014 tutorial on bias correction for NLP.


Research publications, Reviews, Editorial work, Danish journals, Invited seminars, Projects, Other research activities, Interviews (Danish).


Research publications

2014

Søgaard, Anders; Johannsen, Anders; Plank, Barbara; Hovy, Dirk; Martinez, Hector. 2014. What’s in a p-value in NLP? The 18th Conference on Computational Natural Language Learning (CoNLL). Baltimore, MD.

Plank, Barbara; Hovy, Dirk; Søgaard, Anders. 2014. Linguistically debatable or just plain wrong? The 52nd Annual Meeting of the Association for Computational Linguistics (ACL). Baltimore, MD.

Hovy, Dirk; Plank, Barbara; Søgaard, Anders. 2014. Experiments with a crowd-sourced re-annotation of a POS tagging dataset. The 52nd Annual Meeting of the Association for Computational Linguistics (ACL). Baltimore, MD.

Plank, Barbara; Hovy, Dirk; Søgaard, Anders. 2014. Learning part-of-speech taggers with inter-annotator agreement loss. The 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL). Gothenburg, Sweden. Received Best Long Paper Award.

Hovy, Dirk; Plank, Barbara; Søgaard, Anders. 2014. When POS datasets do not add up: Combatting sample bias. Language Resources and Evaluation Conference 2014. Reykjavik, Iceland.

Fromheide, Hege; Søgaard, Anders. 2014. Crowdsourcing and annotating NER for Twitter #drift. Language Resources and Evaluation Conference 2014. Reykjavik, Iceland.

2013

Søgaard, Anders. Semi-supervised learning and domain adaptation for NLP. Morgan & Claypool. [HTML]

Søgaard, Anders; Martinez, Hector; Elming, Jakob; Johannsen, Anders. 2013. Using crowdsourcing to get representations based on regular expressions. Conference on Empirical Methods in Natural Language Processing (EMNLP) 2013. Seattle, WA. [PDF]

Matthies, Franz; Søgaard, Anders. 2013. With blinkers on: Robust prediction of eye movements across readers. Conference on Empirical Methods in Natural Language Processing (EMNLP) 2013. Seattle, WA. [PDF]

Søgaard, Anders. 2013. Part-of-speech tagging with antagonistic adversaries. The 51st Annual Meeting of the Association for Computational Linguistics (ACL). Sofia, Bulgaria. [PDF]

Elming, Jakob; Johannsen, Anders; Klerke, Sigrid; Lapponi, Emanuele; Alonso, Hector Martinez; Søgaard, Anders. 2013. Down-stream effects of tree-to-dependency conversions. North American Chapter of the Association for Computational Linguistics (NAACL). Atlanta, GA. [PDF]

Søgaard, Anders. 2013. Zipfian corruptions for robust POS tagging. North American Chapter of the Association for Computational Linguistics (NAACL). Atlanta, GA. [PDF] [Video]

Søgaard, Anders. 2013. Estimating effect size across datasets. North American Chapter of the Association for Computational Linguistics (NAACL). Atlanta, GA. [PDF]

Johannsen, Anders; Søgaard, Anders. 2013. Cross-domain answer ranking using importance sampling. The 6th International Joint Conference on Natural Language Processing (IJCNLP). Nagoya, Japan. [PDF]

Johannsen, Anders; Søgaard, Anders. 2013. Disambiguating explicit discourse connectives without oracles. The 6th International Joint Conference on Natural Language Processing (IJCNLP). Nagoya, Japan. [PDF]

Klerke, Sigrid; Søgaard, Anders. 2013. Simple readable sub-sentences. The 51st Annual Meeting of the Association for Computational Linguistics (ACL), Student Research Workshop. Sofia, Bulgaria. [PDF]

Søgaard, Anders. 2013. An empirical study of differences between conversion schemes and annotation guidelines. International Conference on Dependency Linguistics 2013. Prague, the Czech Republic. [PDF]

Klerke, Sigrid; Elbro, Carsten; Søgaard, Anders. 2013. Tracking readability in eye movements. 17th European Conference on Eye Movements. Lund, Sweden.

2012

Søgaard, Anders. 2012. Unsupervised dependency parsing without training. Natural Language Engineering 18(1):187-203. [Link]

Søgaard, Anders; Johannsen, Anders. 2012. Robust learning in random subspaces: equipping NLP for OOV effects. The 24th International Conference on Computational Linguistics (COLING). Mumbai, India. [PDF]

Søgaard, Anders; Wulff, Julie. 2012. An empirical study of non-lexical extensions to delexicalized transfer. The 24th International Conference on Computational Linguistics (COLING). Mumbai, India. [PDF]

Søgaard, Anders. 2012. Mining wisdom. Computational Linguistics in Literature, Nothern American Chapter of the Association of Computational Linguistics (NAACL). Montreal, Canada. [PDF]

Søgaard, Anders. 2012. Two baselines for unsupervised dependency parsing. Workshop on Inducing Linguistic Structure, North American Chapter of the Association of Computational Linguistics (NAACL). Montreal, Canada. [PDF]

Johannsen, Anders; Martinez, Hector; Klerke, Sigrid; Søgaard, Anders. 2012. EMNLP@CPH: Is frequency all there is to simplicity? SemEval-2012, 1st Joint Conference on Lexical and Computational Semantics. Montreal, Canada. [PDF]

Nisbeth, Niklas; Søgaard, Anders. 2012. Parser combination under sample bias. SPLeT, Language Resources and Evaluation Conference. Istanbul, Turkey.

Klerke, Sigrid; Søgaard, Anders. 2012. DSim, a Danish parallel corpus for text simplification. The 8th International Conference on Language Resources and Evaluation. Istanbul, Turkey. [PDF]

Søgaard, Anders; Kristiansen, Søren Lind. 2012. Using hybrid logic for querying dependency treebanks. Linguistic Issues in Language Technology 7(5). [PDF]

Plank, Barbara; Søgaard, Anders. 2012. Experiments in newswire-to-law adaptation of graph-based dependency parsers. 8* Convegno Nazionale dell'Associazione Italiana di Scienze della Voce. Rome, Italy. [PDF]

2011

Søgaard, Anders. 2011. A O(|G|n^6) time extension of inversion transduction grammars. Machine Translation 25(4):291-315. [Link]

Søgaard, Anders; Haulrich, Martin. 2011. Sentence-level instance-weighting for graph-based and transition-based dependency parsing. The 12th International Conference on Parsing Technologies (IWPT). Dublin, Ireland. [PDF]

Søgaard, Anders. 2011. Semi-supervised condensed nearest neighbor for part-of-speech tagging. The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT). Portland, Oregon. Nominated for Best Short Paper Award. [PDF] [Code] [ACLWiki]

Søgaard, Anders. 2011. Data point selection for cross-language adaptation of dependency parsers. The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT). Portland, Oregon. [PDF]

Johannsen, Anders; Martinez, Hector; Rishøj, Christian; Søgaard, Anders. 2011. Frustratingly hard compositionality prediction. Distributional Semantics and Compositionality, the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT). Portland, Oregon. [PDF]

Søgaard, Anders. 2011. From ranked words to dependency trees: two-stage unsupervised non-projective dependency parsing. TextGraphs-6: Graph-based Methods for Natural Language Processing, the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT). Portland, Oregon. Errata: The assertion that the algorithm described in the paper does not guarantee projectivity (p. 65) is not true. [PDF]

Rishøj, Christian; Søgaard, Anders. 2011. Factored translation using unsupervised word clusters. The 6th Workshop on Statistical Machine Translation, Conference on Empirical Methods in Natural Language Processing (EMNLP). Scotland, Edinburgh. [PDF]

Søgaard, Anders. 2011. Using graphical models for PP attachment. The 18th Nordic Conference on Computational Linguistics. Riga, Latvia. [PDF]

Søgaard, Anders. 2011. Learning grammatical functions in a realistic way. The 3rd Conference of the Scandinavian Association for Language and Cognition. Copenhagen, Denmark.

Søgaard, Anders. 2011. Cable television; Fanzines; Functionalist theories; Journalism. In Marcel Danesi (ed.), Encyclopedia of Media and Communication. Toronto: University of Toronto Press.

2010

Søgaard, Anders; Rishøj, Christian. 2010. Semi-supervised dependency parsing using generalized tri-training. The 23rd International Conference on Computational Linguistics (COLING). Beijing, China. [PDF] [Code]

Søgaard, Anders. 2010. Simple semi-supervised training of part-of-speech taggers. The 48th Annual Meeting of the Association for Computational Linguistics (ACL). Uppsala, Sweden. [PDF]

Søgaard, Anders; Rishøj, Christian. 2010. The effect of semi-supervised learning on parsing long distance dependencies in German and Swedish. The 7th International Conference on Natural Language Processing (IceTAL). Reykjavik, Iceland. [PDF]

Søgaard, Anders; Johannsen, Anders. 2010. Robust semi-supervised and ensemble-based methods in word sense disambiguation. The 7th International Conference on Natural Language Processing (IceTAL). Reykjavik, Iceland. [PDF]

Kristiansen, Søren Lind; Søgaard, Anders. 2010. Querying dependency treebanks in hybrid logic. Hybrid Logic and Applications, The 25th Annual IEEE Symposium on Logic in Computer Science (LICS). Edinburgh, Scotland.

Søgaard, Anders. 2010. Can inversion transduction grammars generate hand alignments? The 14th Annual Conference of the European Association for Machine Translation (EAMT). St. Raphael, France. Errata: The numbers in Figure 4 are incorrect. Email me for correct numbers. [PDF]

Søgaard, Anders; Haulrich, Martin. 2010. On the derivation perplexity of treebanks. Treebanks and Linguistic Theories 9. Riga, Latvia. [PDF]

2009

Søgaard, Anders; Lange, Martin. 2009. Polyadic dynamic logics for HPSG parsing. Journal of Logic, Language and Information 18(2): 159-198. (ERIH Category: A)

Søgaard, Anders; Wu, Dekai. 2009. Empirical lower bounds on translation unit error rate for the full class of inversion transduction grammars. The 11th International Conference on Parsing Technologies (IWPT). Paris, France. [PDF]

Søgaard, Anders; Kuhn, Jonas. 2009. Using a maximum entropy-based tagger to improve a very fast vine parser. The 11th International Conference on Parsing Technologies (IWPT). Paris, France. [PDF]

Søgaard, Anders; Kuhn, Jonas. 2009. Empirical lower bounds on aligment error rates in syntax-based machine translation. The 3rd Workshop on Syntax and Structure in Statistical Translation, North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT) 2009. Boulder, Colorado. Errata: Email me.

Søgaard, Anders. 2009. On the complexity of alignment problems in two synchronous grammar formalisms. The 3rd Workshop on Syntax and Structure in Statistical Translation, North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT) 2009. Boulder, Colorado. [PDF]

Søgaard, Anders; Østerskov, Stine. 2009. On definitions of consciousness. Journal of Consciousness Studies.

Søgaard, Anders. 2009. Compound constructions: a reply to Bundgaard et al. Semiotica: Journal of the International Association for Semiotic Studies 169(1): 163-169. (ERIH Category: A)

Søgaard, Anders. 2009. Ensemble-based POS tagging of Italian. The 11th Conference of the Italian Association for Artificial Intelligence, EVALITA. Reggio Emilia, Italy.

Søgaard, Anders; Rishøj, Christian. 2009. Vine parsing augmented Italian treebanks. The 11th Conference of the Italian Association for Artificial Intelligence, EVALITA. Reggio Emilia, Italy.

Søgaard, Anders. 2009. A linear time extension of deterministic pushdown automata. The 17th Nordic Conference on Computational Linguistics. Odense, Denmark.

Søgaard, Anders. 2009. Cubic time querying of treebanks for nonlocal multicomponent tree-adjoining grammar and head-driven phrase structure grammar. The 17th Nordic Conference on Computational Linguistics. Odense, Denmark.

Søgaard, Anders; Haugereid, Petter. 2009. Introduction. In Anders Søgaard and Petter Haugereid (eds.), Typed feature structure grammars. Berlin: Peter Lang.

Søgaard, Anders. 2009. From unordered context-free grammar to polysize HPSG without moving. In Anders Søgaard and Petter Haugereid (eds.), Typed feature structure grammars. Berlin: Peter Lang.

2008

Søgaard, Anders. 2008. Range concatenation grammars for translation. The 22nd International Conference on Computational Linguistics (COLING). Manchester, England. Pp. 103-106. [PDF]

Søgaard, Anders. 2008. On the weak generative capacity of weighted context-free grammars. The 22nd International Conference on Computational Linguistics (COLING). Manchester, England. Pp. 99-102. [PDF]

Maier, Wolfgang; Søgaard, Anders. 2008. Treebanks and mild context-sensitivity. The 13th Conference on Formal Grammar. Hamburg, Germany. Pp. 61-76. [PDF]

Søgaard, Anders. 2008. Learning context-sensitive synchronous rules. The 13th Annual Conference on the European Association for Machine Translation (EAMT). Hamburg, Germany. Pp. 170-175. Errata: email me.

2007

Søgaard, Anders; Haugereid, Petter. 2007. A tractable typed feature structure grammar for Mainland Scandinavian. Nordic Journal of Linguistics 30(1): 87-128. (ERIH Category: B)

Søgaard, Anders. 2007. The grammaticalization and disappearance of adpositions in nominal compounds. California Linguistic Notes 17(2): 1-24.

Søgaard, Anders; Lichte, Timm; Maier, Wolfgang. 2007. On the complexity of linguistically motivated extensions of tree-adjoining grammar. Recent Advances in Natural Language Processing 2007. Borovets, Bulgaria. [PDF]

Søgaard, Anders. 2007. Operations on polyadic structures. Model-theoretic syntax @ 10, the 19th European Summer School on Logic, Language and Information. Dublin, Ireland.

Søgaard, Anders. 2007. Complexity of conceptual integration. The 10th International Cognitive Linguistics Conference. Krakow, Poland.

Søgaard, Anders. 2007. Polynomial charts for totally unordered languages. Proceedings of the 16th Nordic Conference of Computational Linguistics. Tartu, Estonia. Pp. 183-190.

Søgaard, Anders. 2007. Propositional and first order verification of linguistic structures. Proceedings of the 2nd International Workshop on Typed Feature Structure Grammars. Tartu, Estonia. Pp. 17-24.

Søgaard, Anders; Nimb, Sanni. 2007. A typed account of adverbial quantifiers. Proceedings of the 2nd International Workshop on Typed Feature Structure Grammars. Tartu, Estonia. Pp. 53-61.

Søgaard, Anders. 2007. From unordered context-free grammar to polysize HPSG without moving. Proceedings of the 1st International Workshop on Typed Feature Structure Grammars. Aalborg, Denmark. Pp. 17-30.

Søgaard, Anders. 2007. Mathematical properties of natural language and mathematical properties of linguistic theories. The 21st Grammar in Focus. Lund, Sweden.

2006

Søgaard, Anders. 2006. Computational semantics as reasoning about programs. South Asian Language Review 16, Special Issue on Computational Semantics.

Søgaard, Anders. 2006. The semantics of possession in natural language and knowledge representation. Journal of Universal Language 6(2): 85-115.

Søgaard, Anders. 2006. Model generation in a dynamic environment. Takashi Washio, Akito Sakurai, Satoshi Tojo and Makoto Yokoo (eds.), New Frontiers in Artificial Intelligence. Berlin: Springer. Pp. 126-133.

Søgaard, Anders. 2006. Unification-based grammars and complexity classes. 8. Konferenz zur Verarbeitung natürlicher Sprache. Konstanz, Germany. Pp. 137-142.

Søgaard, Anders. 2006. Embodied construction grammar as layered modal languages. Proceedings of The Joint Human Language Technology Conference and the North American Chapter of the Association of Computational Linguistics 2006, Third International Workshop on Scalable Natural Language Understanding. New York, New York. Pp. 65-72.

Søgaard, Anders. 2006. Logical investigations on the adequacy of certain feature-based theories of natural language. Proceedings of The Joint Human Language Technology Conference and the North American Chapter of the Association of Computational Linguistics 2006, Doctoral Consortium. New York, New York. Pp. 239-242.

2005

Søgaard, Anders. 2005. Update semantics for HPSG grammars. H. Holmboe (ed.), Nordisk Sprogteknologi 2005. Copenhagen: Museum Tusculanum. Pp. 167-72.

Søgaard, Anders. 2005. Compounding theories and linguistic diversity. Zygmunt Frajzyngier, Adam Hodges and David S. Rood (eds.), Linguistic diversity and language theories. Amsterdam: John Benjamins. Pp. 319-37.

Søgaard, Anders; Haugereid, Petter. 2005. A brief documentation of a computational HPSG grammar specifying (most of) the common subset of linguistic types for Danish, Norwegian and Swedish. H. Holmboe (ed.), Nordisk Sprogteknologi 2004. Copenhagen: Museum Tusculanum. Pp. 247-56.

Søgaard, Anders. 2005. Where does the meaning of compounds and possessives come from? A contrastive view. The 3rd International Conference in Contrastive Semantics and Pragmatics. Shanghai, China.

Søgaard, Anders. 2005. Computing sense and reference. Computing and Philosophy 2005. Västerås, Sweden.

Søgaard, Anders; Haugereid, Petter. 2005. Functionality in grammar design. Stefan Werner (ed.), Proceedings of The 15th Nordic Conference of Computational Linguistics. Joensuu: University of Joensuu Electronic Publications in Linguistics and Language Technology, vol. 1. Pp. 193-202.

Søgaard, Anders. 2005. Model generation in a dynamic environment. Proceedings of Logic and Engineering of Natural Language Semantics 2005. Kitakyushu, Japan.

Søgaard, Anders. 2005. Extending the HPSG Grammar Matrix with richer lexical semantics. Proceedings of The 3rd International Workshop on Generative Approaches to the Lexicon. Geneva, Switzerland. [PDF]

2004

Søgaard, Anders. 2004. A compound matrix. Proceedings of The 11th International Conference on Head-Driven Phrase Structure Grammar (HPSG). Leuven, Belgium.

Søgaard, Anders. 2004. On appropriateness as a relation between ontologies. International Conference on Formal Ontology in Information Systems. Torino, Italy.

Søgaard, Anders. 2004. K-structure: (a prerequisite for) an interlingua. Papers from the 20th Scandinavian Conference of Linguistics. Helsinki, Finland.

Søgaard, Anders; Haugereid, Petter. 2004. The noun phrase in Mainland Scandinavian. The 3nd Meeting of the Scandinavian Network of Grammar Engineering and Machine Translation. Gothenburg, Sweden.

2003

Søgaard, Anders. 2003. A compound algorithm. Computational Linguistics in the Netherlands 2003. Antwerpen, Belgium.

Søgaard, Anders. 2003. Compounding and the generative lexicon. Sprache, Wissen, Wissenschaft. Munich, Germany.

Reviews

Søgaard, Anders. 2011. Stenning and van Lambalgen, Human reasoning and cognitive science. Studia Logica 97: 317-318. (ERIH Category: A)

Søgaard, Anders. 2008. Gisbert Fanselow et al. (eds.), Gradience in grammar. Nordic Journal of Linguistics 31(1): 109-116. (ERIH Category: B)

Søgaard, Anders. 2007. Dov Gabbay et al., Mathematical problems from applied logic. Studia Logica 87(2): 363-367. (ERIH Category: A)

Søgaard, Anders. 2007. Patrick Blackburn and Johan Bos, Representation and inference for natural language. Studia Logica 85(3): 413-418. (ERIH Category A)

Editorial work

Søgaard, Anders; Haugereid, Petter (eds.). 2009. Typed feature structure grammars. Berlin: Peter Lang GmbH. [Link]

Søgaard, Anders; Haugereid, Petter (eds.). 2007. Proceedings of the 2nd International Workshop on Typed Feature Structure Grammars. Center for Language Technology Working Papers 8.

Danish journals

Søgaard, Anders. 2010. Maskinoversættelse - giver det mening? Mål & Mæle.

Søgaard, Anders. 2003. Semantik og æstetik. Ny Poesi June 2003.

Søgaard, Anders. 2003. Om kognitiv morfologi. Semikolon 3(7): 81-91.

Søgaard, Anders. 2003. Mellem rum der blot er rum. Den Blå Port 59: 28-35.

Søgaard, Anders. 2002. En introduktion til den kognitive lingvistik. Apparatur 4: 1-15.

Søgaard, Anders. 2002. Noter vedrørende den generative morfologi. Semikolon 2(4): 79-83.

Søgaard, Anders. 2001. Watten, Pöppel, Lakoff, Turner. Kritik 153: 9-12.

Invited talks and seminars

Søgaard, Anders. 2013. My Statistics 101 for corpus linguistics. UCPH PhD Course. Copenhagen, Denmark.

Søgaard, Anders. 2013. Computers and language. Dpt.~of Psychology. Copenhagen, Denmark.

Søgaard, Anders. 2013. Beating the life out of Twitter. BioComplexity Wednesday Meetings, UCPH Niels Bohr Institute. Copenhagen, Denmark.

Søgaard, Anders. 2013. Learning linguistic models for big data analysis. DeIC Conference on the Future of e-Science. Middelfart, Denmark.

Søgaard, Anders. 2013. 6,909 reasons to mess up your data. Plenary talk, NODALIDA 2013. Oslo, Norway.

Søgaard, Anders. 2013. Learning with antagonistic adversaries. Language Technology and Learning. Uppsala, Sweden.

Søgaard, Anders. 2013. Censorship as a game. Rethinking Censorship. University of Copenhagen. Copenhagen, Denmark.

Søgaard, Anders. 2013. Parsing in the streets. Heinrich-Heine University. Düsseldorf, Germany.

Søgaard, Anders. 2012. Principles in evaluation and basic statistical issues. ELDA. Paris, France.

Søgaard, Anders. 2012. Crash-course in machine learning. University of Gothenburg. Gothenburg, Sweden.

Søgaard, Anders. 2012. Crash-course in machine learning. PhD Course in Corpus Linguistics. University of Copenhagen. Copenhagen, Denmark.

Søgaard, Anders. 2012. Learning under bias in NLP. Fred Jelinek Seminar Series (7th lecture). Charles University. Prague, the Czech Republic. [Video]

Søgaard, Anders. 2012. Learning under bias in NLP (5-day course). European Summer School in Logic, Language and Information. Opole, Poland. Slides can be found here, here, here, here, and here.

Søgaard, Anders. 2012. Perceptron learning under sample bias. University of Gothenburg. Gothenburg, Sweden.

Søgaard, Anders. 2011. Crash-course in machine learning. CLARA Summer School in Semantic and Nonverbal Corpus Annotation and Evaluation. Copenhagen, Denmark.

Søgaard, Anders. 2011. Opinion mining - thumbs up? Sprogteknologisk Forum 2011. Copenhagen, Denmark.

Søgaard, Anders. 2010. On the usefulness of word representations. Copenhagen Symposium on Approaches to the Lexicon. Copenhagen, Denmark.

Søgaard, Anders. 2010. Two robust semi-supervised learning algorithms for natural language processing. Cambridge University. Cambridge, England.

Søgaard, Anders. 2009. Integrating ensemble-based and semi-supervised dependency parsing. Uppsala University. Uppsala, Sweden.

Søgaard, Anders. 2009. Computers, math, and the humanities. One-day seminar on the humanities. University of Copenhagen. Copenhagen, Denmark.

Søgaard, Anders. 2009. Model-checking in parsing natural language. One-day seminar on hybrid logic. Roskilde University. Roskilde, Denmark.

Søgaard, Anders. 2008. Cubic time extensions of context-free grammar and beyond. Computerlinguistisches Kolloquium, Universität Potsdam. Berlin, Germany.

Søgaard, Anders. 2007. Grammar theories and possible languages. The Linguistic Circle of Copenhagen. Copenhagen, Denmark.

Søgaard, Anders. 2007. Hierarchies of unification grammars. Copenhagen Business School. Copenhagen, Denmark.

Søgaard, Anders. 2007. The complexity of reentrancy, unordering and synchronism. Dpt. of Linguistics, Universität Potsdam. Berlin, Germany.

Søgaard, Anders. 2007. Model-theoretic syntax in polyadic propositional dynamic logic. One-day seminar on hybrid logic (HyLoMOL). Roskilde, Denmark.

Søgaard, Anders. 2007. Logic and the adequacy of linguistic theories. Kolloquium/Ringvorlesung des Graduiertenkolleg Wissensrepräsentation, Universität Leipzig. Leipzig, Germany.

Søgaard, Anders. 2006. Logic, languages and linguistics. Seminar for students of Cognitive Artificial Intelligence at Utrecht University, the Netherlands. Copenhagen, Denmark.

Søgaard, Anders. 2006. Hybrid logic and natural language recognition. One-day seminar on hybrid logic (HyLoMOL). Roskilde, Denmark.

Søgaard, Anders; Haugereid, Petter. 2005. Specification in the Scandinavian Grammar Matrix. Linguistiches Graduiertenkolloquium 2005, Universität Bremen. Bremen, Germany.

Søgaard, Anders; Haugereid, Petter. 2004. The Scandinavian NP: a multilingual perspective. The 3nd Meeting of the Scandinavian Network of Grammar Engineering and Machine Translation. Gothenburg, Sweden.

Søgaard, Anders. 2004. Compound semantics in Danish: implementing qualia structure. The 2nd Meeting of the Scandinavian Network of Grammar Engineering and Machine Translation. Copenhagen, Denmark.

Søgaard, Anders. 2004. On formal semantics. Filosofisk Studenter-Kollokvium. Århus, Denmark.

Projects

Semantic processing across domains (2014-17). Research project supported by the Danish Research Council. Led jointly with Bolette S. Pedersen.

LOWLANDS: Parsing low-resource languages and domains (2013-17). Research project (ERC Starting Grant) supported by the European Research Council.

Experience-oriented sharing of health knowledge via information and communication technology (2010-13). Research project supported by the Danish Research Council. Led by Bente Maegaard. [Link]

Efficient syntax and semantics-based machine translation (2008-9). Research project supported by the Danish Research Council.

Ptolemaios: on grammar learning from parallel corpora. Research project supported by the German Research Council. Led by Jonas Kuhn. [Link]

The Scandinavian Grammar Matrix (2004-5). Research project supported by The Nordic Academy of Sciences. [Link]

Workshops

ROBUS-UNSUP 2012: Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP, European Association for Computational Linguistics (EACL), April 23 in Avignon, France. [Link]

Workshop on Robust Unsupervised and Semisupervised Methods for Natural Language Processing, Recent Advances in Natural Language Processing 2011 (RANLP), September 15-16 in Hissar, Bulgaria. [Link]

The 2nd International Workshop on Typed Feature Structure Grammars (TFSG), The 16th Nordic Conference of Computational Linguistics, May 24 2007 in Tartu, Estonia. [Link]

Workshop on Typed Feature Structure Grammars, The 22nd Scandinavian Conference of Linguistics, June 20 2006 in Aalborg, Denmark. [Link]

Other research activities

Member of Association of Computational Linguistics, Association of Symbolic Logic, European Association for Computer Science Logic, Gesellschaft für Semantik, Linguistic Circle of Copenhagen (Former member of the Committee), International Association for Computing and Philosophy, Northern European Association for Language Technology , Scandinavian Network of Grammar Engineering and Machine Translation, and The Danish Writers' Association (Member of the Committee of the Poetry Group) [Link].

Conference review committees: LNCS vol. 4012, LENLS 2006, TFSG-I, ACL-SRW 2007, NODALIDA 2007, LENLS 2007, ACL 2008, ACL-SRW 2008, M4M-6, NODALIDA 2009, LREC 2010, NA-CAP 2010, NODALIDA 2011, Formal Grammar 2011, DiSCO @ ACL 2011, EACL-SRW 2012, Formal Grammar 2012, LREC 2012, ESIRMT-HyTra @ EACL 2012, *SEM @ NAACL 2012, KONVENS 2013, NODALIDA 2013, Formal Grammar 2013, HyTra @ ACL 2013, ACL 2013, MT-Summit 2013, IJCNLP 2013, IWPT 2013, EMNLP 2013, EACL 2014, EACL-SRW 2014, LREC 2014, ACL 2014 (two tracks), HyTra @ EACL 2014, DeepLP4QApp @ EACL 2014, COLING 2014 (two tracks), EMNLP 2014.

Journal reviewing: Applications and Applied Mathematics, Artificial Intelligence, Computational Linguistics, IEEE Transactions on Audio, Speech and Language Processing, Journal of Logic, Language and Information, Language Resources and Evaluation Journal, Minds and Machines, Yearbook of Morphology 2011.

I have also reviewed for the Czech Science Foundation, the United States-Israel Binational Science Foundation, and the Volvo Foundation.

Scholarships and grants: Fondet til støtte af datamatisk lingvistik (2), Fullbright Research Scholarship, Hotelejer Anders Månsons og hustru Hanne Månssons legat, Konsul Axel Nielsens Mindelegat, Nordic Academy for Advanced Study (2), Microsoft Corporation, Kurt Gödel Society and Research Priority Area Body and Mind, University of Copenhagen, Elite Research Scholarship (The Danish Ministry of Science, Technology and Innovation), The Danish Research Council for the Humanities (2), Fagforbundet for Sprog og Kommunikation's Research Prize 2010, ERC Starting Grant 2012.

Supervisor for Martin Haulrich, MA thesis in Computational Linguistics (CBS): "Disambiguation by minimal model generation" (2007), Cristoph Teichmann, Magisterarbeit in Computational Linguistics (University of Potsdam): "Unification grammars and the Weir hierarchy" (2008), Christian Rishøj Jensen, MA thesis in IT & Cognition (University of Copenhagen): "Feature engineering in data-driven dependency parsing" (2009), Florian Marienfeld, Diplomarbeit in Computer Science (Technical University, Berlin): "A comparison of various metrics for word alignment quality" (2010), Dieuwke van Mulligan, MA thesis in IT & Cognition (University of Copenhagen): "Ask Eva: practical question answering" (2011), Troels Kjeldberg, MA thesis in IT & Cognition (University of Copenhagen): "Ensemble-based word sense induction" (2011), Kim Simonsen, MA thesis in IT & Cognition (University of Copenhagen): "Applying meta learning to explore the usage of semi-supervised and supervised support vector machines exemplified through text classification" (2012), Sigrid Klerke, MA thesis in IT & Cognition (University of Copenhagen): "Automatic text simplification for Danish" (2012), David Svendsen-Tune, MA thesis in IT & Cognition (University of Copenhagen): "Now you see me, now you don't: anonymization of legal documents" (2013), Franz Matties, MA thesis in IT & Cognition (University of Copenhagen): "Predicting eye movements in reading" (2013), Anders Johannsen, PhD thesis in language technology (University of Copenhagen): "The best explanation: beyond right and wrong in question answering" (2013), Rune Laugesen, MA thesis in IT & Cognition (University of Copenhagen): "Word completion for virtual keyboards using syntax and semantics" (2014).

I currently supervise PhD student Sigrid Klerke, as well as MA students Hege Fromreide, Claire Joyce, Fulvio Rizzollo and Julie Wulff.

PhD Committees: David Marecek, Charles University Prague; Oscar Täckström, Swedish Institute of Computer Science, Stockholm; Peter Exner, Lund University; Cristoph Teichmann, University of Leipzig.

Interviews (Danish)

2010. Sammenfald mellem kunst og forskning. ForskerForum 240, December 2010. [Link]

2010. DM-medlem finder Google Translates svagheder. Magisterbladet 12 2010. [Link].

2010. Matematisk litterat. Politiken July 5 2010. [Link]

2010. Dansk forskning vil fjerne Googles oversætterfejl. Politiken June 23 2010. [Link]

2008. Sudoku-elementet. Humanist 1/2008. [Link]

Other recent interviews and appearances: Radio24syv (AK 24syv) [Link], P1 (Sprogminuttet) [Link], PROSA [Link], Sydsvenskan [Link], University Post, Videnskab.dk/JyllandsPosten [Link].



Blå linie
Njalsgade 140-142, bygn. 25, DK-2300 KBH S
Tlf: +45 35329090 - Fax: +45 35329089
Valid HTML 4.01 Strict