Products
Code
- AMR-to-English generator: Converts Abstract Meaning Representations (AMRs) into English sentences. Built by Nima Pourdamghani.
- ASTRAPOP: Tool for authorship style transfer. Built by Shuai Liu.
- Bolinas: Hyperedge replacement transducer package for graphs. Built by Jacob Andreas, Daniel Bauer, David Chiang, Karl Moritz Hermann, Bevan Jones, and Kevin Knight.
- Carmel: Finite-state transducer package for strings. Built by Jonathan Graehl. Latest version on GitHub.
- English-to-AMR parser: Converts English sentences into Abstract Meaning Representations (AMRs). Built by Michael Pust, Ulf Hermjakob, Kevin Knight, Daniel Marcu, and Jonathan May (Download size = 719Mb).
- EUREKA: CPU-based neural LSTM sequence-to-sequence modeling toolkit. Built by Ashish Vaswani.
- Monogiza: Extracts a word-for-word translation table from non-parallel corpora. Built by Qing Dou.
- MTData: A tool capable of retrieving thousands of parallel datasets for machine translation research. Built by Thamme Gowda.
- NLCodec and NLDb: A scalable tool for mapping words, characters, and BPE subwords into integer sequences, and a storage layer for efficiently storing and retrieving large-scale datasets. Built by Thamme Gowda.
- NPLM: Neural probabilistic language model toolkit. Built by Ashish Vaswani, with contributions from David Chiang and Victoria Fossum.
- Reader Translator Generator (RTG): A feature-rich neural machine translation toolkit based on PyTorch, with a focus on reproducible experiments. Built by Thamme Gowda.
- ReWrite Decoder: Greedy decoder for IBM SMT models. Built by Daniel Marcu and Ulrich Germann.
- SPADE: Sentence-level discourse parser. Built by Radu Soricut.
- Tiburon: Finite-state transducer package for trees. Built by Jonathan May.
- uroman: Converts text in any script to the Latin alphabet. Click here to install. Built by Ulf Hermjakob.
- utoken: Universal tokenizer, i.e., a word segmenter for a wide variety of scripts and languages. Built by Ulf Hermjakob.
- Zoph_RNN: GPU-based neural LSTM sequence-to-sequence modeling toolkit. Built by Barret Zoph.
Demos
- BotEval: Web interface for evaluating chatbots, with optional MTurk integration. Built by Hyundong (Justin) Cho.
- Many-English NMT: A multilingual NMT model that can translate from 500 source languages to English. Built by Thamme Gowda.
- Poetry generator: Creates a poem on any topic. Built by Marjan Ghazvininejad, Xing Shi, Yejin Choi, and Kevin Knight.
- Poetry password demo and assigner: Shows poems created from randomly generated 60-bit passwords. Built by Marjan Ghazvininejad.
- Portmanteau generator: Creates a new word (neologism) from two existing words. Built by Aliya Deri.
- Smatch: Evaluates the output of semantic parsing. Built by Shu Cai.
- Spolin Bot: Chat with our improvisation bot!
APIs
- HowToSpeak: Allows users to speak a language they don't understand, by phonetic rendering. Built by Xing Shi.
Tools
- AMR Editor: Allows human annotators to type in the meanings of English sentences, using the Abstract Meaning Representation framework. Built by Ulf Hermjakob. AMR Editor Overview video.
- RST Annotation Tool: Enables annotators to build Rhetorical Structure Representations for texts. Built by Benjamin Liberman.
- Shannon Game: Collects character-level text predictions from people, in order to estimate the entropy of translation. Built by Marjan Ghazvininejad.
Data
- AMR parsing: This 2016 SemEval challenge asks participants to write software to convert English into Abstract Meaning Representations. Run by Jonathan May.
- Bilingual compression challenge: If we exploit the high redundancy of human-translated texts, what is the best compression rate we can achieve for bilingual texts? Run by Barret Zoph, Kevin Knight, and Marjan Ghazvininejad.
- NewsEdits: A news article revision dataset and a novel document-level reasoning challenge. Built by Alexander Spangher.