Once upon a time
The development of Frog’s modules started in the nineties at the ILK
Research Group (Tilburg University, the Netherlands) and the CLiPS
Research Centre (University of Antwerp, Belgium). Most modules rely on
Timbl, the Tilburg memory-based learning software package
:raw-latex:`\cite{timbl}` or MBT the memory-based tagger-generator
:raw-latex:`\cite{mbt}`. These modules were integrated into an NLP
pipeline that was first named MB-TALPA and later Tadpole
:raw-latex:`\cite{Tadpole}`. Over the years, the modules were refined
and retrained on larger data sets and the latest versions of each module
are discussed in this chapter. We thank all programmers who worked on
Frog and its predecessors in chapter [ch-credit].
The CliPS Research Centre also developed an English counterpart of Frog,
a python module called MBSP (MBSP website:
http://www.clips.ua.ac.be/pages/MBSP).
Credits
If you use Frog for your own work, please cite this reference manual
Frog, A Natural Language Processing Suite for Dutch, Reference
guide, Iris Hendrickx, Antal van den Bosch, Maarten van Gompel en Ko
van der Sloot, Language and Speech Technology Technical Report
Series 16-02, Radboud University Nijmegen, Draft 0.13.1 - June 2016
The following paper describes Tadpole, the predecessor of Frog. It
contains a subset of the components described in this paper:
Van den Bosch, A., Busser, G.J., Daelemans, W., and Canisius, S.
(2007). An efficient memory-based morphosyntactic tagger and parser
for Dutch, In F. van Eynde, P. Dirix, I. Schuurman, and V.
Vandeghinste (Eds.), Selected Papers of the 17th Computational
Linguistics in the Netherlands Meeting, Leuven, Belgium, pp. 99-114
We would like to thank everybody who worked on Frog and its
predecessors. Frog, formerly known as Tadpole and before that as
MB-TALPA, was coded by Bertjan Busser, Ko van der Sloot, Maarten van
Gompel, and Peter Berck, subsuming code by Sander Canisius (constraint
satisfaction inference-based dependency parser), Antal van den Bosch
(MBMA, MBLEM, tagger-lemmatizer integration), Jakub Zavrel (MBT), and
Maarten van Gompel (Ucto). In the context of the CLARIN-NL
infrastructure project TTNWW, Frederik Vaassen (CLiPS, Antwerp) created
the base phrase chunking module, and Bart Desmet (LT3, Ghent) provided
the data for the named-entity module.
Maarten van Gompel designed the FoLiA XML output format that Frog
produces, and also wrote a Frog binding for Python , as well as a
separate Frog client in Python . Wouter van Atteveldt wrote a Frog
client in R , and Machiel Molenaar wrote a Frog client for
Go .
The development of Frog relies on earlier work and ideas from Ko van der
Sloot (lead programmer of MBT and TiMBL and the TiMBL API), Walter
Daelemans, Jakub Zavrel, Peter Berck, Gert Durieux, and Ton Weijters.
The development and improvement of Frog also relies on your bug reports,
suggestions, and comments. Use the github issue tracker at
https://github.com/LanguageMachines/frog/issues/ or mail lamasoftware
@science.ru.nl.