Skip to content

delph-in/pet

Repository files navigation

This directory contains the open source PET system. For license details 
please refer to the LICENSE file.

Overview
========

PET consists of two major parts:

1) The `flop' preprocessor. It compiles a given grammar in TDL
   format into a binary form usable by the runtime system.

2) The `cheap' parser. It parses input with respect to a grammar in the
   binary form produced by `flop'.

There are no binaries provided due to the vast amount of different 
distributions around. You have to first compile the package to be able to
run it (see below).

Grammars
========

Two freely available open source grammars that are compatible with PET
are the ERG (English, available from http://lingo.stanford.edu/ftp/erg.tgz,
or http://lingo.stanford.edu) and JACY (Japanese,
http://www.dfki.de/~siegel/grammar-download/JACY-grammar.html).

If you want to contribute a grammar, or if you are looking for more
grammars please have a look at www.delph-in.net

Compiling PET
=============

In order to compile the source code, you first have to make sure that all
required and optional libraries as well as the corresponding header files
are installed on your system (see below under section `Dependencies').

PET uses the GNU Build System (aka GNU Autotools). GCC/g++ version > 3.1.2
is known to work fine.

If you have extracted a tarfile version, change into the directory
containing the `configure' script and type:

  ./configure --help

Have a look at the available options, and configure your system as needed.
If configure has been executed successfully, run `make' to build the
software, and finally `make install' to install it (depending on the
install paths, which can be influenced with `configure', this might
require administrator privileges).

Advanced issues:

* Non-standard preprocessor, compiler or linker options can be passed to
  configure as command line arguments (cf. ./configure --help).
  For example, the following line configures PET for best performance:
  
    ./configure --disable-assert "CXXFLAGS=-g -O3"

  The following line configures PET for the use with a debugger:
  
    ./configure "CXXFLAGS=-g -O0"
  
  The following line configures PET for profiling with gprof and gcov
  (cf. gcc man page):
  
    ./configure "CXXFLAGS=-pg -fprofile-arcs -ftest-coverage"

* The root build directory is the directory where `configure' is called.
  That is, PET can be built in any directory by changing into that
  directory and calling `configure' from there. For example:
  
    mkdir -p build/debug
    cd build/debug
    ../../configure "CXXFLAGS=-g -O0"
  
  All Makefiles will be created in the directory where `configure' is called.
  The advantage of having different build directories is that many
  configurations can be tested and maintained at the same time, and that the
  binary files are kept separate from the source files.

* A tarball distribution can be built and checked with `make distcheck'.
  Use the DISTCHECK_CONFIGURE_FLAGS environment variable to pass any
  parameters to configure. Example:
  
    export DISTCHECK_CONFIGURE_FLAGS="--with-xml"
    make distcheck

Bootstrapping PET from SVN
==========================

If you checked out your first SVN snapshot, you first have to run a current
autoconf and autoheader (both >=2.59), aclocal and automake (both >= 1.7) in
order to create the build system. Change into the directory containing
`configure.ac' and type:

  autoheader && aclocal -I m4 && automake -a && autoconf

or simply

  autoreconf -i

Having run these commands, proceed with configuration as described above.

Dependencies
============

Most current GNU/Linux distributions provide packages for the required and
optional libraries. Installing such packages usually requires administrator
privileges. Note that the header files are often distributed in separate
packages, usually ending in `-dev' or `-devel' (e.g. `libboost-dev'). Make
sure that these packages are installed, too.

If you install your own libraries in non-standard directories, make sure
that all these shared libraries and their dependencies can be found when
PET's configure is run. There are several ways to achieve this (e.g., using
ldconfig or the LD_LIBRARY_PATH environment variable, cf. the Program Library
HOWTO). The dependencies of a program or library can be inspected with ldd.
http://tldp.org/HOWTO/Program-Library-HOWTO/.

Required:

Boost provides free portable C++ libraries for various applications.
flop and cheap both use the Boost Graph library.
Current GNU/Linux distributions usually provide this library as a pre-built
package. Make sure that the header files are installed, too (see above).
http://www.boost.org/

Optional:

If you want UniCode support in PET, you need the ICU package freely
available from IBM.
Current GNU/Linux distributions usually provide this library as a pre-built
package. Make sure that the header files are installed, too (see above).
http://icu.sourceforge.net/download/

Install [incr tsdb()] for creating performance profiles of the grammar and
the system, and for parsing corpora.
http://www.delph-in.net/itsdb/

For the MRS output and FSPP preprocessor modules, you need a version of
Embeddable Common Lisp (ECL for short).
ATTENTION: The version that is known to work at the moment is 0.9h, while
0.9i seems to have problems. While some GNU/Linux distributions provide the
library as a pre-built package, this might be a newer version or it might
be compiled with the wrong options. In this case, you need to install it
yourself, which might also be problematic (e.g. on Ubuntu Linux 7.04,
ecl 0.9h does not compile without some modifications).
http://ecls.sourceforge.net/

For the XML input modes you need the Apache Xerces C++ library.
Current GNU/Linux distributions usually provide this library as a pre-built
package. Make sure that the header files are installed, too (see above).
http://xml.apache.org/xerces-c/

For unit tests you can optionally use cppunit. Note that so far only small
parts of the system take advantage of this.
Current GNU/Linux distributions usually provide this library as a pre-built
package. Make sure that the header files are installed, too (see above).
http://cppunit.sourceforge.net/

Doxygen compatible documentation is included in most header files and some
of the source files. Call 'make doc' in the root build directory to build
the API documentation.
Current GNU/Linux distributions usually provide this program as a pre-built
package.
http://www.doxygen.org/

Layout of the sources
=====================

common/ contains the sources shared between `flop' and `cheap'.

flop/ contains the sources for the preprocessor.

cheap/ contains the sources for the parser.

fspp/  lisp-based input preprocessor module for cheap. Very slow.

goofy/ contains the sources for an outdated GUI prototype. This hasn't been
compiled for two years and probably won't work.

Contact
=======

For bug reports or feature requests, check the ticket system at
 
  http://pet.opendfki.de

For questions and comments, please have a look at the mailing list
  
  [email protected]

or contact me directly at

  [email protected]


About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published