Preparations before running the pipeline:

1. Extract resources.zip
2. Modify preprocessing.ini

The most important settings to change are:
- $RESOURCES_FOLDER: the folder where the contents of resources.zip have been extracted to
- INPUT_FOLDER: the folder where your input documents are located
- OUTPUT_FOLDER: the folder where the processed documents should be saved to

In order to activate/deactivate engines, set them to active=true or active=false.

Running the pipeline: call the following via the command line:

    java -jar nlp-pipeline.jar preprocessing.ini
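
The preparation and run steps can be sketched as a small shell script. This is only an illustration under stated assumptions: the helper names extract_resources, show_settings and run_pipeline are hypothetical and not part of the pipeline, the extraction target folder "resources" is an arbitrary choice, and the grep pattern assumes the settings appear in preprocessing.ini as plain key=value lines.

```shell
#!/bin/sh
# Sketch only. The function names and the "resources" target folder are
# assumptions made for illustration; adjust them to your setup.

# Step 1: extract resources.zip into a folder (default: ./resources).
# The folder chosen here is the one $RESOURCES_FOLDER should point to.
extract_resources() {
    unzip -o resources.zip -d "${1:-resources}"
}

# Step 2 helper: print the lines of the config file that mention the
# settings to review (assumes plain key=value lines).
show_settings() {
    grep -n -E 'RESOURCES_FOLDER|INPUT_FOLDER|OUTPUT_FOLDER|active *=' "$1"
}

# Run the pipeline with the given config file (default: preprocessing.ini).
run_pipeline() {
    java -jar nlp-pipeline.jar "${1:-preprocessing.ini}"
}
```

After sourcing the script from the folder that contains resources.zip, nlp-pipeline.jar and preprocessing.ini, calling extract_resources, then show_settings preprocessing.ini, then run_pipeline follows the order described above. Editing preprocessing.ini itself remains a manual step.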