-
Fize Jacques authored3ecbe3e1
Forked from
Fize Jacques / biotex_python
7 commits behind the upstream repository.
README.md 1.18 KiB
Biotex
Biotex is a Automated Term Extractor (ATE) for Bio-medical terms (see here for more details).
This repository contains two ways of executing Biotex (Python and Java). A list of improvements made:
- Easy execution
- Python wrapper
- Parameters configuration without changing the source code
- ...
Installation
Requirements
- Python 3
- Java 7-8
Install
To make it works, clone this repository using :
git clone <repoAdress>
Then compile the project by using this command :
javac -cp .:JarBioTexExterne.jar:commons-io-2.5.jar Principal.java
Then create the necessary jar with this command :
jar cvfm Biotex.jar Manifest.txt Principal.class JarBioTexExterne.jar commons-io-2.5.jar
Running
Python
In Python, either you import the library and import the ’BiotexWrapper’ class or using this command :
python3 BiotexWrapper.py <input> [-d:debug] [-o:specify output filename] [-s:gramSize selected (ALL to get all gramSize)]
Java
Modify the configuration file 'configuration.txt' and change the value if needed (especially treetagger src). Then use :
java -jar Biotex.jar <input> <outputDir>