Biotex

Biotex is an Automated Term Extractor (ATE) for biomedical terms (see here for more details).

This repository provides two ways to run Biotex (Python and Java). Improvements made:

  • Easy execution
  • Python wrapper
  • Parameters configuration without changing the source code
  • ...

Installation

Requirements

  • Python 3
  • Java 7-8

Install

To get started, clone this repository:

git clone <repoAddress>

Then, from inside the cloned directory, install the package with the following command:

(sudo) pip3 install .

Example

from biotex import BiotexWrapper
wrapper = BiotexWrapper(lang="fr")
corpus = []  # load your corpus here
wrapper.create_corpus_from_txt(corpus)
terminology = wrapper.extract_terminology("output.txt")
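
The example above leaves corpus loading open. As a minimal sketch, the corpus could be built by reading plain-text files from a directory. The helper name load_corpus and its parameters are hypothetical and not part of Biotex, and the sketch assumes that create_corpus_from_txt accepts a list of document strings, which the bracketed placeholder suggests but this README does not confirm.

```python
from pathlib import Path


def load_corpus(directory, pattern="*.txt", encoding="utf-8"):
    """Hypothetical helper (not part of Biotex): read every file matching
    `pattern` in `directory` and return the contents as a list of strings,
    one string per document."""
    corpus = []
    # Sort the paths so the document order is reproducible across runs.
    for path in sorted(Path(directory).glob(pattern)):
        text = path.read_text(encoding=encoding).strip()
        # Skip files that are empty or contain only whitespace.
        if text:
            corpus.append(text)
    return corpus
```

With such a helper, the placeholder line could become corpus = load_corpus("my_documents"), where "my_documents" is an illustrative directory name, before the list is passed to wrapper.create_corpus_from_txt(corpus).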