I am trying to create a validator that takes in words and tries to determine if the word is one of the following:

  1. It is a valid English word
  2. It is a part of an English word
  3. It is an abbreviation
  4. It is a portmanteau -- a word created by concatenating parts of valid English words

Are there Java or Python libraries/frameworks that can perform this task?

Samples of words: meds, ppg, reauthorization, appmetadata, reconsent, rawlog

I've tried Python NLTK (cursory investigation so far) and a Python library called enchant (this fails to identify many valid words/parts of words and portmanteaus).

0

There are 0 best solutions below