I am trying to create a validator that takes in words and tries to determine if the word is one of the following:
- It is a valid English word
- It is a part of an English word
- It is an abbreviation
- It is a portmanteau -- a word created by concatenating parts of valid English words
Are there Java or Python libraries/frameworks that can perform this task?
Samples of words: meds, ppg, reauthorization, appmetadata, reconsent, rawlog
I've tried Python NLTK (cursory investigation so far) and a Python library called enchant (this fails to identify many valid words/parts of words and portmanteaus).