I have a database and API for hindi wordnet. I want to access this wordnet from NLTK python, so as to use NLTK Wordnet functions with our wordnet. Is there any way to add our own wordnet into NLTK? Or Are there any tools for Word Sense Disambiguation in Hindi (that can work with any Language Wordnet with some modifications) (which gives most suitable sense from wordnet)?
I have a database and API for hindi wordnet. I want to access this wordnet from NLTK python. Is there any way to add our own wordnet into NLTK?
1.2k Views Asked by Aniruddha Tammewar At
1
There are 1 best solutions below
Related Questions in PYTHON
- How to store a date/time in sqlite (or something similar to a date)
- Instagrapi recently showing HTTPError and UnknownError
- How to Retrieve Data from an MySQL Database and Display it in a GUI?
- How to create a regular expression to partition a string that terminates in either ": 45" or ",", without the ": "
- Python Geopandas unable to convert latitude longitude to points
- Influence of Unused FFN on Model Accuracy in PyTorch
- Seeking Python Libraries for Removing Extraneous Characters and Spaces in Text
- Writes to child subprocess.Popen.stdin don't work from within process group?
- Conda has two different python binarys (python and python3) with the same version for a single environment. Why?
- Problem with add new attribute in table with BOTO3 on python
- Can't install packages in python conda environment
- Setting diagonal of a matrix to zero
- List of numbers converted to list of strings to iterate over it. But receiving TypeError messages
- Basic Python Question: Shortening If Statements
- Python and regex, can't understand why some words are left out of the match
Related Questions in NLTK
- Issue in loading model in recommender system using streamlit
- The chatbot code works well on the console but not when deployed on the website
- Comparison between stemmiation and lemmatization
- How can i get the first content of a python synsets list?
- NameError: name 'sense2vec_instance' is not defined
- Problems with training a model with pytorch
- How I get precision, recall, and f1-score from nltk.naivebayesclassifier?
- removing paywall language from piece of text (pandas)
- How do I randomize responses?
- Why is my NLTK bot not working correctly?
- Inserting XML tags at specific part of file without disrupting format
- Why does KMeansClusterer from NLTK take a long time to execute with my user-item rating matrix?
- Shorten product title to a specific length using python nlp libraries
- NLTK, SSL Certificate Error, No module named pip
- how to include NLTK wordnet in a PYPI package
Related Questions in WORDNET
- NLTK, SSL Certificate Error, No module named pip
- Are nltk.corpus.wordnet and nltk.corpus.reader.wordnet different?
- How to know the semantic similarity of words in a text using word2vec or WordNet in R?
- Language confusion using Open Multilingual Wordnet with NLTK
- Lemmas are 'Form' objects
- Error while using "en-dictionary" and "en-wordnet" library
- How to use SemEval or SemCor dataset for word sense disabiguation model?
- Using WordNet and the program NLTK on Python, how can I check how many lemmas each language has in WordNet?
- Python NLTK Wordnet Issue
- The meaning of freq(w) in JCN similarity
- NLTK lemmatizer changing "less" to "le". Text doesn't make sense anymore
- NLTK Add Text to Dataset
- How to select a random word from WordNet Library C#
- What is the correct way of extracting hypernyms from words with wordnet library?
- NLTK wordnet not found
Related Questions in HINDI
- microsoft from 2.0 controler textbox to retrieve the hindi content in vba
- Issue generating Marathi Language PDF in Flutter
- Remove everything from text except Hindi and English letters and numbers and punctuation
- i am creating a textarea,which allows user to input in hindi language. But when i am entering in textarea it is not showing text in hindi language
- Reading Non-english text using pypdf2
- Incorrect Hindi text rendered with DITA OT 3.7.1 and FOP 2.6
- Regex expression to validate ONLY hindi (devnagri) letters in PYTHON
- Jspdf unable to show specific combination of hindi letters specifically the letter क्षा
- Converting a Hindi PDF to Editable Text in Docx with Proper Text Encoding Detection and Conversion
- How can I draw devanagari characters on a screen?
- Hindi accent("मात्रा") issue in word cloud using Devnagri font(gargi.ttf)
- Hindi text is not visible completely in Inno Setup WelcomeLabel2
- Ruby regex is modifying the user input when gsub using regex in regional languages?
- I want to convert 'Name' data of customers in English Text, present/written with me in Hindi Language text
- Hindi words length
Related Questions in WSD
- How to get a list of devices enumerated in Explorer network using C++
- Is there any way to extract the sense_id or sense_key of a particular word from semcor-tagged sentences?
- "An error occurred when verifying security for the message." WSDL Zeep
- Using WSDCreateHttpAddress
- Why CXF won't annotate bare parameter with @XmlJavaAdapter?
- How to extract feature in Hindi Word Sense Disambiguation task
- Filter Babelnet synsets by category
- WS Discovery vs. SSDP - Pros and Cons
- entity recognition and disambiguation api
- Soap Unable to connect from one server yet the same file connects from another
- Word sense disambiguation using WEKA
- how can I get lesk similarity score for all the senses of given words using ws4j library?
- I have a database and API for hindi wordnet. I want to access this wordnet from NLTK python. Is there any way to add our own wordnet into NLTK?
- What other inputs are there to Word Sense Disambiguation task?
- calculate Similarity of two adverbs or two adjectives
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
If you look in your nltk_data folder, you'll see that wordnet like every other NLTK corpus is just a bunch of plain-text files. So, there must be a way to format your Hindi wordnet the same way as the NLTK one to use the functions. Here is the excerpt from the nltk.corpus.reader.wordnet object where these files are being read:
I suppose you don't really need to generate all these files but more importantly have to use the "index.sense" file for Word Sense Disambiguation. This is not generated by NLTK but have to be pre-processed before that or must be coming with your Hindi wordnet in the following format - http://wordnet.princeton.edu/wordnet/man/senseidx.5WN.html.
After you've done all steps I would just go to ../nltk/corpus/reader/wordnet.py and either create a copy of it where you can change the root and filenames and maybe some other dependencies but still use the functionality OR change what you need within existing classes (not recommended).
P.S. A little of googling gave me the link to http://www.cs.utexas.edu/~rashish/cs365ppt.pdf, which references a bunch of other sources on the subject.