Latent Semantic Analysis and Stemming

314 Views Asked by L D At 14 March 2017 at 23:30

Assume a very large corpus in any inflected language. Does the following make sense? When LSA is applied to such a corpus, words that express similar concepts converge in the vector space, so inflected word forms referring to the same concept should ideally coincide with their lemma in that space. Under that assumption, lemmatizing or stemming the queries or the corpus would not be necessary. Or am I totally wrong?

Ryan Boch On 22 May 2019 at 15:17 BEST ANSWER

According to the originators of LSA, stemming is not necessary. That said, I think the literature is divided on this: I have read a few papers in which stemming improved results on a particular information retrieval task.

More generally, recent research shows that stemming does not help in topic modeling and may even hurt topic coherence.
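
If you want to probe the idea from the question yourself, here is a minimal sketch using NumPy. Everything in it is an assumption made for illustration: the four-document toy corpus is invented, the helper names (`term_document_matrix`, `lsa_term_vectors`, `cosine`) are hypothetical, and LSA is reduced to a plain truncated SVD of raw counts with no tf-idf or log-entropy weighting. It only shows the mechanism by which inflected forms can end up close together; it says nothing about how well that holds on a real corpus.

```python
import numpy as np

# Hypothetical toy corpus, already tokenized and stopword-free.
# "cat"/"cats" and "stock"/"stocks" never share a document, but each
# pair shares context words ("sofa", "milk" / "market", "crash", "panic").
docs = [
    "cat purrs sofa milk",
    "cats purr sofa milk",
    "stock falls market crash panic",
    "stocks fall market crash panic",
]

def term_document_matrix(docs):
    """Build a raw count matrix: one row per term, one column per document."""
    vocab = sorted({word for doc in docs for word in doc.split()})
    row = {word: i for i, word in enumerate(vocab)}
    counts = np.zeros((len(vocab), len(docs)))
    for col, doc in enumerate(docs):
        for word in doc.split():
            counts[row[word], col] += 1
    return row, counts

def lsa_term_vectors(counts, k):
    """Project terms into a k-dimensional latent space via truncated SVD."""
    U, S, _ = np.linalg.svd(counts, full_matrices=False)
    return U[:, :k] * S[:k]  # scale each kept dimension by its singular value

def cosine(a, b):
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

row, counts = term_document_matrix(docs)
latent = lsa_term_vectors(counts, k=2)

# Compare the unstemmed forms before and after the projection.
raw_cat_cats = cosine(counts[row["cat"]], counts[row["cats"]])
lsa_cat_cats = cosine(latent[row["cat"]], latent[row["cats"]])
lsa_cat_stocks = cosine(latent[row["cat"]], latent[row["stocks"]])

print(f"raw cat~cats:   {raw_cat_cats:.2f}")
print(f"LSA cat~cats:   {lsa_cat_cats:.2f}")
print(f"LSA cat~stocks: {lsa_cat_stocks:.2f}")
```

On this toy data the raw count vectors for "cat" and "cats" are orthogonal, while their rank-2 LSA vectors point in essentially the same direction, and "cat" stays unrelated to "stocks". That only happens because the two forms share context words. Rare inflections in a real corpus may not have enough shared context, which is one plausible reason the papers disagree. To compare against stemming, you could run the same script on a stemmed copy of your corpus and see which version works better for your task.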
