Load Freebase vectors using Python


I downloaded freebase-vectors-skipgram1000-en.bin.gz from https://code.google.com/archive/p/word2vec/, renamed it to freebase-vectors-skipgram1000-en.bin, and used it in my code:

import gensim.models.keyedvectors as word2vec
gvc='freebase-vectors-skipgram1000-en.bin'
model=word2vec.KeyedVectors.load_word2vec_format(gvc,binary=True)

It gives me this error:

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x8b in position 1: invalid start byte

I will be grateful for any help regarding this issue.

Answer by J_H:

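Renaming the file does not decompress it. It is still gzip data, which is why the loader fails on byte 0x8b at position 1: that is the second byte of the gzip magic number (0x1f 0x8b). Rather than renaming what you downloaded, use this to properly unpack it: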

$ gunzip -v freebase-vectors-skipgram1000-en.bin.gz
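If gunzip is not available, the same check and unpacking can be sketched in Python with only the standard library. This is a minimal sketch, not part of the original answer: the helper names `is_gzipped` and `decompress_gzip` are made up for illustration, and the output filename in the usage note is an assumption.

```python
import gzip
import shutil

# Every gzip stream starts with these two bytes; the 0x8b at position 1
# in the traceback is the second of them.
GZIP_MAGIC = b"\x1f\x8b"


def is_gzipped(path):
    """Return True if the file at `path` starts with the gzip magic number."""
    with open(path, "rb") as f:
        return f.read(2) == GZIP_MAGIC


def decompress_gzip(src, dst):
    """Stream-decompress `src` into `dst` without reading it all into memory."""
    with gzip.open(src, "rb") as fin, open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)
    return dst
```

For the renamed file from the question, `is_gzipped('freebase-vectors-skipgram1000-en.bin')` should return True. Calling `decompress_gzip` on it with a different output name (for example `freebase-vectors-skipgram1000-en.unpacked.bin`, a hypothetical name) writes the real binary vectors, and that output path is what should be passed to `load_word2vec_format(..., binary=True)`.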