How do I convert the output data from the decoder LSTM back to words?


I have an encoder decoder network mimicking the one produced in this tutorial: https://towardsdatascience.com/how-to-implement-seq2seq-lstm-model-in-keras-shortcutnlp-6f355f3e5639

However, the output of the decoder LSTM consists of numbers between 0 and 1, while the words in the tutorial were tokenized as integers. How do I convert this output back to words using that tokenization?

The other option could be to use one-hot encoding for the tokens, but surely you would still have to round the outputs to turn the floats into integers?


There are 2 best solutions below

Andrey

Use tf.math.argmax() to find the index of the most likely word at each output position, then look that index up in your vocabulary. A minimal sketch is shown below.
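Here is a small, self-contained sketch of that idea. The random softmax tensor and the tiny `index_word` dictionary are stand-ins for the real model output and the Keras Tokenizer's `index_word` mapping from the tutorial:

```python
import tensorflow as tf

# Stand-in for the decoder's prediction: in the tutorial this would come from
# model.predict(...) and have shape (batch_size, seq_len, vocab_size).
vocab_size = 15000
decoder_output = tf.nn.softmax(tf.random.normal((2, 10, vocab_size)), axis=-1)

# Pick the index of the highest-scoring word at every output position.
token_ids = tf.math.argmax(decoder_output, axis=-1).numpy()  # shape (2, 10)

# Stand-in for tokenizer.index_word, the integer-to-word mapping built by the
# Keras Tokenizer; index 0 is reserved for padding.
index_word = {i: f"word{i}" for i in range(1, vocab_size)}

sentences = [
    " ".join(index_word[i] for i in sample if i != 0)  # skip padding index 0
    for sample in token_ids
]
print(sentences[0])
```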

Pedro Marques

The tutorial you quote uses a vocabulary of 15,000 tokens and outputs a sequence of 10 tokens per sample, so each of the 10 output positions is a vector of 15,000 floats. You can think of these as probabilities: ideally the 15,000 floats sum to one, and the highest score tells you which token was chosen. On the input side, each token is converted into a one-hot encoded vector of size 15,000; on the output side, a softmax layer produces a score per vocabulary entry at each position, and the highest score gives you the selected token.
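To make both directions concrete, here is a runnable sketch under assumed data: a toy corpus and a fake softmax distribution stand in for the tutorial's 15,000-word vocabulary and the model's actual predictions. It shows the integer-to-one-hot conversion on input and the argmax-over-softmax lookup on output:

```python
import numpy as np
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.utils import to_categorical

# Toy corpus standing in for the tutorial's training texts.
texts = ["how are you", "i am fine thank you"]
tokenizer = Tokenizer(num_words=15000)
tokenizer.fit_on_texts(texts)
vocab_size = len(tokenizer.word_index) + 1  # +1 for the padding index 0

# Input side: words -> integer ids -> one-hot vectors of length vocab_size.
ids = tokenizer.texts_to_sequences(["i am fine"])[0]
one_hot = to_categorical(ids, num_classes=vocab_size)   # shape (3, vocab_size)

# Output side: the softmax layer yields one distribution per position;
# a random distribution stands in for the model's prediction here.
fake_softmax = np.random.dirichlet(np.ones(vocab_size), size=3)
predicted_ids = fake_softmax.argmax(axis=-1)            # highest score per position
words = [tokenizer.index_word.get(i, "?") for i in predicted_ids]
print(words)
```

The key point is that no rounding is involved: the float outputs are scores over the whole vocabulary, and argmax picks the integer id that the Tokenizer's `index_word` mapping turns back into a word.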