I am trying to make predictions with Bert, where the prediction of subsequent words in the first input of each paragraph may be aided, so can Bert make use of previous predictions in subsequent predictions like an autoregressive language model?

0

There are 0 best solutions below