
I'm trying to get the text out of a Word document and I can't do it, at some point I looked at the document and saw that I had the text in some strange "blocks" and there was no information on how to work with it, help
from docx import Document
doc = Document('XXXXXXXXXXXXXXXX.docx')
for para in doc.paragraphs:
print(para.text)
Try using other libraries such as "python-docx2txt" or "textract". Because docx might not support all the formatting in the Word document.