PyPDF2
python-docx
docx2txt
Pillow
pytesseract
langchain
unstructured
