beautifulsoup4
pdfminer.six
scikit-learn
PyPDF2
