openteamsinc
pypi/unstructured
Risk Profile
Maturity:
Mature
Health:
Healthy
Legal:
Caution Needed
License has additional text that may require review before use
License includes a patent grant clause
Security:
Caution Needed
At least one high or critical severity vulnerability has been reported in the last 600
Package
pypi
unstructured
Version
0.21.5
Last Release Date
9 days ago
License
Apache
Dependencies
beautifulsoup4
<5.0.0, >=4.14.3
charset-normalizer
<4.0.0, >=3.4.4
emoji
<3.0.0, >=2.15.0
filelock
<4.0.0, >=3.12.0
filetype
<2.0.0, >=1.2.0
html5lib
<2.0.0, >=1.1
installer
<1.0.0, >=0.7.0
langdetect
<2.0.0, >=1.0.9
lxml
<7.0.0, >=5.0.0
numba
<1.0.0, >=0.60.0
numpy
<3.0.0, >=1.26.0
psutil
<8.0.0, >=7.2.2
python-iso639
<2027.0.0, >=2026.1.31
python-magic
<1.0.0, >=0.4.27
python-oxmsg
<1.0.0, >=0.0.2
rapidfuzz
<4.0.0, >=3.14.3
regex
<2027.0.0, >=2024.0.0
requests
<3.0.0, >=2.32.5
spacy
<4.0.0, >=3.7.0
tqdm
<5.0.0, >=4.67.3
typing-extensions
<5.0.0, >=4.15.0
unstructured-client
<1.0.0, >=0.25.9
wrapt
<2.0.0, >=1.0.0
all-docs:
google-cloud-vision
<4.0.0, >=3.12.1
markdown
<4.0.0, >=3.10.1
msoffcrypto-tool
<7.0.0, >=6.0.0
networkx
<4.0.0, >=3.2.0
openpyxl
<4.0.0, >=3.1.5
pandas
<4.0.0, >=2.0.0
pdf2image
<2.0.0, >=1.17.0
pdfminer-six
<20270000, >=20251230
pi-heif
<2.0.0, >=1.2.0
pikepdf
<11.0.0, >=10.3.0
pypandoc-binary
<2.0.0, >=1.16.2
pypdf
<7.0.0, >=6.6.2
python-docx
<2.0.0, >=1.2.0
python-pptx
<2.0.0, >=1.0.2
unstructured-inference
<2.0.0, >=1.2.0
unstructured-pytesseract
<1.0.0, >=0.3.15
xlrd
<3.0.0, >=2.0.1
chunking-tokens:
tiktoken
<1.0.0, >=0.12.0
csv:
pandas
<4.0.0, >=2.0.0
doc:
python-docx
<2.0.0, >=1.2.0
docx:
python-docx
<2.0.0, >=1.2.0
epub:
pypandoc-binary
<2.0.0, >=1.16.2
huggingface:
sentencepiece
<1.0.0, >=0.2.0
torch
<3.0.0, >=2.10.0
transformers
<5.0.0, >=4.55.4
image:
google-cloud-vision
<4.0.0, >=3.12.1
pdf2image
<2.0.0, >=1.17.0
pdfminer-six
<20270000, >=20251230
pi-heif
<2.0.0, >=1.2.0
pikepdf
<11.0.0, >=10.3.0
pypdf
<7.0.0, >=6.6.2
unstructured-inference
<2.0.0, >=1.2.0
unstructured-pytesseract
<1.0.0, >=0.3.15
ingest:
unstructured-ingest
[airtable,astradb,azure,azure-ai-search,bedrock,biomed,box,chroma,confluence,couchbase,databricks-volumes,delta-table,discord,dropbox,elasticsearch,gcs,github,gitlab,google-drive,hubspot,huggingface,jira,kafka,kdbai,milvus,mongodb,notion,octoai,onedrive,openai,opensearch,outlook,pinecone,postgres,qdrant,reddit,remote,s3,salesforce,sftp,sharepoint,singlestore,slack,vectara,vertexai,voyageai,weaviate,wikipedia]
<2.0.0, >=1.4.0
local-inference:
google-cloud-vision
<4.0.0, >=3.12.1
markdown
<4.0.0, >=3.10.1
msoffcrypto-tool
<7.0.0, >=6.0.0
networkx
<4.0.0, >=3.2.0
openpyxl
<4.0.0, >=3.1.5
pandas
<4.0.0, >=2.0.0
pdf2image
<2.0.0, >=1.17.0
pdfminer-six
<20270000, >=20251230
pi-heif
<2.0.0, >=1.2.0
pikepdf
<11.0.0, >=10.3.0
pypandoc-binary
<2.0.0, >=1.16.2
pypdf
<7.0.0, >=6.6.2
python-docx
<2.0.0, >=1.2.0
python-pptx
<2.0.0, >=1.0.2
unstructured-inference
<2.0.0, >=1.2.0
unstructured-pytesseract
<1.0.0, >=0.3.15
xlrd
<3.0.0, >=2.0.1
md:
markdown
<4.0.0, >=3.10.1
odt:
pypandoc-binary
<2.0.0, >=1.16.2
python-docx
<2.0.0, >=1.2.0
org:
pypandoc-binary
<2.0.0, >=1.16.2
paddleocr:
paddlepaddle
<4.0.0, >=3.3.0
unstructured-paddleocr
==2.10.0
pdf:
google-cloud-vision
<4.0.0, >=3.12.1
pdf2image
<2.0.0, >=1.17.0
pdfminer-six
<20270000, >=20251230
pi-heif
<2.0.0, >=1.2.0
pikepdf
<11.0.0, >=10.3.0
pypdf
<7.0.0, >=6.6.2
unstructured-inference
<2.0.0, >=1.2.0
unstructured-pytesseract
<1.0.0, >=0.3.15
ppt:
python-pptx
<2.0.0, >=1.0.2
pptx:
python-pptx
<2.0.0, >=1.0.2
rst:
pypandoc-binary
<2.0.0, >=1.16.2
rtf:
pypandoc-binary
<2.0.0, >=1.16.2
tsv:
pandas
<4.0.0, >=2.0.0
xlsx:
msoffcrypto-tool
<7.0.0, >=6.0.0
networkx
<4.0.0, >=3.2.0
openpyxl
<4.0.0, >=3.1.5
pandas
<4.0.0, >=2.0.0
xlrd
<3.0.0, >=2.0.1
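Each dependency above is paired with a comma-separated version range such as `<5.0.0, >=4.14.3`, where every clause must hold at once. The following is a minimal sketch of how such a range can be checked against an installed version, using only the standard library. The helper names `parse_version` and `satisfies` are hypothetical, and the sketch assumes plain numeric versions (no pre-releases, epochs, or local tags), so it is a simplification rather than a full PEP 440 implementation.

```python
import operator
import re

# Comparison operators that appear in the ranges listed above.
OPS = {
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
    "==": operator.eq,
}


def parse_version(text: str) -> tuple[int, ...]:
    """Turn '4.14.3' into (4, 14, 3). Assumes purely numeric components."""
    return tuple(int(part) for part in text.strip().split("."))


def _pad(a: tuple[int, ...], b: tuple[int, ...]):
    """Right-pad the shorter tuple with zeros so '1.1' compares like '1.1.0'."""
    width = max(len(a), len(b))
    return a + (0,) * (width - len(a)), b + (0,) * (width - len(b))


def satisfies(version: str, constraint: str) -> bool:
    """Return True if `version` meets every clause in `constraint`."""
    candidate = parse_version(version)
    for clause in constraint.split(","):
        match = re.fullmatch(r"\s*(<=|>=|==|<|>)\s*([0-9.]+)\s*", clause)
        if match is None:
            raise ValueError(f"unsupported clause: {clause!r}")
        op, bound = match.groups()
        left, right = _pad(candidate, parse_version(bound))
        if not OPS[op](left, right):
            return False
    return True


if __name__ == "__main__":
    print(satisfies("4.14.3", "<5.0.0, >=4.14.3"))  # lower bound is inclusive
    print(satisfies("5.0.0", "<5.0.0, >=4.14.3"))   # upper bound is exclusive
```

For real projects, a maintained parser is a safer choice than this sketch, since published version strings can carry suffixes that this code rejects.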
Source
Location
Unstructured-IO/unstructured
Last Source Update
7 days ago
Licenses
Apache License 2.0
MIT License
(text added)
Distribution Destinations
pypi/unstructured