openteamsinc
pypi/unstructured
Risk Profile
Package
Source
Risk Profile
Maturity:
Mature
Health:
Healthy
Legal:
Caution Needed
License has additional text that may require review before use
License includes a patent grant clause
Security:
Caution Needed
At least one high or critical severity vulnerability has been reported in the last 600
Package
pypi
unstructured
Version
0.22.31
Last Release Date
11 days ago
License
Apache
Dependencies
beautifulsoup4
<5.0.0, >=4.14.3
charset-normalizer
<4.0.0, >=3.4.4
emoji
<3.0.0, >=2.15.0
filelock
<4.0.0, >=3.12.0
filetype
<2.0.0, >=1.2.0
html5lib
<2.0.0, >=1.1
installer
<1.0.0, >=0.7.0
langdetect
<2.0.0, >=1.0.9
lxml
<7.0.0, >=5.0.0
numba
<1.0.0, >=0.60.0
numpy
<3.0.0, >=1.26.0
psutil
<8.0.0, >=7.2.2
python-iso639
<2027.0.0, >=2026.1.31
python-magic
<1.0.0, >=0.4.27
python-oxmsg
<1.0.0, >=0.0.2
rapidfuzz
<4.0.0, >=3.14.3
regex
<2027.0.0, >=2024.0.0
requests
<3.0.0, >=2.32.5
spacy
<4.0.0, >=3.7.0
tqdm
<5.0.0, >=4.67.3
typing-extensions
<5.0.0, >=4.15.0
unstructured-client
<1.0.0, >=0.25.9
wrapt
<3.0.0, >=2.1.1
all-docs:
google-cloud-vision
<4.0.0, >=3.12.1
markdown
<4.0.0, >=3.10.1
msoffcrypto-tool
<7.0.0, >=6.0.0
networkx
<4.0.0, >=3.2.0
openai-whisper
<20270000, >=20231117
openpyxl
<4.0.0, >=3.1.5
pandas
<3.0.0, >=2.0.0
pdf2image
<2.0.0, >=1.17.0
pdfminer-six
<20270000, >=20251230
pi-heif
<2.0.0, >=1.2.0
pikepdf
<11.0.0, >=10.3.0
pypandoc-binary
<2.0.0, >=1.16.2
pypandoc-binary
<2.0.0, >=1.16.2
pypdf
<7.0.0, >=6.6.2
python-docx
<2.0.0, >=1.2.0
python-pptx
<2.0.0, >=1.0.2
unstructured-inference
<2.0.0, >=1.6.10
unstructured-inference
<2.0.0, >=1.6.10
unstructured-pytesseract
<1.0.0, >=0.3.15
xlrd
<3.0.0, >=2.0.1
audio:
openai-whisper
<20270000, >=20231117
chunking-tokens:
tiktoken
<1.0.0, >=0.12.0
csv:
pandas
<3.0.0, >=2.0.0
doc:
python-docx
<2.0.0, >=1.2.0
docx:
python-docx
<2.0.0, >=1.2.0
epub:
pypandoc-binary
<2.0.0, >=1.16.2
pypandoc-binary
<2.0.0, >=1.16.2
huggingface:
sentencepiece
<1.0.0, >=0.2.0
torch
<3.0.0, >=2.10.0
torch
<3.0.0, >=2.10.0
transformers
<6.0.0, >=5.2.0
image:
google-cloud-vision
<4.0.0, >=3.12.1
pdf2image
<2.0.0, >=1.17.0
pdfminer-six
<20270000, >=20251230
pi-heif
<2.0.0, >=1.2.0
pikepdf
<11.0.0, >=10.3.0
pypdf
<7.0.0, >=6.6.2
unstructured-inference
<2.0.0, >=1.6.10
unstructured-inference
<2.0.0, >=1.6.10
unstructured-pytesseract
<1.0.0, >=0.3.15
ingest:
unstructured-ingest
[airtable,astradb,azure,azure-ai-search,bedrock,biomed,box,chroma,confluence,couchbase,databricks-volumes,delta-table,discord,dropbox,elasticsearch,gcs,github,gitlab,google-drive,hubspot,huggingface,jira,kafka,kdbai,milvus,mongodb,notion,octoai,onedrive,openai,opensearch,outlook,pinecone,postgres,qdrant,reddit,remote,s3,salesforce,sftp,sharepoint,singlestore,slack,vectara,vertexai,voyageai,weaviate,wikipedia]
<2.0.0, >=1.4.0
unstructured-ingest
[airtable,astradb,azure,azure-ai-search,bedrock,biomed,box,chroma,confluence,couchbase,databricks-volumes,delta-table,discord,dropbox,elasticsearch,gcs,github,gitlab,google-drive,hubspot,huggingface,jira,kafka,kdbai,milvus,mongodb,notion,octoai,onedrive,openai,opensearch,outlook,pinecone,postgres,qdrant,reddit,remote,s3,salesforce,sftp,sharepoint,singlestore,slack,vectara,vertexai,voyageai,weaviate,wikipedia]
<2.0.0, >=1.4.0
local-inference:
google-cloud-vision
<4.0.0, >=3.12.1
markdown
<4.0.0, >=3.10.1
msoffcrypto-tool
<7.0.0, >=6.0.0
networkx
<4.0.0, >=3.2.0
openai-whisper
<20270000, >=20231117
openpyxl
<4.0.0, >=3.1.5
pandas
<3.0.0, >=2.0.0
pdf2image
<2.0.0, >=1.17.0
pdfminer-six
<20270000, >=20251230
pi-heif
<2.0.0, >=1.2.0
pikepdf
<11.0.0, >=10.3.0
pypandoc-binary
<2.0.0, >=1.16.2
pypandoc-binary
<2.0.0, >=1.16.2
pypdf
<7.0.0, >=6.6.2
python-docx
<2.0.0, >=1.2.0
python-pptx
<2.0.0, >=1.0.2
unstructured-inference
<2.0.0, >=1.6.10
unstructured-inference
<2.0.0, >=1.6.10
unstructured-pytesseract
<1.0.0, >=0.3.15
xlrd
<3.0.0, >=2.0.1
md:
markdown
<4.0.0, >=3.10.1
odt:
pypandoc-binary
<2.0.0, >=1.16.2
pypandoc-binary
<2.0.0, >=1.16.2
python-docx
<2.0.0, >=1.2.0
org:
pypandoc-binary
<2.0.0, >=1.16.2
pypandoc-binary
<2.0.0, >=1.16.2
paddleocr:
paddlepaddle
<4.0.0, >=3.3.0
paddlepaddle
<4.0.0, >=3.3.0
unstructured-paddleocr
==2.10.0
pdf:
google-cloud-vision
<4.0.0, >=3.12.1
pdf2image
<2.0.0, >=1.17.0
pdfminer-six
<20270000, >=20251230
pi-heif
<2.0.0, >=1.2.0
pikepdf
<11.0.0, >=10.3.0
pypdf
<7.0.0, >=6.6.2
unstructured-inference
<2.0.0, >=1.6.10
unstructured-inference
<2.0.0, >=1.6.10
unstructured-pytesseract
<1.0.0, >=0.3.15
ppt:
python-pptx
<2.0.0, >=1.0.2
pptx:
python-pptx
<2.0.0, >=1.0.2
rst:
pypandoc-binary
<2.0.0, >=1.16.2
pypandoc-binary
<2.0.0, >=1.16.2
rtf:
pypandoc-binary
<2.0.0, >=1.16.2
pypandoc-binary
<2.0.0, >=1.16.2
tsv:
pandas
<3.0.0, >=2.0.0
xlsx:
msoffcrypto-tool
<7.0.0, >=6.0.0
networkx
<4.0.0, >=3.2.0
openpyxl
<4.0.0, >=3.1.5
pandas
<3.0.0, >=2.0.0
xlrd
<3.0.0, >=2.0.1
Source
Location
Unstructured-IO/unstructured
Last Source Update
11 days ago
Licenses
Apache License 2.0
MIT License
(text added)
MIT License
(text added)
Distribution Destinations
pypi/unstructured