textract provides a single interface for extracting content from Word documents, PowerPoint presentations, PDFs and much more, which can be used for further textual analysis and visualization.

WWW: https://github.com/deanmalmgren/textract
PR: 265768
textract provides a single interface for extracting content
from Word documents, PowerPoint presentations, PDFs and much more,
which can be used for further textual analysis and visualization.