Rip all the text from docx files and save them out to JSON. Rips all content through whole directory tree from root. Good for creating NLP training or research data. This little file works great running in Google Collab and easily outputing your new data into a google docs folder for use.
-
Notifications
You must be signed in to change notification settings - Fork 0
RyanSchattner/docrip
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Rip all the text from docx files and save them out to JSON. Rips all content through whole directory tree from root.. Good for creating NLP training or research data.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published