Command line interface utility to extract embedded files and URLs from PowerPoint 2007 or later slideshow files (*.pptx).
Command-line utility that extracts references to externally linked files, embedded files, and website URLs from PowerPoint slideshows. It works with files in PowerPoint 2007 format or later. It requires only the JDK and PicoCLI; there are no other dependencies so far.
The following file types are extracted:
    C:\>pptxtract TestData\powerpoint\Slideshow.pptx
    Slideshow.pptx;File1.txt
    Slideshow.pptx;image.bmp
    Slideshow.pptx;File3.ini
    Slideshow.pptx;File2.csv

or:

    C:\>type pptfiles.txt | pptxtract
    Slideshow1.pptx;File1.txt
    Slideshow2.pptx;image.bmp
    Slideshow3.pptx;File3.ini
    Slideshow4.pptx;File2.csv
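For readers curious how such a `slideshow;reference` listing can be produced with the JDK alone, here is a minimal sketch. It relies on the fact that a *.pptx file is a ZIP package whose relationship parts (`*.rels`) mark linked files and hyperlinks with `TargetMode="External"`. The class name `ExternalTargetSketch` and the method `listExternalTargets` are hypothetical, and this is not necessarily how pptxtract itself is implemented.

```java
import java.io.InputStream;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical illustration, not part of pptxtract.
public class ExternalTargetSketch {

    /** Returns one "slideshow;target" line per external relationship found in the package. */
    public static List<String> listExternalTargets(Path pptx) throws Exception {
        List<String> result = new ArrayList<>();
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // Relationship parts never need a DOCTYPE, so reject them to avoid XXE issues.
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);

        try (ZipFile zip = new ZipFile(pptx.toFile())) {
            Enumeration<? extends ZipEntry> entries = zip.entries();
            while (entries.hasMoreElements()) {
                ZipEntry entry = entries.nextElement();
                // Only relationship parts describe links to other resources.
                if (!entry.getName().endsWith(".rels")) {
                    continue;
                }
                try (InputStream in = zip.getInputStream(entry)) {
                    NodeList rels = factory.newDocumentBuilder()
                            .parse(in)
                            .getElementsByTagName("Relationship");
                    for (int i = 0; i < rels.getLength(); i++) {
                        Element rel = (Element) rels.item(i);
                        // Internal parts have no TargetMode; external ones say "External".
                        if ("External".equals(rel.getAttribute("TargetMode"))) {
                            result.add(pptx.getFileName() + ";" + rel.getAttribute("Target"));
                        }
                    }
                }
            }
        }
        return result;
    }
}
```

The sketch reports every external target, including website URLs; a real tool would likely also inspect the relationship `Type` to tell hyperlinks apart from linked files.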
    Usage: pptxtract [-hoVx] FILE...
          FILE...     PowerPoint FILE(S) where paths to embedded or linked
                        documents shall be extracted. Must be of PowerPoint 2007
                        format (or later versions). Older *.ppt files must be
                        converted into PowerPoint 2007 (or newer) format before
                        use.
      -h, --help      Show this help message and exit.
      -o              When extracting embedded files, this option will force
                        overwriting existing files.
      -V, --version   Print version information and exit.
      -x, --extract-embeddings
                      When set, embedded files such as *.docx, *.xlsx or other
                        *.pptx files will be extracted.
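The interplay of `-x` and `-o` can be illustrated with a small JDK-only sketch. It assumes that embedded documents are stored under `ppt/embeddings/` inside the *.pptx package, which is where PowerPoint normally places them. The class `EmbeddingExtractSketch`, the method `extractEmbeddings`, and its `overwrite` parameter are hypothetical stand-ins, not the tool's actual code.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

// Hypothetical illustration, not part of pptxtract.
public class EmbeddingExtractSketch {

    /**
     * Copies every entry below ppt/embeddings/ into targetDir and returns the files written.
     * Existing files are left untouched unless overwrite is true (the assumed meaning of -o).
     */
    public static List<Path> extractEmbeddings(Path pptx, Path targetDir, boolean overwrite)
            throws IOException {
        List<Path> written = new ArrayList<>();
        Files.createDirectories(targetDir);

        try (ZipFile zip = new ZipFile(pptx.toFile())) {
            Enumeration<? extends ZipEntry> entries = zip.entries();
            while (entries.hasMoreElements()) {
                ZipEntry entry = entries.nextElement();
                String name = entry.getName();
                if (entry.isDirectory() || !name.startsWith("ppt/embeddings/")) {
                    continue;
                }
                // Keep only the last path segment so a crafted entry name cannot escape targetDir.
                String fileName = name.substring(name.lastIndexOf('/') + 1);
                Path target = targetDir.resolve(fileName);
                if (Files.exists(target) && !overwrite) {
                    continue; // without -o, existing files are skipped
                }
                try (InputStream in = zip.getInputStream(entry)) {
                    Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
                }
                written.add(target);
            }
        }
        return written;
    }
}
```

Skipping silently is only one possible reading of the help text; the real tool may instead report an error when a file already exists and `-o` is not given.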