Struggling with import win32com.client and reading pptx

Ahmed1 · February 26, 2023, 8:22pm

Hi. I am trying to write code that dispatches PowerPoint using win32com.client. It runs fine when I run my Streamlit app locally, but it failed when I deployed the application.
Here is an example:

import win32com.client
powerpoint = win32com.client.Dispatch("Powerpoint.Application")
deck = powerpoint.Presentations.Open('file.pptx', WithWindow=False)

Franky1 · February 26, 2023, 8:27pm

You cannot use this package on Streamlit Cloud, since it is a Windows-only package.
You have to go for a different approach.
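If the same code base still needs to run on Windows, one option is to guard the COM path so the app does not crash on import elsewhere. This is only a sketch: the helper name `com_automation_available` is hypothetical, and it only checks whether the platform is Windows and the `win32com` package can be found, not whether PowerPoint itself is installed.

```python
import importlib.util
import sys


def com_automation_available() -> bool:
    """Return True only on Windows when the win32com package can be found."""
    if sys.platform != "win32":
        # pywin32 is Windows-only, so there is nothing to look for elsewhere
        return False
    return importlib.util.find_spec("win32com") is not None


if com_automation_available():
    print("win32com is importable, the COM approach may work here")
else:
    print("win32com is not usable here, a different approach is needed")
```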

Ahmed1 · February 26, 2023, 8:29pm

Thank you, Franky, for your fast response. Is there any approach that you would recommend? Thanks again.

Franky1 · February 26, 2023, 8:39pm

I can’t say with so little information.
It depends mainly on what you want to achieve.
But Microsoft Office on Headless Linux Server is quite a hairy thing.

Ahmed1 · February 26, 2023, 8:53pm

All I need is to take the images from a PowerPoint file and save them in a dictionary for later use. If you know of a different approach, please don't hesitate to share. I have been struggling with this for some time now.

Franky1 · February 26, 2023, 10:31pm

Yes, there is indeed a really good trick for your problem,
and you don’t need any external libraries.
But it only works for pptx files, not for older ppt files!
pptx files are technically zip files under the hood, so you can just unzip them and pick out the image files. Here are some code snippets to give you an idea of how that can be accomplished:

import zipfile
from pathlib import Path

List all image files in pptx file

# load pptx file
with zipfile.ZipFile('test.pptx') as ziparchive:
    # filter all image files (png, jpg, gif...) from zip archive in pptx
    images = [f for f in ziparchive.namelist() if f.lower().endswith(('.png', '.jpg', '.gif', '.jpeg', '.bmp', '.tiff', '.tif', '.svg'))]
# show only list of image files:
print(images)

Extract all image files to subdir and keep the file structure

# load pptx file
with zipfile.ZipFile('test.pptx') as ziparchive:
    # filter all image files (png, jpg, gif...) from zip archive in pptx
    images = [f for f in ziparchive.namelist() if f.lower().endswith(('.png', '.jpg', '.gif', '.jpeg', '.bmp', '.tiff', '.tif', '.svg'))]
    for img in images:
        print(img)
        # extract image files from zip archive, it keeps the file structure
        ziparchive.extract(member=img, path='test')

Extract all image files to subdir and flatten the file structure

# load pptx file
with zipfile.ZipFile('test.pptx') as ziparchive:
    # filter all image files (png, jpg, gif...) from zip archive in pptx
    images = [f for f in ziparchive.namelist() if f.lower().endswith(('.png', '.jpg', '.gif', '.jpeg', '.bmp', '.tiff', '.tif', '.svg'))]
    subdir = Path('test')
    subdir.mkdir(exist_ok=True)  # create directory if not exists
    for img in images:
        # extract image files from zip archive, it flattens out the file structure:
        img_file_export_path = subdir.joinpath(Path(img).name)
        print(img_file_export_path)
        with open(img_file_export_path, 'wb') as f:
            f.write(ziparchive.read(img))

You have to adjust the examples of course to your needs…
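Since the goal was a dictionary rather than files on disk, the same idea can be sketched without writing anything out. The function name `images_from_pptx` and the constant `IMAGE_SUFFIXES` are made up for this illustration. The keys are the paths inside the archive, which avoids collisions between files with the same name in different folders.

```python
import zipfile
from pathlib import PurePosixPath

# Extensions treated as images; adjust this set to your needs
IMAGE_SUFFIXES = {'.png', '.jpg', '.jpeg', '.gif', '.bmp', '.tiff', '.tif', '.svg'}


def images_from_pptx(source):
    """Return {path inside archive: raw image bytes} for a pptx file.

    `source` can be a file path or a binary file-like object.
    """
    images = {}
    with zipfile.ZipFile(source) as ziparchive:
        for member in ziparchive.namelist():
            # compare the suffix case-insensitively, so '.PNG' also matches
            if PurePosixPath(member).suffix.lower() in IMAGE_SUFFIXES:
                images[member] = ziparchive.read(member)
    return images
```

Because `zipfile.ZipFile` accepts file-like objects, this should also work with the object returned by Streamlit's `st.file_uploader`, for example `images_from_pptx(uploaded_file)`, though I have not tested that in a deployed app.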

