Feb-01-2025, 10:12 AM
After exporting from PDF to Word (via), the file contains many unnecessary artifacts in the text:
- meaningless parts of images
- empty tables of 1 cell
The following solution based on Python and Google Colab needs to be implemented:
1. I add a Word file using the “upload” button.
2. Thumbnails of all images and tables are displayed in the interface (one copy of each), with a checkbox next to each.
3. I uncheck unnecessary images and tables.
4. I confirm.
5. Images and tables that I unchecked are deleted from the Word file.
6. The edited Word file is downloaded automatically to my PC.
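For step 2, the empty single-cell tables could be left unchecked by default so that only the rest need reviewing. This is a minimal sketch, assuming only the Python standard library and reading the raw `word/document.xml` part. `summarize_tables` and the `likely_artifact` flag are hypothetical names, not taken from the linked notebook, and the flag is only a suggestion for the initial checkbox state.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace in ElementTree's "{uri}tag" form.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def summarize_tables(docx_bytes):
    """Return one dict per table (w:tbl) in document order.

    Nested tables are listed as their own entries, and their cells are
    also counted in the enclosing table.
    """
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))

    summaries = []
    for index, tbl in enumerate(root.iter(W + "tbl")):
        cell_count = sum(1 for _ in tbl.iter(W + "tc"))
        text = "".join(t.text or "" for t in tbl.iter(W + "t")).strip()
        has_drawing = next(tbl.iter(W + "drawing"), None) is not None
        summaries.append({
            "index": index,
            "cells": cell_count,
            "text": text,
            # Heuristic only: one cell, no text, no picture inside.
            "likely_artifact": cell_count == 1 and not text and not has_drawing,
        })
    return summaries
```

In Colab the bytes would come from the uploaded file, and `likely_artifact` could set the starting value of each table's checkbox.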
Script for removing images and tables from Word
https://colab.research.google.com/drive/...oKxazRKrbj
Sample file
https://disk.yandex.ru/i/VQkZzn7LQflE1Q
Tables are displayed in the interface, but images and figures from the Word file are not. Please help me find the error. What should I fix?
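One possible cause, stated as an assumption because the notebook's code is not shown here: if the script collects pictures only through python-docx's `inline_shapes`, it will miss floating (anchored) drawings and legacy VML pictures, which PDF converters often produce. This is a minimal sketch using only the standard library that lists every image referenced from the document body, each one once. `list_docx_images` is a hypothetical helper name, and the code assumes image targets are stored inside the package under `word/`.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

A_BLIP = "{http://schemas.openxmlformats.org/drawingml/2006/main}blip"
V_IMAGEDATA = "{urn:schemas-microsoft-com:vml}imagedata"
R_NS = "http://schemas.openxmlformats.org/officeDocument/2006/relationships"
R_EMBED = "{%s}embed" % R_NS   # used by DrawingML pictures (inline and anchored)
R_ID = "{%s}id" % R_NS         # used by legacy VML pictures
REL_TAG = "{http://schemas.openxmlformats.org/package/2006/relationships}Relationship"
IMAGE_REL = R_NS + "/image"


def list_docx_images(docx_bytes):
    """Return [(rel_id, part_name, image_bytes)] in document order, no repeats."""
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as zf:
        # Map relationship ids to image parts, skipping externally linked images.
        rels = ET.fromstring(zf.read("word/_rels/document.xml.rels"))
        targets = {}
        for rel in rels.iter(REL_TAG):
            if rel.get("Type") == IMAGE_REL and rel.get("TargetMode") != "External":
                target = rel.get("Target")
                part = target.lstrip("/") if target.startswith("/") else "word/" + target
                targets[rel.get("Id")] = part

        body = ET.fromstring(zf.read("word/document.xml"))
        seen, found = set(), []
        for el in body.iter():
            if el.tag == A_BLIP:
                rid = el.get(R_EMBED)
            elif el.tag == V_IMAGEDATA:
                rid = el.get(R_ID)
            else:
                continue
            if rid in targets and rid not in seen:
                seen.add(rid)
                found.append((rid, targets[rid], zf.read(targets[rid])))
        return found
```

If this returns images for the sample file while the notebook's own list is empty, the lookup in the notebook is the likely culprit rather than the thumbnail display. Note that formats such as EMF or WMF may still not render as thumbnails in a browser without conversion.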