Search text in PDF and output its page number.

Thread Rating:

0 Vote(s) - 0 Average
1
2
3
4
5

Thread Modes

Search text in PDF and output its page number.

snippsat

Administrators

Posts: 7,093

Threads: 122

Joined: Sep 2016

Reputation: 499

#22

Jan-21-2022, 06:20 AM

(Jan-21-2022, 03:51 AM)atomxkai Wrote: Can pdfplumber search part of a word then print results with the whole word?

It's more up to you to do that task as pdfplumber return plaint text.
So for this task can use regex.
Eg a pattern(search) r"\bpage\s\d+\b" will find page 1,page 2 or page 50.
Also it find page \s(whitespace character) \d(matches a digit) +(matches the previous digit between one and unlimited times)
Example.

import pdfplumber
import re

pdf_file = "sample.pdf"
pattern = re.compile(r"\bpage\s\d+\b")
with pdfplumber.open(pdf_file) as pdf:
    pages = pdf.pages
    for page_nr, pg in enumerate(pages, 1):
        content = pg.extract_text()
        for match in pattern.finditer(content):
            print(match.group(), page_nr, content.index(match.group()))

Output:page 2 1 568
page 1 2 39

atomxkai likes this post

Find

Messages In This Thread

Search text in PDF and output its page number. - by atomxkai - Jan-07-2022, 06:56 PM

RE: Search text in PDF and output its page number. - by cubangt - Jan-07-2022, 07:19 PM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-08-2022, 12:28 AM

RE: Search text in PDF and output its page number. - by BashBedlam - Jan-07-2022, 11:21 PM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-08-2022, 12:26 AM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-08-2022, 12:27 AM

RE: Search text in PDF and output its page number. - by snippsat - Jan-08-2022, 10:09 AM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-10-2022, 02:13 AM

RE: Search text in PDF and output its page number. - by BashBedlam - Jan-08-2022, 03:07 PM

RE: Search text in PDF and output its page number. - by snippsat - Jan-08-2022, 05:23 PM

RE: Search text in PDF and output its page number. - by snippsat - Jan-10-2022, 05:51 AM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-10-2022, 10:25 AM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-10-2022, 10:43 AM

RE: Search text in PDF and output its page number. - by snippsat - Jan-10-2022, 01:56 PM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-11-2022, 06:19 PM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-14-2022, 10:27 PM

RE: Search text in PDF and output its page number. - by snippsat - Jan-15-2022, 12:58 AM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-18-2022, 11:11 PM

RE: Search text in PDF and output its page number. - by snippsat - Jan-19-2022, 12:17 AM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-21-2022, 03:45 AM

RE: Search text in PDF and output its page number. - by atomxkai - Jan-21-2022, 03:51 AM

RE: Search text in PDF and output its page number. - by snippsat - Jan-21-2022, 06:20 AM

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Number stored as text with openpyxl	CAD79	2	458	Apr-17-2024, 10:17 AM Last Post: CAD79
	capturing multiline output for number of parameters	jss	3	827	Sep-01-2023, 05:42 PM Last Post: jss
	Formatting float number output	barryjo	2	934	May-04-2023, 02:04 PM Last Post: barryjo
	fuzzywuzzy search string in text file	marfer	9	4,642	Aug-03-2021, 02:41 AM Last Post: deanhystad
	Getting a GET request output text into a variable to work with it.	LeoT	2	3,041	Feb-24-2021, 02:05 PM Last Post: LeoT
	Increment text files output and limit contains	Kaminsky	1	3,216	Jan-30-2021, 06:58 PM Last Post: bowlofred
	How to Split Output Audio on Text to Speech Code	Base12	2	6,881	Aug-29-2020, 03:23 AM Last Post: Base12
	Search Results Web results Printing the number of days in a given month and year	afefDXCTN	1	2,248	Aug-21-2020, 12:20 PM Last Post: DeaD_EyE
	Import Text, output curve geometry	Alyner	0	1,993	Feb-03-2020, 03:05 AM Last Post: Alyner
	Search for the line number corresponding to a value	Lali	0	1,660	Oct-22-2019, 08:56 AM Last Post: Lali

Users browsing this thread: 1 Guest(s)

View a Printable Version

Search text in PDF and output its page number.

User Panel Messages

Announcements