May-22-2023, 10:35 PM
I am trying to simply load a pdf doc and use langchain to process so I could query it with ChatGPT. I cannot get past loading the pdf doc. Cannot figure out what I am doing wrong here. Tried this code hoping and expecting to get the text on the first page just to see if it is loading, but I get "index out of range" erro.
import langchain import pypdf from langchain.document_loaders import PyPDFLoader PDFLoader= PyPDFLoader("RegexSplitTest.pdf") pages = PDFLoader.load_and_split() pages[0]
Error:IndexError Traceback (most recent call last)
Cell In[34], line 7
5 PDFLoader= PyPDFLoader("RegexSplitTest.pdf")
6 pages = PDFLoader.load_and_split()
----> 7 pages[0]
IndexError: list index out of range