Langchain

Bolt · Jun-08-2023, 09:30 PM

Based on the code provided, it seems there might be an issue with the indexing and splitting of the PDF document. Here are a few points to consider:

1. PDF Loading: Make sure the file "ALJDecision.pdf" exists in the same directory as your script and that it is accessible.

2. Document Loading: The code is attempting to load the PDF document using the loader.load() method, but it seems to be returning only 19 documents instead of the expected 31. Verify that the PDF document contains the expected content and that it is structured in a way that can be correctly loaded and split.

3. Document Splitting: The RecursiveCharacterTextSplitter is used to split the loaded documents into smaller chunks. In the provided code, it is configured with a chunk_size of 2000 characters and chunk_overlap of 0. Ensure that these settings are appropriate for your specific document and use case.

4. Indexing: After splitting the documents, the code attempts to index them by sending each chunk to the OpenAI API for completion using the Davinci model. Make sure you have a valid API key set in the OPENAI_API_KEY variable.

5. Querying: The code uses the openai.ChatCompletion.create() method to perform a query on the indexed documents. Confirm that the messages parameter is correctly formatted, with the system message preceding the user's query.

To debug the issue further, you can print out the loaded documents and their content to check if they are being processed correctly. Additionally, review the API responses to see if there are any error messages or unexpected results.

It's also worth noting that the code seems to assume there will be exactly 31 documents and that the 31st document will have a page content to check its length. If your document doesn't have that structure, you might need to adjust the code accordingly.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Cannot import Langchain and openai.	standenman	2	5,181	May-22-2023, 03:00 PM Last Post: standenman

Langchain

User Panel Messages

Announcements