A Langchain-powered retrieval-augmented-generation pipeline for comprehensive multi-modal analysis of PDFs, specifically tailored for ESG document probing.
To weave the environment for this digital alchemy, follow these incantations:
conda env create -f environment.yml conda activate pdfRAG If the above does not work for you, fear not. Try these alternative spells:
conda create -n "pdfRAG-env" python==3.10 conda activate pdfRAG-env pip install -U langchain openai chromadb langchain-experimental pip install "unstructured[all-docs]" pillow pydantic lxml pillow matplotlib chromadb tiktoken pip intall streamlit Whisper your OPENAI API-key:
export OPENAI_API_KEY=<your-api-key-here>
To set sail, chant:
streamlit run app.py
- Knowledge Graphs (very soon)
- Corrective Strategy (supervising LLM)
- RPN-based chunk optimization
- Reciprocal Reranking
- Multimodal OpenLLM based local engine
