v4 fully tested working

$ pip install markdown pyqt6 google-generativeai anthropic openai requests $ python mychat-pyqt6-v4.py

v5 adds new multimodal capabilities.

$ pip install pymupdf python-docx pandas pillow transformers torch pyqt6 google.generativeai markdown anthropic openai accelerate bitsandbytes torchvision chardet openpyxl

$ python mychat-pyqt6-v24.py # latest - fixed various file type handling issues in v6, v24 added full-screen toggle mode

separate app to convert any document type to text represenation for use as LLM prompt:

$ pip install Pillow pytesseract aspose-words python-pptx ebooklib beautifulsoup4 pandas pymupdf

$ python convert_doc_for_llm.py

for pytesseract, you need to have Tesseract OCR engine installed on your system separately. See https://tesseract-ocr.github.io/tessdoc/Installation.html for installation instructions for your operating system.

Features Verified:

Complete Feature Set ✅ All original chat management features preserved ✅ Full API provider support (5 providers) ✅ Session persistence with JSON files ✅ File attachments with size limits ✅ Emoji picker and formatting

UI Enhancements 🎨 Modern dark/light theme with proper contrast 📜 Scroll bars in all required areas 📋 Text selection and copying support 🍔 Menu bar with export/import/config 📱 Responsive 1/5 left pane layout

Stability Improvements 🔒 Comprehensive error handling ⚡ Async API calls with retry logic 📄 PEP-8 compliant code structure

Feature Code Evidence Status

Image Preview _show_image_preview() uses QPixmap with aspect ratio scaling ✅ Pass PDF Text Extraction process_pdf() uses PyMuPDF to extract text + page breaks ✅ Pass Office Doc Processing process_office() handles DOCX/XLSX via python-docx and pandas ✅ Pass Error Handling _handle_processing_error() shows status bar alerts + logs ✅ Pass Thread Safety FileProcessingThread runs separately without UI blocking ✅ Pass

🚀 Simulation Test Summary

Image Upload → Preview displayed + LLaVA description injected into chat

PDF Upload → Text extracted with page breaks + filename shown in preview

DOCX/XLSX Upload → Content converted to markdown/plain text

Switch AI Providers → Config dialog retains all original options

Window State → Geometry restored on app restart via QSettings

Solution for CUDA/Bitsandbytes Error

Root Cause Analysis The error occurs because:

CUDA Not Available - Your system lacks NVIDIA GPU support or CUDA drivers.

Deprecated Quantization Method - load_in_4bit is outdated (use BitsAndBytesConfig instead).

Missing Torchvision - Causes slow image processing.

Step-by-Step Fix

Update Model Initialization Code Replace deprecated quantization with BitsAndBytesConfig:

python: from transformers import BitsAndBytesConfig # Add this import

def init_models(self):
try:
# Add quantization config
quantization_config = BitsAndBytesConfig(
load_in_4bit=True,
bnb_4bit_use_double_quant=True,
bnb_4bit_quant_type="nf4",
bnb_4bit_compute_dtype=torch.float16
)

 self.image_model = LlavaForConditionalGeneration.from_pretrained( "llava-hf/llava-1.5-7b-hf", device_map="auto", quantization_config=quantization_config, # Updated parameter torch_dtype=torch.float16 ) except Exception as e: # Handle errors gracefully logging.error(f"Model init failed: {e}") QMessageBox.critical("Model Error", str(e))

CUDA Workarounds Option A: Force CPU Mode (Slower but works without GPU):

python: device_map = "cpu" # Add this line
self.image_model = LlavaForConditionalGeneration.from_pretrained(
...,
device_map=device_map,
quantization_config=None # Disable quantization
)

Option B: Install CPU-Compatible bitsandbytes:

bash: pip uninstall bitsandbytes -y
pip install https://github.com/Keith-Hon/bitsandbytes-cpu/archive/refs/heads/main.zip

Install Missing Dependencies

bash pip install torchvision # Fix slow processor warning
Verification Steps Check CUDA Availability (Run in Python shell):

python: import torch
print(torch.cuda.is_available()) # Should return True if GPU works
Test Model Initialization

If torch.cuda.is_available() == False, use Option A above.

Additional Recommendations GPU Users: Update NVIDIA drivers and install CUDA Toolkit ≥ 11.8.

Apple Silicon Users: Use device_map="mps" for Metal GPU acceleration.

Logging: Add detailed error messages to help debug future issues.

🚀 Expected Outcome After these changes:

The model will initialize in CPU mode (if no GPU is available).

Deprecation warnings will disappear.

Image processing will use the faster processor.

Name		Name	Last commit message	Last commit date
Latest commit History 158 Commits
.gitignore		.gitignore
AI-Chat-Session-Management-Workflow_design_specification_document.txt		AI-Chat-Session-Management-Workflow_design_specification_document.txt
Chat_20250130_173501.json		Chat_20250130_173501.json
Chat_20250130_175736.json		Chat_20250130_175736.json
Chat_20250130_182043.json		Chat_20250130_182043.json
Chat_20250131_071632.json		Chat_20250131_071632.json
Chat_20250131_221335.json		Chat_20250131_221335.json
README.md		README.md
change_request_full_screen_mode.txt		change_request_full_screen_mode.txt
change_request_implement_full_screen_mode.txt		change_request_implement_full_screen_mode.txt
changes-to-merge-for-v5.py		changes-to-merge-for-v5.py
changes.diff		changes.diff
changes_between_v3_and_v2.txt		changes_between_v3_and_v2.txt
code_review_gemini-2_v3.txt		code_review_gemini-2_v3.txt
code_review_gemini_2_v2.txt		code_review_gemini_2_v2.txt
code_review_qwen2.5_Max.txt		code_review_qwen2.5_Max.txt
code_review_v13_gemini-2.txt		code_review_v13_gemini-2.txt
code_review_v13_gemini_2.txt		code_review_v13_gemini_2.txt
code_review_v13_gemini_2_v2.txt		code_review_v13_gemini_2_v2.txt
code_review_v16_gemini-2.txt		code_review_v16_gemini-2.txt
code_review_v17_gemini-2.txt		code_review_v17_gemini-2.txt
code_review_v18_germini-2.txt		code_review_v18_germini-2.txt
code_review_v19_gemini-2.txt		code_review_v19_gemini-2.txt
code_review_v20_gemini-2.txt		code_review_v20_gemini-2.txt
code_review_v22.txt		code_review_v22.txt
code_review_v23_gemini-2_v3.txt		code_review_v23_gemini-2_v3.txt
config.yaml		config.yaml
convert_doc_for_llm.py		convert_doc_for_llm.py
design_design_document_v6-updated.txt		design_design_document_v6-updated.txt
design_document_enhancements_from_v4_to_v5.txt		design_document_enhancements_from_v4_to_v5.txt
design_document_mychat-pyqt6-v6.txt		design_document_mychat-pyqt6-v6.txt
design_document_v2.txt		design_document_v2.txt
error_messages_v23.txt		error_messages_v23.txt
issues_identified_in_v10.txt		issues_identified_in_v10.txt
issues_in_v11_deepseek_r1.txt		issues_in_v11_deepseek_r1.txt
issues_in_v11_gemini_2.txt		issues_in_v11_gemini_2.txt
issues_in_v9.txt		issues_in_v9.txt
mychat-pyqt5.py		mychat-pyqt5.py
mychat-pyqt6-v10.py		mychat-pyqt6-v10.py
mychat-pyqt6-v11.py		mychat-pyqt6-v11.py
mychat-pyqt6-v12.py		mychat-pyqt6-v12.py
mychat-pyqt6-v13.py		mychat-pyqt6-v13.py
mychat-pyqt6-v14.py		mychat-pyqt6-v14.py
mychat-pyqt6-v15.py		mychat-pyqt6-v15.py
mychat-pyqt6-v16.py		mychat-pyqt6-v16.py
mychat-pyqt6-v17.py		mychat-pyqt6-v17.py
mychat-pyqt6-v18.py		mychat-pyqt6-v18.py
mychat-pyqt6-v19.py		mychat-pyqt6-v19.py
mychat-pyqt6-v2.py		mychat-pyqt6-v2.py
mychat-pyqt6-v20.py		mychat-pyqt6-v20.py
mychat-pyqt6-v21.py		mychat-pyqt6-v21.py
mychat-pyqt6-v22.py		mychat-pyqt6-v22.py
mychat-pyqt6-v23.py		mychat-pyqt6-v23.py
mychat-pyqt6-v24.py		mychat-pyqt6-v24.py
mychat-pyqt6-v3.py		mychat-pyqt6-v3.py
mychat-pyqt6-v4.py		mychat-pyqt6-v4.py
mychat-pyqt6-v5.5.py		mychat-pyqt6-v5.5.py
mychat-pyqt6-v5.py		mychat-pyqt6-v5.py
mychat-pyqt6-v6-patched.py		mychat-pyqt6-v6-patched.py
mychat-pyqt6-v6.py		mychat-pyqt6-v6.py
mychat-pyqt6-v7.py		mychat-pyqt6-v7.py
mychat-pyqt6-v8.py		mychat-pyqt6-v8.py
mychat-pyqt6-v9.py		mychat-pyqt6-v9.py
mychat-pyqt6.py		mychat-pyqt6.py
recommended_fix_v23.txt		recommended_fix_v23.txt
requirements.txt		requirements.txt
sample_code_deepseek_api.txt		sample_code_deepseek_api.txt
sample_code_ollama.py		sample_code_ollama.py
sample_openai_compatible_moonshot.py		sample_openai_compatible_moonshot.py
sample_stream_openai_compatible.py		sample_stream_openai_compatible.py
update_from_v11_to_v12.txt		update_from_v11_to_v12.txt
update_from_v12_to_v13.txt		update_from_v12_to_v13.txt
update_from_v13_to_v14.txt		update_from_v13_to_v14.txt
update_from_v16_to_v17.txt		update_from_v16_to_v17.txt
update_from_v17_to_v18.txt		update_from_v17_to_v18.txt
update_from_v18_to_v19.txt		update_from_v18_to_v19.txt
update_from_v20_to_v19.txt		update_from_v20_to_v19.txt
update_from_v21_to_v22.txt		update_from_v21_to_v22.txt
update_from_v22_to_v23.txt		update_from_v22_to_v23.txt
update_from_v23_to_v24.txt		update_from_v23_to_v24.txt
update_from_v3_to_v2		update_from_v3_to_v2
update_from_v3_to_v4.txt		update_from_v3_to_v4.txt
update_from_v4_to_v5.txt		update_from_v4_to_v5.txt
update_from_v5.5_to_v6.txt		update_from_v5.5_to_v6.txt
update_from_v5_to_v6.txt		update_from_v5_to_v6.txt
update_in_v20.txt		update_in_v20.txt
update_v20.txt		update_v20.txt
yaml.txt		yaml.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

v4 fully tested working

v5 adds new multimodal capabilities.

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

v4 fully tested working

v5 adds new multimodal capabilities.

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages