AI-Powered Multilingual Summarization & Translation Platform
Extract web content, generate intelligent summaries, and translate across 12+ languages
SummarEase Pro is an advanced AI-powered web application that provides intelligent text summarization and multilingual translation capabilities. Built with state-of-the-art transformer models, it enables users to extract key information from articles, documents, and web pages, then translate the summaries into multiple languages.
- Smart Summarization: Specialized models for different languages
- Multilingual Translation: Support for 12+ languages
- Advanced Models: BART, mT5, M2M100, and MarianMT transformers
- Web Scraping: Extract content from any URL
- Multiple Inputs: Text input, URL scraping, file upload
- Content Cleaning: Remove scripts and non-content elements automatically
- Core Languages: French, English, Spanish, German, Arabic
- Additional Languages: Italian, Portuguese, Dutch, Russian, Chinese, Japanese, Korean, Hindi
- Bidirectional Translation: Translate between any language pair
- Modern Interface: Streamlit dashboard
- Real-time Processing: Live progress tracking
- Export Results: Download in JSON and CSV formats
- History Tracking: Review previous operations
- Python 3.8 or higher
- pip package manager
- Clone the repository
git clone https://github.com/Fzmes/Summarease-Pro.git cd Summarease-Pro- Install dependencies
pip install -r requirements.txt- Launch the application
streamlit run app.py- Open your browser
- Application opens automatically at
http://localhost:8501 - Or manually navigate to the displayed URL
- Application opens automatically at
- Go to "Charger Article" section
- Select "Texte Manuel"
- Paste your text and configure options
- Click "Lancer le RΓ©sumΓ© et la Traduction"
- Select "Lien Web"
- Enter article URL (Wikipedia, news sites, blogs)
- Click "Extraire le contenu"
- Process the extracted content
- Choose "Fichier Texte"
- Upload .txt files
- View preview and process
| Language | Code | Summarization | Translation |
|---|---|---|---|
| FranΓ§ais | fr | Specialized | All pairs |
| English | en | Specialized | All pairs |
| EspaΓ±ol | es | Available | All pairs |
| Deutsch | de | Available | All pairs |
| Ψ§ΩΨΉΨ±Ψ¨ΩΨ© | ar | Available | All pairs |
| δΈζ | zh | Available | All pairs |
| ζ₯ζ¬θͺ | ja | Available | All pairs |
Summarization: - French: moussaKam/barthez-orangesum-abstract - English: facebook/bart-large-cnn - Multilingual: google/mt5-small Translation: - M2M100: facebook/m2m100_418M (100 languages) - MBART-50: facebook/mbart-large-50-many-to-many-mmt - MarianMT: Helsinki-NLP specialized models - ExtendedMultilingualModels: Manages AI models and processing
- SummarizationApp: Streamlit web interface controller
- Web Scraper: Extracts content from websites
- Translation Engine: Handles multilingual translations
- Summarize market research reports
- Translate business documents
- Analyze competitor content
- Summarize research papers
- Translate academic content
- Study aid for language learning
- Localize articles for different regions
- Create multilingual content
- Research and content curation
- Quick understanding of foreign articles
- Language learning assistance
- Personal research organization
- Summary Length: Short, Medium, Long
- Target Languages: Multiple simultaneous translations
- Source Language: Auto-detection with manual override
- Summarization: 85-95% content retention
- Translation: High accuracy across languages
- Speed: 2-10 seconds processing time
- Web Extraction: Robust with multiple fallback strategies
We welcome contributions! Here's how you can help:
- Use the Issues section
- Describe the problem clearly
- Include steps to reproduce
- Open an issue with your idea
- Explain how it would be useful
- Fork the project
- Create a feature branch
- Make your changes
- Test thoroughly
- Submit a Pull Request
- New language support
- UI/UX improvements
- Performance optimization
- Documentation
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
- Documentation: Check this README
- Issues: Open a GitHub issue
- Questions: Use the discussions section
- Hugging Face for transformer models
- Streamlit for the web framework
- Facebook AI for M2M100 and MBART models
- Helsinki NLP for MarianMT models
Made with β€οΈ by Team SummarEase
π€ Fatima Zahra - Models & Deployment
π¨ Najlae - Visual Interface
π Ikram - Data Preparation
Making multilingual content accessible to everyone
Have questions or suggestions? Feel free to reach out through GitHub issues or discussions.
Happy Summarizing! π