Spaces:
Running
A newer version of the Gradio SDK is available:
5.28.0
title: ChatDocxAI
emoji: π
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.27.0
app_file: app.py
pinned: false
ChatDocxAI
ChatDocxAI is an advanced document analysis and question-answering system that leverages the power of Google's Gemini AI to provide intelligent insights from your documents. This versatile tool allows you to upload various document formats and engage in natural conversations about their contents. Whether you're analyzing research papers, business reports, or any other document type, ChatDocxAI helps you extract meaningful information through simple questions.
The system employs sophisticated natural language processing techniques to:
- Break down complex documents into digestible chunks
- Maintain context awareness across the entire document
- Provide accurate, contextually relevant answers
- Handle multiple document formats seamlessly
Perfect for researchers, business analysts, students, and anyone who needs to quickly extract information from documents without manually reading through them.
Features
- Support for multiple document formats:
- PDF (.pdf)
- Text (.txt)
- Word Documents (.docx)
- Excel Spreadsheets (.xlsx)
- PowerPoint Presentations (.pptx)
- XML files (.xml)
- CSV files (.csv)
- JSON files (.json)
- Interactive web interface using Gradio
- Intelligent document processing and chunking
- Advanced context retrieval for accurate answers
- Train on multiple documents at once
Installation
- Clone this repository:
git clone https://github.com/yourusername/ChatDocxAI.git
cd ChatDocxAI
- Install the required dependencies:
pip install -r requirements.txt
Usage
- Run the application:
python app.py
The application will start and provide you with two URLs:
- A local URL (http://127.0.0.1:7860) for local access
- A public URL (https://xxx.gradio.live) that anyone can access
You can share the public URL with others to let them interact with your document Q&A system.
Upload your documents using the file upload interface
Click "Process Document" to analyze your uploaded files
Ask questions about your documents in the question box
How It Works
- Document Processing: When you upload documents, they are processed and split into manageable chunks
- Vector Storage: The chunks are converted into vectors and stored for efficient retrieval
- Question Handling: When you ask a question, the system:
- Retrieves relevant context from your documents
- Builds an intelligent prompt
- Uses Gemini AI to generate accurate answers
Dependencies
- gradio: For the web interface
- google-genai: For Gemini AI integration
- langchain: For document processing and chain operations
- faiss-cpu: For vector storage and retrieval
- sentence-transformers: For text embeddings
- unstructured: For document parsing
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.