entaoai

Chat and Ask on your own data. Accelerator to quickly upload your own enterprise data and use OpenAI services to chat to that uploaded data and ask questions

867
Stars
+-8
Stars/month
0
Releases (6m)

Star Growth

850868885Mar 27Apr 1

Overview

EntaoAI is an open-source sample application that demonstrates how to create ChatGPT-like conversational experiences over enterprise data. Built on Azure OpenAI Service, it provides a complete pipeline for uploading proprietary documents and enabling natural language interactions with that data. The tool supports multiple vector storage options including Pinecone, Redis, and Azure Cognitive Search for data indexing and retrieval. EntaoAI implements Retrieval Augmented Generation (RAG) patterns, allowing organizations to leverage large language models while maintaining control over their data. The platform includes advanced features like multi-modal RAG capabilities, streaming chat responses, and comprehensive evaluation tools through Azure ML Prompt Flow. With built-in evaluation metrics for groundedness, coherence, and similarity scoring, teams can assess and improve their RAG implementations. The tool has evolved through multiple iterations, with recent updates focusing on core chat, Q&A, upload, and admin functionality. While designed as a sample application, EntaoAI provides a solid foundation for organizations looking to implement enterprise AI chat solutions using their own data sources.

Deep Analysis

Key Differentiator

vs simple PDF chatbots: enterprise Azure-native document AI platform with SQL agents, PromptFlow evaluation, speech integration, function calling, and session persistence — the most feature-rich Azure OpenAI reference implementation

Capabilities

  • Enterprise document Q&A with Azure OpenAI Service
  • Multiple vector store backends (Pinecone, Redis, Azure Cognitive Search)
  • Chat and QA modes with source citations and follow-up questions
  • Document upload (PDF, DOCX, CSV, Markdown, ZIP)
  • SQL natural language querying via database agents
  • Prompt Flow integration for LLMOps evaluation
  • Speech-to-text and text-to-speech for chat
  • Function calling (Weather, Stock APIs)
  • Smart Agent for cross-document Q&A
  • Session management with CosmosDB

🔗 Integrations

Azure OpenAIAzure Cognitive SearchPineconeRedisCosmosDBAzure Data FactorySAP HCMPrompt FlowStreamlit

Best For

  • Enterprise teams on Azure wanting comprehensive document AI with evaluation
  • Organizations needing multi-source document Q&A with citations
  • Azure-first teams wanting PromptFlow-integrated RAG evaluation

Not Ideal For

  • Non-Azure environments (deeply Azure-integrated)
  • Simple prototyping (requires extensive Azure setup)
  • Cost-sensitive projects (multiple Azure services)

Languages

PythonJavaScript/React

Deployment

Azure (App Service + Functions)Docker

Known Limitations

  • Heavily Azure-dependent (multiple Azure services required)
  • Complex configuration with many API keys
  • Davinci model support being removed progressively
  • Setup requires significant Azure infrastructure

Pros

  • + Supports multiple vector stores (Pinecone, Redis, Azure Cognitive Search) providing flexibility in deployment options
  • + Includes comprehensive evaluation framework with Prompt Flow integration and metrics like groundedness and Ada similarity
  • + Active development with regular updates and refactoring to improve core functionality and remove complexity

Cons

  • - Designed as a sample application rather than production-ready solution, requiring additional development for enterprise deployment
  • - Specifically tied to Azure OpenAI Service, limiting flexibility in LLM provider choice
  • - Has undergone multiple refactoring cycles that removed features, suggesting potential instability in feature set

Use Cases

  • Enterprise document Q&A systems where employees need to query internal knowledge bases using natural language
  • Internal chatbots for customer support teams to quickly access company policies and procedures
  • Research and development teams building custom RAG applications for proprietary data analysis

Getting Started

1. Clone the repository and install Python dependencies, 2. Configure Azure OpenAI Service connection and choose your preferred vector store (Pinecone, Redis, or Azure Cognitive Search), 3. Use the upload functionality to add your enterprise data and start chatting through the web interface

Compare entaoai