ProdigyFlow is an AI-powered multi-agent data analytics framework designed to automate end-to-end exploratory analysis, visualization, insights generation, and reporting. It enables students, analysts, and researchers to turn raw datasets into meaningful business insights with minimal manual effort.
This project has been developed as part of an academic capstone initiative by:
Komal Harshita & Priyamvadha Sahasvi Nune,
Department of Computer Science & Engineering.
In real-world business environments, analysts spend 70โ80% of time cleaning, exploring, and summarizing data before any modelling or decision-making. This process is repetitive, time-consuming, and prone to error.
We wanted to build a system that:
ProdigyFlow reflects our goal to create simple, useful, modular tools that solve real analytical problems while being easy for students and businesses to adopt.
| Feature | Description |
|---|---|
| Automated Data Analysis Agent | Generates insights, metadata & summaries |
| Visualization Agent | Creates automated charts & visual summaries |
| Gemini-powered AI Summary | Natural-language insights from data |
| Structured Output Formatting | Clean and professional console reporting |
| Modular Agent Design | Add or replace agents independently |
| CSV/Excel Ingestion Support | Easily test custom datasets |
| Agent Name | Responsibility | Output |
|---|---|---|
analysis_agent.py |
Reads dataset, extracts statistics, generates Gemini summary | Insights, metadata JSON |
visualization_agent.py |
Generates visual graphs and saves locally | PNG charts |
cleaning_agent.py |
Cleans missing values, formatting, and structure | Cleaned dataset |
test_gemini.py |
Tests Gemini API connection | Model response output |
python -m venv .venv
.\.venv\Scripts\activate
source .venv/bin/activate
pip install -r requirements.txt
python agents/analysis_agent.py
python agents/test_gemini.py
Place your dataset inside the data/ folder and update path in code:
file_path = "data/your_file.csv"
๐ Running a dry test of analysis_agent...
๐ Using file: data/student_marks.csv
๐ INSIGHTS (Structured Data Overview)
------------------------------------
{ ... dataset overview JSON ... }
๐ค AI-GENERATED SUMMARY
-----------------------
โข Key performance trends detected
โข Distribution shows variation in subject performance
โข Potential improvement insights
๐ METADATA
-----------
{ ... summary JSON ... }
โ Analysis completed successfully!
๐น Build a web-based interface using FastAPI/Streamlit ๐น Add database integration and Auto-EDA dashboards ๐น Support PDF report generation ๐น Multi-file dataset comparison ๐น Plug-and-play Machine Learning agent
ProdigyFlow is only the beginning โ we plan to expand it into a fully intelligent analytical automation assistant.
| Name | Role |
|---|---|
| Komal Harshita | Lead Developer, Agent Architecture, AI Integration |
| Priyamvadha Sahasvi Nune | Data Research, Analytics, Testing & Documentation |
If you like this project, please โญ star the repository and share feedback!
๐ฆ Repository https://github.com/komalharshita/prodigyflow
๐ Project Documentation https://komalharshita.github.io/prodigyflow/
This project is released under MIT License โ feel free to use or modify with attribution.