Andy Lee committed on
Commit 0604037 · 1 Parent(s): 750c136

docs: README

Files changed (1): README.md +144 -0
README.md ADDED
@@ -0,0 +1,144 @@
+ ---
+ title: Omniscient
+ emoji: 👁️‍🗨️
+ colorFrom: indigo
+ colorTo: purple
+ sdk: streamlit
+ python_version: 3.11
+ sdk_version: "1.35.0"
+ app_file: app.py
+ pinned: false
+ ---
+
+ # Omniscient
+ "The all-knowing AI that sees everything, knows everything"
+
+ A versatile AI bot for image analysis and dataset curation with support for multiple AI models.
+
+ ## Features
+
+ - **Dataset Curation**: Generate and curate image datasets
+ - **Single Image Analysis**: Benchmark different AI models on individual images
+ - **Agentic Analysis**: Multi-step AI reasoning and analysis
+ - **Multiple AI Providers**: Support for OpenAI, Anthropic, and Google AI
+
+ ## Quick Start
+
+ ### 1. Set Up Environment
+
+ ```bash
+ cd simple_G_ai_bot
+ ```
+
+ Create a `.env` file in the project root:
+
+ ```bash
+ # .env
+ OPENAI_API_KEY=your_openai_api_key_here
+ ANTHROPIC_API_KEY=your_anthropic_api_key_here
+ GOOGLE_API_KEY=your_google_api_key_here
+ ```
+
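The README does not say how the bot reads the `.env` file, so the following is only a minimal sketch of one way to check which providers are configured. It uses the standard library alone; the helper names `parse_env_text`, `load_env_file`, and `configured_providers` are hypothetical and not part of the project.

```python
import os
from pathlib import Path
from typing import Dict, List

# Variable names come from the README; the provider labels are illustrative.
PROVIDER_KEYS = {
    "OpenAI": "OPENAI_API_KEY",
    "Anthropic": "ANTHROPIC_API_KEY",
    "Google": "GOOGLE_API_KEY",
}


def parse_env_text(text: str) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def load_env_file(path: str = ".env") -> Dict[str, str]:
    """Combine .env values (if the file exists) with the process environment.

    Assumption: real environment variables take precedence over the file.
    """
    env_path = Path(path)
    file_values = parse_env_text(env_path.read_text()) if env_path.exists() else {}
    return {**file_values, **os.environ}


def configured_providers(env: Dict[str, str]) -> List[str]:
    """Return the providers whose API key is present and non-empty."""
    return [name for name, var in PROVIDER_KEYS.items() if env.get(var)]
```

Calling `configured_providers(load_env_file())` from the project root would list the providers that have a key set. A placeholder such as `your_openai_api_key_here` still counts as set in this sketch, so replace the placeholders with real keys first.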
+ ### 2. Install Dependencies
+
+ ```bash
+ uv sync
+ ```
+
+ ### 3. Usage Examples
+
+ #### Dataset Curation
+ Generate 50 urban outdoor samples:
+ ```bash
+ python main.py --mode data --samples 50 --urban --no-indoor
+ ```
+
+ #### Single Image Analysis
+ Benchmark GPT-4o on 5 samples:
+ ```bash
+ python main.py --mode benchmark --models gpt-4o --samples 5
+ ```
+
+ #### Agentic Analysis
+ Run multi-step analysis with Gemini:
+ ```bash
+ python main.py --mode agent --model gemini-2.5-pro --steps 10 --samples 5
+ ```
+
+ ## Configuration
+
+ ### Environment Variables
+
+ | Variable | Description | Required |
+ |----------|-------------|----------|
+ | `OPENAI_API_KEY` | OpenAI API key for GPT models | Only when using GPT models |
+ | `ANTHROPIC_API_KEY` | Anthropic API key for Claude models | Only when using Claude models |
+ | `GOOGLE_API_KEY` | Google AI API key for Gemini models | Only when using Gemini models |
+
+ ### Command Line Options
+
+ #### Common Options
+ - `--mode`: Operation mode (`data`, `benchmark`, `agent`)
+ - `--samples`: Number of samples to process (default: 10)
+
+ #### Data Mode Options
+ - `--urban`: Focus on urban environments
+ - `--no-indoor`: Exclude indoor scenes
+
+ #### Benchmark Mode Options
+ - `--models`: AI model to benchmark (e.g., `gpt-4o`, `claude-3-opus`, `gemini-pro`)
+
+ #### Agent Mode Options
+ - `--model`: AI model for agentic analysis
+ - `--steps`: Number of reasoning steps (default: 5)
+
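To make the option list above concrete, here is a minimal `argparse` sketch that mirrors the documented flags and defaults. It is not the project's actual `main.py`: treating `--mode` as required and `--models` as a single model name are assumptions, and `build_parser` is a hypothetical helper.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the options documented in the README."""
    # allow_abbrev=False avoids prefix matching between the similar
    # flag names --mode, --model, and --models.
    parser = argparse.ArgumentParser(prog="main.py", allow_abbrev=False)

    # Common options (assumption: --mode is required).
    parser.add_argument("--mode", choices=["data", "benchmark", "agent"], required=True)
    parser.add_argument("--samples", type=int, default=10)

    # Data mode options.
    parser.add_argument("--urban", action="store_true")
    parser.add_argument("--no-indoor", action="store_true")

    # Benchmark mode option (assumption: one model name per run).
    parser.add_argument("--models")

    # Agent mode options.
    parser.add_argument("--model")
    parser.add_argument("--steps", type=int, default=5)
    return parser
```

With this sketch, `build_parser().parse_args(["--mode", "agent", "--model", "gemini-2.5-pro"])` yields `steps=5` and `samples=10`, matching the documented defaults.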
+ ## Supported Models
+
+ - **OpenAI**: GPT-4o, GPT-4, GPT-3.5-turbo
+ - **Anthropic**: Claude-3-opus, Claude-3-sonnet, Claude-3-haiku
+ - **Google**: Gemini-2.5-pro, Gemini-pro, Gemini-pro-vision
+
+ ## Requirements
+
+ - Python 3.8+
+ - `uv` package manager
+ - Valid API keys for the AI providers you want to use
+
+ ## Installation
+
+ 1. Clone the repository
+ 2. Navigate to the project directory: `cd simple_G_ai_bot`
+ 3. Create a `.env` file with your API keys
+ 4. Install dependencies: `uv sync`
+ 5. Run the bot with the desired mode and options
+
+ ## Examples
+
+ ### Basic Dataset Generation
+ ```bash
+ python main.py --mode data --samples 20
+ ```
+
+ ### Urban Scene Analysis
+ ```bash
+ python main.py --mode data --samples 30 --urban --no-indoor
+ ```
+
+ ### Model Comparison
+ ```bash
+ python main.py --mode benchmark --models gpt-4o --samples 10
+ python main.py --mode benchmark --models claude-3-opus --samples 10
+ ```
+
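The model comparison above runs one benchmark command per model. When comparing more than two models, the command lines could be generated rather than typed by hand. The sketch below only builds the commands; `benchmark_command` and `comparison_commands` are hypothetical helpers, and the bare `python` executable name is an assumption about the environment.

```python
import shlex
from typing import List


def benchmark_command(model: str, samples: int = 10) -> List[str]:
    """Build the argument list for one benchmark run, as shown in the README."""
    return [
        "python", "main.py",
        "--mode", "benchmark",
        "--models", model,
        "--samples", str(samples),
    ]


def comparison_commands(models: List[str], samples: int = 10) -> List[List[str]]:
    """Build one benchmark command per model, preserving the given order."""
    return [benchmark_command(model, samples) for model in models]


# Print the commands instead of running them, so they can be reviewed first.
for cmd in comparison_commands(["gpt-4o", "claude-3-opus"]):
    print(shlex.join(cmd))
```

Each argument list can be passed to `subprocess.run(cmd, check=True)` from the project root to execute the runs one after another.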
+ ### Advanced Agentic Workflow
+ ```bash
+ python main.py --mode agent --model gemini-2.5-pro --steps 15 --samples 3
+ ```
+
+ ## Security Note
+
+ ⚠️ **Important**: Never commit your `.env` file to version control. Add `.env` to your `.gitignore` file to keep your API keys secure.
+
+ ## License
+
+ MIT License. See the LICENSE file for details.