openfree's picture

openfree PRO

openfree

AI & ML interests

None yet

Recent Activity

liked a Space about 12 hours ago
openfree/Korean-Exam-Leaderboard
replied to their post about 13 hours ago
Korean Exam Leaderboard: LLMs vs Civil Service and Professional Qualification Exams ๐Ÿ“ https://huggingface.co/spaces/openfree/Korean-Exam-Leaderboard ## ๐Ÿ“Š What is this leaderboard? This leaderboard evaluates the performance of various AI models on 22 Korean civil service and professional qualification exams. All scores are converted to a 100-point scale to show how well different LLMs can solve actual Korean civil service and professional qualification tests! ## ๐Ÿ† Current Top Performers - **OpenAI/GPT-o1**: Bar Exam 52.5 points ๐Ÿฅ‡ - **OpenAI/GPT-4.5**: Bar Exam 49.33 points ๐Ÿฅˆ - **OpenAI/GPT-4o**: Bar Exam 49.11 points ๐Ÿฅ‰ - **deepseek-ai/DeepSeek-R1**: Bar Exam 47.33 points ## ๐Ÿ“‹ Exams Being Evaluated The leaderboard includes various Korean civil service and professional qualification exams: - Korean Bar Exam - Senior Civil Service Grade 5 - Judicial Service Grade 5 - National Assembly Grade 5 - Judicial Scrivener - Police Executive Candidate - And more exams! ## ๐Ÿค– Models Being Evaluated We are testing a variety of models: - OpenAI: GPT-o1, GPT-o3-mini, GPT-4.5, GPT-4o - Anthropic: Claude 3.7 Sonnet - Google: Gemini 2.0 Flash/PRO/Flash Thinking - Meta: Llama 3.3 70B Instruct, Llama 3.2 90B Vision - DeepSeek: DeepSeek-R1 - Qwen: QwQ-32B, Qwen2.5 Coder - Mistral: Mistral-Small-3.1-24B - NVIDIA models: NVIDIA Nemotron variant models - And many more! ## ๐Ÿ” Why This Matters Korean civil service exams are known for their high difficulty and comprehensive knowledge assessment. These exams test deep knowledge across legal, administrative, and public service domains. Success in these exams demonstrates not just language understanding but also domain expertise and reasoning ability. ## ๐Ÿงช Evaluation Methodology ๐Ÿ”œ Future Plans We are continuously expanding our test coverage across all 22 exam categories. We will keep updating the scores marked "TBD" so please stay tuned!
reacted to their post with ๐Ÿค— about 14 hours ago
Korean Exam Leaderboard: LLMs vs Civil Service and Professional Qualification Exams ๐Ÿ“ https://huggingface.co/spaces/openfree/Korean-Exam-Leaderboard ## ๐Ÿ“Š What is this leaderboard? This leaderboard evaluates the performance of various AI models on 22 Korean civil service and professional qualification exams. All scores are converted to a 100-point scale to show how well different LLMs can solve actual Korean civil service and professional qualification tests! ## ๐Ÿ† Current Top Performers - **OpenAI/GPT-o1**: Bar Exam 52.5 points ๐Ÿฅ‡ - **OpenAI/GPT-4.5**: Bar Exam 49.33 points ๐Ÿฅˆ - **OpenAI/GPT-4o**: Bar Exam 49.11 points ๐Ÿฅ‰ - **deepseek-ai/DeepSeek-R1**: Bar Exam 47.33 points ## ๐Ÿ“‹ Exams Being Evaluated The leaderboard includes various Korean civil service and professional qualification exams: - Korean Bar Exam - Senior Civil Service Grade 5 - Judicial Service Grade 5 - National Assembly Grade 5 - Judicial Scrivener - Police Executive Candidate - And more exams! ## ๐Ÿค– Models Being Evaluated We are testing a variety of models: - OpenAI: GPT-o1, GPT-o3-mini, GPT-4.5, GPT-4o - Anthropic: Claude 3.7 Sonnet - Google: Gemini 2.0 Flash/PRO/Flash Thinking - Meta: Llama 3.3 70B Instruct, Llama 3.2 90B Vision - DeepSeek: DeepSeek-R1 - Qwen: QwQ-32B, Qwen2.5 Coder - Mistral: Mistral-Small-3.1-24B - NVIDIA models: NVIDIA Nemotron variant models - And many more! ## ๐Ÿ” Why This Matters Korean civil service exams are known for their high difficulty and comprehensive knowledge assessment. These exams test deep knowledge across legal, administrative, and public service domains. Success in these exams demonstrates not just language understanding but also domain expertise and reasoning ability. ## ๐Ÿงช Evaluation Methodology ๐Ÿ”œ Future Plans We are continuously expanding our test coverage across all 22 exam categories. We will keep updating the scores marked "TBD" so please stay tuned!
View all activity

Organizations

VIDraft's profile picture korea forestry's profile picture

Posts 19

view post
Post
1335
Korean Exam Leaderboard: LLMs vs Civil Service and Professional Qualification Exams ๐Ÿ“

openfree/Korean-Exam-Leaderboard

## ๐Ÿ“Š What is this leaderboard?
This leaderboard evaluates the performance of various AI models on 22 Korean civil service and professional qualification exams. All scores are converted to a 100-point scale to show how well different LLMs can solve actual Korean civil service and professional qualification tests!

## ๐Ÿ† Current Top Performers
- **OpenAI/GPT-o1**: Bar Exam 52.5 points ๐Ÿฅ‡
- **OpenAI/GPT-4.5**: Bar Exam 49.33 points ๐Ÿฅˆ
- **OpenAI/GPT-4o**: Bar Exam 49.11 points ๐Ÿฅ‰
- **deepseek-ai/DeepSeek-R1**: Bar Exam 47.33 points

## ๐Ÿ“‹ Exams Being Evaluated
The leaderboard includes various Korean civil service and professional qualification exams:
- Korean Bar Exam
- Senior Civil Service Grade 5
- Judicial Service Grade 5
- National Assembly Grade 5
- Judicial Scrivener
- Police Executive Candidate
- And more exams!

## ๐Ÿค– Models Being Evaluated
We are testing a variety of models:
- OpenAI: GPT-o1, GPT-o3-mini, GPT-4.5, GPT-4o
- Anthropic: Claude 3.7 Sonnet
- Google: Gemini 2.0 Flash/PRO/Flash Thinking
- Meta: Llama 3.3 70B Instruct, Llama 3.2 90B Vision
- DeepSeek: DeepSeek-R1
- Qwen: QwQ-32B, Qwen2.5 Coder
- Mistral: Mistral-Small-3.1-24B
- NVIDIA models: NVIDIA Nemotron variant models
- And many more!

## ๐Ÿ” Why This Matters
Korean civil service exams are known for their high difficulty and comprehensive knowledge assessment. These exams test deep knowledge across legal, administrative, and public service domains. Success in these exams demonstrates not just language understanding but also domain expertise and reasoning ability.

## ๐Ÿงช Evaluation Methodology

๐Ÿ”œ Future Plans
We are continuously expanding our test coverage across all 22 exam categories. We will keep updating the scores marked "TBD" so please stay tuned!
view post
Post
4507
๐Ÿš€ Idea Transformer:

Idea Transformer: Infinity is an innovative tool that unlocks infinite creativity by generating unique transformation ideas and design images from up to three keywords and a chosen category. Leveraging a state-of-the-art diffusion pipeline, real-time translation, and a powerful LLM, it delivers fresh ideas every time. ๐ŸŽจโœจ

openfree/Idea-Transformer

Key Features

Diverse Ideas:
Randomly selects creative variations from your keywords and category โ€” the possibilities are nearly endless! ๐ŸŽฒ
Unique Design Images:
Your text prompt produces striking, varied design images via the diffusion model. ๐Ÿ–ผ๏ธ
Real-Time Translation & Expansion:
Korean inputs are automatically translated and enriched using an advanced LLM for high-quality output. ๐Ÿ”„
Dual-Language Support:
Enjoy an intuitive Gradio interface with separate English and Korean tabs for a global audience. ๐ŸŒ
Explore a Wide Range of Categories:

Sensor Functions ๐Ÿ“ก: Creative changes in sensor technologies.
Size & Shape Change ๐Ÿ“: Ideas altering physical dimensions and forms.
Surface & Appearance Change ๐ŸŽจ: Transformations in color, texture, and visual effects.
Material State Change ๐Ÿ”ฅ: Transitions between different material states.
Movement Characteristics Change ๐Ÿƒโ€โ™‚๏ธ๐Ÿ’จ: Innovations in motion, speed, and vibration.
Structural Change ๐Ÿ› ๏ธ: Reconfigurations via assembly/disassembly and design modifications.
Spatial Movement ๐Ÿš€: Ideas on repositioning and directional shifts.
Time-Related Change โณ: Concepts influenced by aging, wear, and lifecycle.
Light & Visual Effects ๐Ÿ’ก: Alterations in illumination, transparency, and holographic effects.
Sound & Vibration Effects ๐Ÿ”Š: Innovations in auditory and vibrational dynamics.
Business Ideas ๐Ÿ’ผ: Strategies for market redefinition, business model innovation, and more.
Why Choose Idea Transformer?

Infinite Creativity & Cutting-Edge Technology : Your keywords and randomized transformations produce an endless stream of unique ideas!