Spaces:
Running
Running
Commit
·
79cfa91
1
Parent(s):
d470fba
Final additions
Browse files- .github/workflows/check.yml +14 -0
- .github/workflows/main.yml +18 -0
- README.md +30 -7
.github/workflows/check.yml
ADDED
@@ -0,0 +1,14 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
name: Check file size
|
2 |
+
on:
|
3 |
+
pull_request:
|
4 |
+
branches: [main]
|
5 |
+
workflow_dispatch:
|
6 |
+
|
7 |
+
jobs:
|
8 |
+
sync-to-hub:
|
9 |
+
runs-on: ubuntu-latest
|
10 |
+
steps:
|
11 |
+
- name: Check large files
|
12 |
+
uses: ActionsDesk/[email protected]
|
13 |
+
with:
|
14 |
+
filesizelimit: 10485760 # 10MB, huggingface limit
|
.github/workflows/main.yml
ADDED
@@ -0,0 +1,18 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
name: Sync to Hugging Face hub
|
2 |
+
on:
|
3 |
+
push:
|
4 |
+
branches: [main]
|
5 |
+
workflow_dispatch:
|
6 |
+
|
7 |
+
jobs:
|
8 |
+
sync-to-hub:
|
9 |
+
runs-on: ubuntu-latest
|
10 |
+
steps:
|
11 |
+
- uses: actions/checkout@v3
|
12 |
+
with:
|
13 |
+
fetch-depth: 0
|
14 |
+
lfs: true
|
15 |
+
- name: Push to hub
|
16 |
+
env:
|
17 |
+
HF_TOKEN: ${{ secrets.HF_TOKEN }}
|
18 |
+
run: git push https://Rom89823974978:[email protected]/spaces/Rom89823974978/MDA main
|
README.md
CHANGED
@@ -1,9 +1,32 @@
|
|
1 |
---
|
2 |
-
title: EU Explorer (MDA Assignment)
|
3 |
-
emoji: 🤖
|
4 |
-
colorFrom: purple
|
5 |
-
colorTo: indigo
|
6 |
-
sdk: docker
|
7 |
-
app_port: 4444
|
8 |
-
pinned: false
|
9 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
title: EU Explorer (MDA Assignment)
|
3 |
+
emoji: 🤖
|
4 |
+
colorFrom: purple
|
5 |
+
colorTo: indigo
|
6 |
+
sdk: docker
|
7 |
+
app_port: 4444
|
8 |
+
pinned: false
|
9 |
---
|
10 |
+
|
11 |
+
# Interactive Retrieval-Augmented Generation for Semantic Exploration of Horizon Europe Research Data
|
12 |
+
|
13 |
+
**A Cloud-Native Web Application for Multilingual Question Answering and Research Trend Analysis**
|
14 |
+
|
15 |
+
This project presents a scalable, cloud-native system that leverages Retrieval-Augmented Generation (RAG) to provide semantic access to the Horizon Europe research project database (CORDIS). Combining dense and sparse retrieval methods with advanced multilingual language models, the system enables users to ask natural language questions and receive document-grounded answers, complete with citations.
|
16 |
+
|
17 |
+
The backend, built using FastAPI and integrated with tools like FAISS, Whoosh, and LangChain, supports both semantic and keyword search, hybrid retrieval, and re-ranking. A user-facing web application and chatbot interface make the system interactive and intuitive, allowing researchers, policymakers, and the public to explore EU-funded research projects in an intelligent, multilingual, and conversational manner.
|
18 |
+
|
19 |
+
## Table of Contents
|
20 |
+
|
21 |
+
- [Overview](#overview)
|
22 |
+
- [Dataset: Horizon Europe Projects](#dataset-horizon-europe-projects)
|
23 |
+
- [Features](#features)
|
24 |
+
- [System Architecture](#system-architecture)
|
25 |
+
- [Technologies Used](#technologies-used)
|
26 |
+
- [Installation](#installation)
|
27 |
+
- [Usage](#usage)
|
28 |
+
- [Web Application](#web-application)
|
29 |
+
- [API Endpoints](#api-endpoints)
|
30 |
+
- [Retrieval-Augmented Generation Pipeline](#retrieval-augmented-generation-pipeline)
|
31 |
+
- [Limitations and Future Work](#limitations-and-future-work)
|
32 |
+
|