ScyllaProject (ScyllaProject)

luohy

updated a dataset 3 months ago

ScyllaProject/q7_ood_length_data

Viewer • Updated Mar 22 • 32k • 23

luohy

published a dataset 3 months ago

ScyllaProject/q7_ood_length_data

Viewer • Updated Mar 22 • 32k • 23

luohy

updated a dataset 3 months ago

ScyllaProject/q7_id_ood

Viewer • Updated Mar 14 • 62.2k • 30

luohy

published a dataset 3 months ago

ScyllaProject/q7_id_ood

Viewer • Updated Mar 14 • 62.2k • 30

luohy

updated a dataset 3 months ago

ScyllaProject/q7_id

Viewer • Updated Mar 10 • 31.5k • 32

luohy

published a dataset 3 months ago

ScyllaProject/q7_id

Viewer • Updated Mar 10 • 31.5k • 32

zhenting

authored a paper 8 months ago

Quantifying Generalization Complexity for Large Language Models

Paper • 2410.01769 • Published Oct 2, 2024 • 14

luohy

authored a paper 8 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 151

Xulianghuang

authored a paper 8 months ago

Quantifying Generalization Complexity for Large Language Models

Paper • 2410.01769 • Published Oct 2, 2024 • 14

luohy

authored a paper 8 months ago

Quantifying Generalization Complexity for Large Language Models

Paper • 2410.01769 • Published Oct 2, 2024 • 14

zhenting

authored 5 papers 10 months ago

QTSumm: A New Benchmark for Query-Focused Table Summarization

Paper • 2305.14303 • Published May 23, 2023

FOLIO: Natural Language Reasoning with First-Order Logic

Paper • 2209.00840 • Published Sep 2, 2022

Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model

Paper • 2209.11477 • Published Sep 23, 2022

ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples

Paper • 2210.12374 • Published Oct 22, 2022

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 74

luohy

authored a paper 11 months ago

Training Task Experts through Retrieval Based Distillation

Paper • 2407.05463 • Published Jul 7, 2024 • 10

luohy

authored a paper 12 months ago

Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts

Paper • 2406.12034 • Published Jun 17, 2024 • 16

swdanielli

authored 3 papers about 1 year ago

SUPERB: Speech processing Universal PERformance Benchmark

Paper • 2105.01051 • Published May 3, 2021 • 1

DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering

Paper • 2203.04911 • Published Mar 9, 2022

DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings

Paper • 2204.10298 • Published Apr 21, 2022 • 1

ScyllaProject

AI & ML interests

ScyllaProject's activity

ScyllaProject/q7_ood_length_data

ScyllaProject/q7_ood_length_data

ScyllaProject/q7_id_ood

ScyllaProject/q7_id_ood

ScyllaProject/q7_id

ScyllaProject/q7_id

Quantifying Generalization Complexity for Large Language Models

Addition is All You Need for Energy-efficient Language Models

Quantifying Generalization Complexity for Large Language Models

Quantifying Generalization Complexity for Large Language Models

QTSumm: A New Benchmark for Query-Focused Table Summarization

FOLIO: Natural Language Reasoning with First-Order Logic

Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model

ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Training Task Experts through Retrieval Based Distillation

Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts

SUPERB: Speech processing Universal PERformance Benchmark

DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering

DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings

AI & ML interests

Team members 5

ScyllaProject's activity