Ivan Vykopal

ivykopal · ivanvykopal

AI & ML interests

NLP, Computer Vision

ivykopal's activity

upvoted a collection 1 day ago

Llama 4

Llama 4 release • 10 items • Updated 1 day ago • 350

upvoted 3 papers 7 days ago

BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction

Paper • 2503.19658 • Published 13 days ago • 2

AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization

Paper • 2503.22526 • Published 9 days ago • 2

TextBite: A Historical Czech Document Dataset for Logical Page Segmentation

Paper • 2503.16664 • Published 17 days ago • 2

upvoted a paper about 2 months ago

Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages

Paper • 2502.10140 • Published Feb 14 • 9

upvoted a paper 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 364

upvoted 2 collections 4 months ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated Feb 13 • 84

EU20-Benchmarks

Evaluation Benchmarks for 20 European languages. • 5 items • Updated Oct 11, 2024 • 8

upvoted a collection 7 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 581

upvoted a paper 9 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 137

upvoted a paper 12 months ago

Pre-training Small Base LMs with Fewer Tokens

Paper • 2404.08634 • Published Apr 12, 2024 • 35

upvoted a paper about 1 year ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 78