INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Paper • 2411.19799 • Published Nov 29, 2024
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization Paper • 2508.04796 • Published Aug 6, 2025
From Citations to Criticality: Predicting Legal Decision Influence in the Multilingual Swiss Jurisprudence Paper • 2410.13460 • Published Oct 17, 2024
Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland Paper • 2410.13456 • Published Oct 17, 2024
Can Large Language Models Capture Human Annotator Disagreements? Paper • 2506.19467 • Published Jun 24, 2025
FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning Paper • 2404.02127 • Published Apr 2, 2024
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models Paper • 2308.11462 • Published Aug 20, 2023
LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text Paper • 2402.04335 • Published Feb 6, 2024
Resolving Legalese: A Multilingual Exploration of Negation Scope Resolution in Legal Documents Paper • 2309.08695 • Published Sep 15, 2023
SCALE: Scaling up the Complexity for Advanced Language Model Evaluation Paper • 2306.09237 • Published Jun 15, 2023
BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model From Scratch? Paper • 2211.17135 • Published Nov 30, 2022
MultiLegalSBD: A Multilingual Legal Sentence Boundary Detection Dataset Paper • 2305.01211 • Published May 2, 2023