openlifescienceai (Open Life Science AI)

aaditya

updated a dataset 23 days ago

openlifescienceai/requests

Preview • Updated 23 days ago • 2.98k • 1

pminervini

authored 2 papers 2 months ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15 • 54

Neurosymbolic Diffusion Models

Paper • 2505.13138 • Published May 19 • 34

clefourrier

posted an update 2 months ago

Post

1005

Always surprised that so few people actually read the FineTasks blog, on
✨how to select training evals with the highest signal✨

If you're serious about training models without wasting compute on shitty runs, you absolutely should read it!!

A high-signal eval tells you precisely, during training, how well and what your model is learning, allowing you to discard the bad runs/bad samplings/...!

The blog covers prompt choice, metrics, and datasets in depth, across languages/capabilities, and my fave section is "which properties should evals have"👌
(so you know how to select the best evals for your use case)

Blog: HuggingFaceFW/blogpost-fine-tasks

2 replies

aryopg

authored a paper 4 months ago

An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering

Paper • 2503.23415 • Published Mar 30 • 1

clefourrier

posted an update 4 months ago

Post

2531

Gemma3 family is out! Reading the tech report, and this section was really interesting to me from a methods/scientific fairness pov.

Instead of doing over-hyped comparisons, they clearly state that **results are reported in a setup which is advantageous to their models**.
(Which everybody does, but people usually don't say)

For a tech report, it makes a lot of sense to report model performance when used optimally!
On leaderboards, on the other hand, comparisons will be apples to apples, but potentially suboptimal for a given model family (just as some users interact suboptimally with models).

It also contains a cool section (6) on training data memorization rate! Important to see if your model will output the training data it has seen as is: always an issue for privacy/copyright/... but also very much for evaluation!

Because if your model knows its evals by heart, you're not testing for generalization.

aryopg

authored a paper 5 months ago

Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs

Paper • 2502.05092 • Published Feb 7 • 8

pminervini

authored 6 papers 5 months ago

FLARE: Faithful Logic-Aided Reasoning and Exploration

Paper • 2410.11900 • Published Oct 14, 2024 • 4

SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages

Paper • 2406.14425 • Published Jun 20, 2024 • 2

clefourrier

authored a paper 6 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 236

aaditya

posted an update 7 months ago

Post

4758

Last Week in Medical AI: Top Research Papers/Models 🔥
🏅 (December 15 – December 21, 2024)

Medical LLM & Other Models
- MedMax: Mixed-Modal Biomedical Assistant
  - Advanced multimodal instruction tuning
  - Enhanced biomedical knowledge integration
  - Comprehensive assistant capabilities
- MGH Radiology Llama 70B
  - Specialized radiology focus
  - State-of-the-art performance
  - Enhanced report generation capabilities
- HC-LLM: Historical Radiology Reports
  - Context-aware report generation
  - Historical data integration
  - Improved accuracy in diagnostics

Frameworks & Methods
- ReflecTool: Reflection-Aware Clinical Agents
- Process-Supervised Clinical Notes
- Federated Learning with RAG
- Query Pipeline Optimization

Benchmarks & Evaluations
- Multi-OphthaLingua
  - Multilingual ophthalmology benchmark
  - Focus on LMICs healthcare
  - Bias assessment framework
- ACE-M3 Evaluation Framework
  - Multimodal medical model testing
  - Comprehensive capability assessment
  - Standardized evaluation metrics

LLM Applications
- Patient-Friendly Video Reports
- Medical Video QA Systems
- Gene Ontology Annotation
- Healthcare Recommendations

Special Focus: Medical Ethics & AI
- Clinical Trust Impact Study
- Mental Health AI Challenges
- Hospital Monitoring Ethics
- Radiology AI Integration

Now you can watch and listen to the latest Medical AI papers daily on our YouTube and Spotify channels as well!

- Full thread in detail: https://x.com/OpenlifesciAI/status/1870504774162063760
- YouTube Link: youtu.be/SbFp4fnuxjo
- Spotify: https://t.co/QPmdrXuWP9

1 reply

aaditya

posted an update 7 months ago

Post

3481

Last Week in Medical AI: Top Research Papers/Models 🔥
🏅 (December 7 – December 14, 2024)

Medical LLM & Other Models
- PediaBench: Chinese Pediatric LLM
  - Comprehensive pediatric dataset
  - Advanced benchmarking platform
  - Chinese healthcare innovation
- BiMediX: Bilingual Medical LLM
  - Multilingual medical expertise
  - Diverse medical knowledge integration
  - Cross-cultural healthcare insights
- MMedPO: Vision-Language Medical LLM
  - Clinical multimodal optimization
  - Advanced medical image understanding
  - Precision healthcare modeling

Frameworks and Methodologies
- TOP-Training: Medical Q&A Framework
- Hybrid RAG: Secure Medical Data Management
- Zero-Shot ATC Clinical Coding
- Chest X-Ray Diagnosis Architecture
- Medical Imaging AI Democratization

Benchmarks & Evaluations
- KorMedMCQA: Korean Healthcare Licensing Benchmark
- Large Language Model Medical Tasks
- Clinical T5 Model Performance Study
- Radiology Report Quality Assessment
- Genomic Analysis Benchmarking

Medical LLM Applications
- BRAD: Digital Biology Language Model
- TCM-FTP: Herbal Prescription Prediction
- LLaSA: Activity Analysis via Sensors
- Emergency Department Visit Predictions
- Neurodegenerative Disease AI Diagnosis
- Kidney Disease Explainable AI Model

Ethical AI & Privacy
- Privacy-Preserving LLM Mechanisms
- AI-Driven Digital Organism Modeling
- Biomedical Research Automation
- Multimodality in Medical Practice

Full thread in detail: https://x.com/OpenlifesciAI/status/1867999825721242101

4 replies

aaditya

posted an update 8 months ago

Post

2073

Last Week in Medical AI: Top Research Papers/Models 🔥
🏅 (December 2 – December 7, 2024)

Medical LLM & Models
- Block MedCare: Blockchain AI & IoT
- LLMs4Life: Biomedical Ontology Learning
- LLaMA II for Multimodal Diagnosis
- Compact LLM for EHR Privacy

Frameworks & Methods
- RARE: Retrieval-Augmented Reasoning
- STORM: Strategies for Rare Events
- TransFair: Fair Disease Classification
- PePR: Performance Per Resource
- Medical LLM Best Practices

LLM Applications
- Medchain: LLMs in Clinical Practice
- Query Nursing Note Summarization
- CLINICSUM: Patient Conversation Summaries
- Text Embeddings for Classifiers

LLM Benchmarks
- Polish Medical Exams Transfer
- Single-Cell Omics Annotation
- LLMs in Precision Medicine
- Low-Resource Healthcare Challenges

Other Models
- LLM Chatbot Hallucinations
- Multi-stage Chest X-ray Diagnosis
- EchoONE: Echocardiography AI
- Radiology Report Grounding

Ethics & Fairness
- Privacy in Medical Imaging
- Demographic Fairness in AI

Datasets
- LLM Scientific Knowledge Extraction
- Biomedical Knowledge Review

Now you can watch and listen to the latest Medical AI papers daily on our YouTube and Spotify channels as well!

- Full thread in detail: https://x.com/OpenlifesciAI/status/1865584829057929303
- YouTube Link: https://youtu.be/SwawtIFy-BI
- Spotify: https://open.spotify.com/episode/17Cxk0NLfKhiRWPoykzjem

clefourrier

authored a paper 8 months ago

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Paper • 2412.03304 • Published Dec 4, 2024 • 19

aaditya

posted an update 8 months ago

Post

3381

Last Week in Medical AI: Top Research Papers/Models 🔥
(November 2 - November 9, 2024)

🏅 Medical AI Paper of the Week:
Exploring Large Language Models for Specialist-level Oncology Care

Medical LLM & Other Models:
- GSCo: Generalist-Specialist AI Collaboration
- PediatricsGPT: Chinese Pediatric Assistant
- MEG: Knowledge-Enhanced Medical QA
- AutoProteinEngine: Multimodal Protein LLM

Frameworks and Methodologies:
- BrainSegFounder: 3D Neuroimage Analysis
- PASSION: Sub-Saharan Dermatology Dataset
- SAM for Lung X-ray Segmentation
- Label Critic: Data-First Approach
- Medprompt Runtime Strategies

Medical LLM Applications:
- CataractBot: Patient Support System
- CheX-GPT: X-ray Report Enhancement
- CardioAI: Cancer Cardiotoxicity Monitor
- HealthQ: Healthcare Conversation Chain
- PRObot: Diabetic Retinopathy Assistant

Medical LLMs & Benchmarks:
- MediQ: Clinical Reasoning Benchmark
- Touchstone: Segmentation Evaluation
- Medical LLM Adaptation Progress
- Fine-Tuning Medical QA Strategies

AI in Healthcare Ethics:
- Healthcare Robotics with LLMs
- XAI in Clinical Practice
- Precision Rehabilitation Framework
- Multimodal AI Challenges

Now you can watch and listen to the latest Medical AI papers daily on our YouTube and Spotify channels as well!

- Full Thread: https://x.com/OpenlifesciAI/status/1855207141302473090
- YouTube: https://youtu.be/ad0uTnYuTo8
- Spotify: https://open.spotify.com/episode/6s39t1UJZk1i10szuXP2qN

aaditya

posted an update 9 months ago

Post

2834

Last Week in Medical AI: Top Research Papers/Models 🔥
🏅 (October 26 - November 2, 2024)

🏅 Medical AI Paper of the Week:
Google Presents MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making

Medical LLM & Other Models:
- Matchmaker: Schema Matching with LLMs
- UltraMedical: Specialized Biomedical Models
- ZALM3: Vision-Language Medical Dialogue
- EchoFM: Echocardiogram Foundation Model

Frameworks and Methodologies:
- FEDKIM: Federated Medical Knowledge Injection
- Flex-MoE: Flexible Modality Combination
- MAISI: Synthetic Medical Imaging
- Cough-E: Edge Privacy Detection
- MassSpecGym: Molecule Identification

Medical LLM Applications:
- DiaMond: Multi-Modal Dementia Diagnosis
- LLM-Forest: Health Data Imputation
- Medical Multimodal Visual Grounding
- Clinical Evidence Synthesis with LLMs

Medical LLMs & Benchmarks:
- Histopathology Models Beyond H&E
- LLMs in Mental Health Counseling
- Medical Dataset Reuse Analysis

AI in Healthcare Ethics:
- LLMs in Medical Education
- Medical Exam Question Generation
- Clinical Knowledge Graph Integration

Now you can watch and listen to the latest Medical AI papers daily on our YouTube and Spotify channels as well!

- Full Thread: https://x.com/OpenlifesciAI/status/1852685220912464066
- YouTube: https://youtu.be/3O3xjaMCXHI
- Spotify: https://open.spotify.com/episode/05trbTbtVZcfI7ycA5Z3Tt?si=706b74626f714aa1

Open Life Science AI

AI & ML interests

Recent Activity

Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Mixtures of In-Context Learners

Aligning Generalisation Between Humans and Machines

Team members 5

openlifescienceai's activity