Ā·
AI & ML interests
artificial intelligence
Recent Activity
reacted
to
ajibawa-2023's
post with š 9 days ago PHP-Code-Large
Dataset: https://huggingface.co/datasets/ajibawa-2023/PHP-Code-Large
PHP-Code-Large is a large-scale corpus of PHP source code comprising more than 12 million lines of PHP code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and static program analysis for the PHP ecosystem.
By providing a high-volume, language-specific corpus, PHP-Code-Large enables systematic experimentation in PHP-focused model training, domain adaptation, and downstream code understanding tasks.
PHP-Code-Large addresses the need for a dedicated PHP-only dataset at substantial scale, enabling focused research across backend systems, CMS platforms, APIs, and full-stack PHP environments. View all activity
Organizations
None yet