You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Mahjong AI Project

This repository contains the code and resources for developing a Mahjong AI model, focusing on tabular classification of board states. The primary goal is to predict outcomes or optimal moves based on the current game state.

Project Structure

Notebooks (.ipynb):
- preprocess_*.ipynb: Scripts for processing raw Mahjong game data into features suitable for model training.
- transform_hf_dataset.ipynb: Script for transforming data into a Hugging Face dataset format.
- push_parquet_to_hub_by_year.ipynb: Script to upload processed data (likely in Parquet format) to the Hugging Face Hub.
- tenhou_prediction_deepLearning_basic.ipynb: This notebook shows the basic process of how the classification model was trained, using features derived from player hand information (columns 68-135) and discard pool information (columns 238-373). The model predicts the discarded tile (column 510).
- tenhou_prediction_multitask_hf.ipynb: This notebook explores a new, currently non-functional approach attempting multitask learning with dynamic inputs.
Python Scripts (.py):
- tools.py: Utility functions used across different notebooks.
Model & Config:
- config.json, model.safetensors: Configuration and saved model files associated with the trained Mahjong AI.
Data: The project utilizes the pjura/mahjong_board_states dataset from Hugging Face. The primary training notebook (tenhou_prediction_deepLearning_basic.ipynb) uses features derived from player hand information (columns 68-135) and discard pool information (columns 238-373), predicting the target value (discarded tile) found in column 510.

Getting Started

This model can be used in conjunction with the pjura/mahjong_vision model to create a fully automated Mahjong playing AI.

The model was trained using the pjura/mahjong_board_states dataset.

Downloads last month: 4

Safetensors

Model size

917k params

Tensor type

F32

Dataset used to train pjura/mahjong_ai

Evaluation results

Accuracy on pjura/mahjong_board_states
test set self-reported

0.661

View on Papers With Code