Spaces:
Running
A newer version of the Gradio SDK is available:
5.22.0
title: Manga Translator
short_description: Translate manga from Japanese to English
tags:
- manga
- translate
- manga panel
emoji: π
colorFrom: pink
colorTo: yellow
sdk: gradio
pinned: true
app_file: app.py
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
Manga Translator
Introduction
I love reading manga, and I can't wait for the next chapter of my favorite manga to be released. However, the newest chapters are usually in Japanese, and they are translated to English after some time. I want to read the newest chapters as soon as possible, so I decided to build a manga translator that can translate Japanese manga to English.
GitHub Project
The GitHub repository for this project can be found here.
Approach
I want to translate the text in the manga images from Japanese to English. I will first need to know where these speech bubbles are on the image. For this I will use Yolov8
to detect the speech bubbles. Once I have the speech bubbles, I will use manga-ocr
to extract the text from the speech bubbles. Finally, I will use deep-translator
to translate the text from Japanese to English.
Data Collection
This dataset contains over 8500 images of manga pages together with their annotations from Roboflow. I will use this dataset to train Yolov8
to detect the speech bubbles in the manga images. To use this dataset with Yolov8, I will need to convert the annotations to the YOLO format, which is a text file containing the class label and the bounding box coordinates of the object in the image.
This dataset is over 1.7GB in size, so I will need to download it to my local machine. The rest of the code should be run after the dataset has been downloaded and extracted in this directory.
The dataset contains mostly English manga, but that is fine since I am only interested in the speech bubbles.
Yolov8
Yolov8
is a state-of-the-art, real-time object detection system that I've used in the past before. I will use Yolov8
to detect the speech bubbles in the manga images.
Manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga. This Python package is built and trained specifically for extracting text from manga images. This makes it perfect for extracting text from the speech bubbles in the manga images.
Deep-translator
Deep-translator
is a Python package that uses the Google Translate API to translate text from one language to another. I will use deep-translator
to translate the text extracted from the manga images from Japanese to English.