-
-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
History Pack 1 Black Label Data Entry #118
Comments
Here's the breakdown of work I've been thinking of - is this how you're thinking about it, too?
|
I think you covered it all, yup! |
It's a lot! 😅 |
It is 😅 I also will need to update the SQL server script too, as I think about it. So, will take a few days, but should hopefully have the repo ready soon! Will give us time to recruit volunteers. |
Started setting stuff up over on this branch. I think the remaining things needing done are adding .ods and .csv files for the files you have listed, and finishing updating the scripts. I'll work more on the scripts either tonight or tomorrow, but if you have time I would appreciate help setting up the rest of the files! |
I'm curious if this is something you can partly automate using OCR and telling it the language? All the text sits within certain coordinates, and if the image sizes are identical to the rest (as most are), this should be -relatively- straight forward to transcribe automatically, with some gaps? I mention this because before I came across this repo, I was working on exactly that as the LSS data is just too erroneous (as we all know - lol) |
I have not messed around with OCR enough to want to dedicate time to making an OCR script for the repo right now - for now I will leave the data to be hand entered and double checked, but if you have an OCR script you'd like to use or add to the repo to input the data, I have no issues with that being an available resource for people! |
Yeah that's fair. It's pretty easy to scan images using something like tesseract, but I hear you. The best it could do would be to do the initial population of data, and based on text coordinates it would know whether it's a title, copyright info, card text.etc. But would still need input for things like card stats, such as pitch/attack.etc (although this can certainly be done using machine learning tools, but then that would start to cost money, so...). |
Honestly the text is the hardest part for me personally (at least if you're counting the amount of text bugfixes in the repo history....), so something to get the initial text in would be pretty rad in the long term. |
It's something I began working on for FaB DB. I'll see if it'll work if we set the language and share :) |
Hi ! I can look at the OCR script, but for me, the only things needed are Name and "Inner text". Every stats can be found with initial ref of the card ( EN : 1HP204 : https://storage.googleapis.com/fabmaster/media/images/1HP204.width-450.png - FR : https://storage.googleapis.com/fabmaster/cardfaces/2022-1HP/FR/FR_1HP204.png). I'll try to do something when I'll have an evening to spare ! :D |
that's a really good point about the card stats already being in place! Ez mode then! haha The biggest challenge will be the icons in the card text. |
Hello, my name is Carlos Gutiérrez and I would love to help with the Spanish translations. |
Hi, my name is Tim, if i can help with the translations in any way let me know. |
Hey there! |
Hello everyone, |
Hey. there everyone, thank you so much for all of the offers! I'm still recovering from Outsiders spoiler season, but after I get some rest I'll finish up getting the repo ready for you all to help out 😄 |
This issue will serve as the staging ground for the History Pack 1 Black Label data entry effort.
Please leave a comment here if you have interest in assisting with transcribing data, and which languages you are interested in transcribing for. When I have the repo ready for this effort, I will update this comment with more instructions and contact those who are interested!
The text was updated successfully, but these errors were encountered: