Indic-TechCom/Tools/IndicOCR

From Meta, a Wikimedia project coordination wiki

Demo of https://indic-ocr.toolforge.org

Indic-OCR is a tool that lets the Indic community run OCR on images on Wikisource. The tool is available at https://indic-ocr.toolforge.org/

What it does

It converts images to text for Wikisource.

Wikisources that benefit

Google OCR currently does not work for the following four Indic-language Wikisources:

  1. Malayalam Wikisource
  2. Oriya Wikisource
  3. Gujarati Wikisource
  4. Tamil Wikisource


Demo of IndicOCR userscript




On-wiki

You can run OCR directly on a wiki page while you edit it. See the userscript demo for how this works.

Installation

Add the following code to your common.js page on your local wiki.

mw.loader.load('//meta.wikimedia.org/w/index.php?title=User:Indic-TechCom/Script/IndicOCR.js&action=raw&ctype=text/javascript');

If you also want an extra button in the Visual Editor, add the following code to the same common.js page as well.

mw.loader.load('//meta.wikimedia.org/w/index.php?title=User:Indic-TechCom/Script/OCR4VE.js&action=raw&ctype=text/javascript');
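If you install both scripts, the two lines can be combined. The following is only a sketch of one way to do that, not the tool authors' recommended setup: buildMetaScriptUrl, loadIndicOcrScripts and INDIC_OCR_SCRIPTS are hypothetical names introduced here for illustration, and only mw.loader.load and the two script titles come from the instructions above.

```javascript
// Hypothetical helper (not part of MediaWiki): builds the raw-JavaScript
// URL for a script page hosted on Meta-Wiki.
function buildMetaScriptUrl(title) {
  return '//meta.wikimedia.org/w/index.php?title=' + title +
    '&action=raw&ctype=text/javascript';
}

// The two script pages named in the installation instructions.
var INDIC_OCR_SCRIPTS = [
  'User:Indic-TechCom/Script/IndicOCR.js', // main OCR userscript
  'User:Indic-TechCom/Script/OCR4VE.js'    // extra Visual Editor button
];

// Hypothetical helper: asks the given loader to load every script in the list.
function loadIndicOcrScripts(loader) {
  INDIC_OCR_SCRIPTS.forEach(function (title) {
    loader.load(buildMetaScriptUrl(title));
  });
}

// mw only exists inside a MediaWiki page, so guard before using it.
if (typeof mw !== 'undefined' && mw.loader) {
  loadIndicOcrScripts(mw.loader);
}
```

Remove the OCR4VE.js entry from the list if you do not want the Visual Editor button.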