Discussion:
Indic Wikisource Update December 2016
(too old to reply)
Jayanta Nath
2016-12-06 17:56:35 UTC
Permalink
Raw Message
Hello all,

We've just published the November 2016 Indic Wikisource statistics. After
implementing Google OCR script to our all Indic Wikisource , they are
growing rapidly.

Here is the few stats ans their top three rank...

As per Number of article
1. Sanskrit Wikisource ( 15692 pages) - supported by 0.05% scan pages.
2. Telugu Wikisource ( 11752 pages) - supported by 24.3% scan pages.
3. Kannada Wikisource ( 8414 pages) - supported by 0.99% scan pages.


As per Number of page Validation

1. Telugu Wikisource ( 18192 pages)
2. Tamil Wikisource (5179 pages)
3. Gujarati Wikisource ( 4064 pages)


As per Number of page Proofread

1. Telugu Wikisource ( 22565 pages)
2. Malayalam Wikisource ( 8065 pages)
3. Tamil Wikisource ( 7876 pages)

As per percentage supported by scan pages.
1. Bengali Wikisource (27.16%)
2. Telugu Wikisource ( 24.56%)
3. Gujarati Wikisource (17.51%)

I want to specially mention that there are no visible improvement on
proofreading works and page content supported by scan at Marathi,Assamese,
Sanskrit, Odia, and Kannada Wikisource.


Full Indic Wikisource stats here
https://wikisource.org/wiki/Wikisource:Indic_Wikisource_Stats

Regards,
Jayanta Nath
Indic Wikisource Community
mathieu stumpf guntz
2016-12-07 14:00:18 UTC
Permalink
Raw Message
Thank you for this report Jayanta, that's interesting. Would you expect
more impact Marathi, Assamese, Sanskrit, Odia, and Kannada following the
OCR avaibility? I mean, to my mind Sanskrit is somewhat like Latin in
occidental countries, so I'm wouldn't expect a very large frenzy when a
new handy tool come to help with transcription. For other languages, I
don't know their vernacular popularity, I'll check their Wikipedia
article later. Would you expect more possible contributors for them?
Post by Jayanta Nath
Hello all,
We've just published the November 2016 Indic Wikisource statistics.
After implementing Google OCR script to our all Indic Wikisource ,
they are growing rapidly.
Here is the few stats ans their top three rank...
As per Number of article
1. Sanskrit Wikisource ( 15692 pages) - supported by 0.05% scan pages.
2. Telugu Wikisource ( 11752 pages) - supported by 24.3% scan pages.
3. Kannada Wikisource ( 8414 pages) - supported by 0.99% scan pages.
As per Number of page Validation
1. Telugu Wikisource ( 18192 pages)
2. Tamil Wikisource (5179 pages)
3. Gujarati Wikisource ( 4064 pages)
As per Number of page Proofread
1. Telugu Wikisource ( 22565 pages)
2. Malayalam Wikisource ( 8065 pages)
3. Tamil Wikisource ( 7876 pages)
As per percentage supported by scan pages.
1. Bengali Wikisource (27.16%)
2. Telugu Wikisource ( 24.56%)
3. Gujarati Wikisource (17.51%)
I want to specially mention that there are no visible improvement on
proofreading works and page content supported by scan at
Marathi,Assamese, Sanskrit, Odia, and Kannada Wikisource.
Full Indic Wikisource stats here
https://wikisource.org/wiki/Wikisource:Indic_Wikisource_Stats
<https://wikisource.org/wiki/Wikisource:Indic_Wikisource_Stats>
Regards,
Jayanta Nath
Indic Wikisource Community
_______________________________________________
Wikisource-l mailing list
https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Loading...