Proposal:A central repository of all language independent data
Every proposal should be tied to one of the strategic priorities below.
- Achieve continued growth in readership
- Focus on quality content
- Increase Participation
- Stabilize and improve the infrastructure
- Encourage Innovation
This proposal is associated with the bolded strategic priorities below.
It has been suggested that this page be merged with the following proposals:
- Proposal:Alignment
- Proposal:Base de donnée interlangue
- Proposal:To create standard basic table template accross Wiki
- Proposal:Structured Data
- Proposal:A 'common knowledge' database - like 'Cyc'
- Proposal:Building a database of all books ever published
- Proposal:Templates.wikimedia.org
- Proposal:Unification
- Proposal:A central wiki for interlanguage links
See also: Proposal:Data.wikimedia.org
Summary
A central repository that will contain all data that are language-independent.
Proposal
- Create a central repository that will contain all data that are language-independent, with special emphasis on data that change frequently. Examples: census data about towns and countries, population and other statistics, names of current presidents and mayors, election results, etc.
- Create a system that delivers these data to all wikis automatically, so that when a new president is elected in a country, we simply change the name in the repository and every wiki shows it without a manual edit.
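The two points above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not a design for MediaWiki: the `CentralRepository` class, the `{{data:Entity|property}}` placeholder syntax, and the names "Exampleland", "A. Example" and "B. Sample" are all hypothetical and do not come from this proposal.

```python
import re


class CentralRepository:
    """Hypothetical in-memory stand-in for the central data store."""

    def __init__(self):
        self._facts = {}

    def set(self, entity, prop, value):
        # One language-independent fact per (entity, property) pair.
        self._facts[(entity, prop)] = value

    def get(self, entity, prop):
        return self._facts[(entity, prop)]


# Hypothetical placeholder syntax: {{data:Entity|property}}
PLACEHOLDER = re.compile(r"\{\{data:([^|}]+)\|([^}]+)\}\}")


def render(page_text, repo):
    """Replace every placeholder with the current value from the repository."""
    return PLACEHOLDER.sub(
        lambda m: str(repo.get(m.group(1), m.group(2))), page_text
    )


repo = CentralRepository()
repo.set("Exampleland", "president", "A. Example")

# The same fact is referenced from pages in two languages.
pages = {
    "en": "The president of Exampleland is {{data:Exampleland|president}}.",
    "it": "Il presidente di Exampleland è {{data:Exampleland|president}}.",
}

before = {lang: render(text, repo) for lang, text in pages.items()}

# A single change in the repository is picked up by every wiki on next render.
repo.set("Exampleland", "president", "B. Sample")
after = {lang: render(text, repo) for lang, text in pages.items()}
```

The pages themselves never store the president's name, so no page needs editing when the value changes; only the repository entry does.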
Motivation
On small wikis in particular, it is difficult to keep data up to date, and these wikis very often show wrong or outdated information. Large wikis cope with this issue reasonably well, but this proposal would still save a lot of time and ensure greater accuracy by avoiding inconsistencies between wikis and reducing delays in updating data.
This system will also provide automatic conversion between units of measurement (for example, data stored in kilometers could be delivered in miles to wikis that prefer that unit).
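A minimal sketch of that conversion step, assuming distances are stored centrally in kilometers and each wiki declares one preferred unit. The wiki identifiers and the `WIKI_UNIT_PREFERENCE` table are hypothetical; the conversion factor is the definition of the international mile.

```python
KM_PER_MILE = 1.609344  # exact, by definition of the international mile

# Hypothetical per-wiki preference table; the wiki names are placeholders.
WIKI_UNIT_PREFERENCE = {"wiki_miles": "mi", "wiki_km": "km"}


def deliver_distance(value_km, wiki):
    """Return (value, unit) for a distance stored in km, in the wiki's unit."""
    unit = WIKI_UNIT_PREFERENCE.get(wiki, "km")  # default to the stored unit
    if unit == "mi":
        return round(value_km / KM_PER_MILE, 1), "mi"
    return value_km, "km"
```

For example, `deliver_distance(160.9344, "wiki_miles")` yields a value in miles, while the same call for `"wiki_km"` returns the stored value unchanged. Converting at delivery time keeps a single stored value per fact.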
Key Questions
If someone asked for an interlingua for machine translation on http://meta.wikimedia.org/wiki/Requests_for_new_languages , would they be likely to succeed? If not -- and assuming that there hasn't been such an approval that I don't know about -- doesn't that indicate the flawed use of language demand statistics?
Is this the same proposal as Proposal:Data-driven_content in this strategic planning process and should they be merged? And are both a revival of the older Wikidata proposal? --Travelplanner 22:39, 22 August 2009 (UTC)
Potential Costs
Technically, I think it is feasible. Both parts of the work, 1) establishing the central repository and 2) "distributing" data to all the wikis, can be automated with a bot. It will not be easy, however, given the huge amount of data that Wikipedia contains.
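As a rough idea of what the "distributing" bot might do, the sketch below compares the central values with each wiki's local copy and plans only the edits that are needed. It works on plain dictionaries; the function names, the key format and the sample values are hypothetical, and a real bot would read and write pages through the wiki's own interfaces.

```python
def plan_updates(central, local_copies):
    """List (wiki, key, old, new) for every local value that differs from central."""
    updates = []
    for wiki, local in sorted(local_copies.items()):
        for key, value in sorted(central.items()):
            if local.get(key) != value:
                updates.append((wiki, key, local.get(key), value))
    return updates


def apply_updates(updates, local_copies):
    """Write the planned values into each wiki's local copy."""
    for wiki, key, _old, new in updates:
        local_copies[wiki][key] = new


# Hypothetical data: one wiki is out of date, the other is already current.
central = {"Exampleland/president": "B. Sample"}
local_copies = {
    "wiki_a": {"Exampleland/president": "A. Example"},
    "wiki_b": {"Exampleland/president": "B. Sample"},
}

plan = plan_updates(central, local_copies)
apply_updates(plan, local_copies)
remaining = plan_updates(central, local_copies)
```

Planning before writing means the bot touches only the wikis that are actually out of date, which matters given the volume of data involved.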
References
- http://en.wikipedia.org/wiki/Interlingual_machine_translation
- http://en.wikipedia.org/wiki/Lincos_(language) -- could this be adapted into a new microformat?
- Wikidata
Community Discussion
Do you have a thought about this proposal? A suggestion? Discuss this proposal by going to Proposal talk:A central repository of all language independent data.