| Subject | Wikidata (Wikimedia Foundation) |
| Purpose | Data structure evaluation for WikiDeal |
| Author | Théo Bondolfi / Ynternet.org |
| Status | R&D · Open for review |
| Related | Wikimedia References |
Wikidata Analysis: Could WikiDeal Use Wikidata?
Wikidata is the free, open, collaborative knowledge base maintained by the Wikimedia Foundation. It serves as the structured data backbone for Wikipedia and other Wikimedia projects. This page analyses whether WikiDeal could use or integrate with Wikidata for its own data structures: contracts, user groups, service categories, and more. WM-01
1. What is Wikidata?
Wikidata (wikidata.org) is a free, open, secondary database containing structured data (entities, properties, and statements) that can be queried by anyone. Key facts:
- Launched in 2012; hosted by the Wikimedia Foundation, with development led by Wikimedia Deutschland.
- Contains over 100 million items as of 2025.
- Multilingual by design: each item has labels in dozens of languages.
- Queryable via SPARQL (Wikidata Query Service).
- All data released under CC0 (public domain).
- API available for reading and writing data.
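As a rough illustration of the read API and the multilingual labels mentioned above, the sketch below builds a `wbgetentities` request URL and picks a label from the response in a preferred language order. The helper names (`build_labels_url`, `pick_label`) and the language order are assumptions made for this example, not part of any WikiDeal code; the sample entity is a hand-trimmed stand-in for a real API response.

```python
from urllib.parse import urlencode

WIKIDATA_API = "https://www.wikidata.org/w/api.php"
# Assumption: preference order based on the languages WikiDeal operates in.
WIKIDEAL_LANGUAGES = ("fr", "en", "de", "it", "es")


def build_labels_url(item_ids, languages=WIKIDEAL_LANGUAGES):
    """Return a wbgetentities URL that requests only labels in the given languages."""
    params = {
        "action": "wbgetentities",
        "ids": "|".join(item_ids),
        "props": "labels",
        "languages": "|".join(languages),
        "format": "json",
    }
    return f"{WIKIDATA_API}?{urlencode(params)}"


def pick_label(entity, preferred=WIKIDEAL_LANGUAGES):
    """Return the first label available in preference order, or None if there is none."""
    labels = entity.get("labels", {})
    for lang in preferred:
        if lang in labels:
            return labels[lang]["value"]
    return None


# Hand-written stand-in for one entry of the "entities" object in an API response.
sample_entity = {
    "labels": {
        "en": {"language": "en", "value": "Geneva"},
        "de": {"language": "de", "value": "Genf"},
    }
}
print(build_labels_url(["Q71"]))
print(pick_label(sample_entity))  # no French label here, so the English one is used
```

Keeping the URL builder separate from the label picker means the fallback logic can be tested without any network access.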
2. Advantages of Using Wikidata
- ✅ Linked data: Wikidata entities are interlinked with Wikipedia, OpenStreetMap, and hundreds of other datasets, enabling rich, contextual data for WikiDeal service categories, organizations, and locations.
- ✅ Multilingual: All data is natively multilingual. WikiDeal operates in FR, EN, DE, IT, and ES; Wikidata supports all of these.
- ✅ Existing infrastructure: No need to build and maintain a knowledge base from scratch. Wikidata already has millions of items relevant to WikiDeal (organizations, cities, service types, legal frameworks).
- ✅ Community maintained: Thousands of volunteers continuously update and verify Wikidata. WikiDeal benefits from this shared maintenance.
- ✅ CC0 licence: Public domain data can be integrated into WikiDeal without licence conflicts (though WikiDeal's own data would remain AGPL v3).
- ✅ SPARQL queries: Complex data relationships (e.g., "all NGOs active in Geneva, categorized by theme") can be queried directly.
- ✅ Alignment with mission: Using Wikidata is consistent with WikiDeal's open-knowledge philosophy and Wikimedia references (WM-01, WM-09, WM-10).
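To make the SPARQL point concrete, here is a minimal sketch of how a query along the lines of "organizations of a given type headquartered in a given place" could be assembled for the Wikidata Query Service. It covers only the simpler part of the example above (no grouping by theme). The function names are hypothetical, and the item IDs used in the usage lines (Q79913 for non-governmental organization, Q71 for Geneva) are assumptions that should be checked on wikidata.org before use. The properties P31 (instance of), P279 (subclass of), and P159 (headquarters location) are standard Wikidata properties.

```python
import re
from urllib.parse import urlencode

SPARQL_ENDPOINT = "https://query.wikidata.org/sparql"


def _check_qid(value):
    """Reject anything that is not a plain item ID, so it is safe to embed in the query."""
    if not re.fullmatch(r"Q[1-9][0-9]*", value):
        raise ValueError(f"not a Wikidata item ID: {value!r}")
    return value


def build_org_query(org_class_qid, place_qid, language="en", limit=50):
    """Build a SPARQL query for organizations of a class headquartered in a place."""
    org_class_qid = _check_qid(org_class_qid)
    place_qid = _check_qid(place_qid)
    return f"""
SELECT ?org ?orgLabel WHERE {{
  ?org wdt:P31/wdt:P279* wd:{org_class_qid} ;
       wdt:P159 wd:{place_qid} .
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "{language}". }}
}}
LIMIT {int(limit)}
""".strip()


def build_query_url(query):
    """Return a GET URL for the query service that asks for JSON results."""
    return f"{SPARQL_ENDPOINT}?{urlencode({'query': query, 'format': 'json'})}"


# Assumed IDs for illustration: Q79913 (NGO) and Q71 (Geneva).
query = build_org_query("Q79913", "Q71", language="fr")
print(query)
print(build_query_url(query))
```

The `wdt:P31/wdt:P279*` path also matches items typed with a subclass of the requested class, which is usually needed on Wikidata because organizations are rarely tagged with the most general type.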
3. Disadvantages and Risks
- ⚠️ Governance complexity: Wikidata has its own governance model, policies, and community norms. WikiDeal cannot unilaterally control or modify Wikidata content; changes must go through Wikidata's community process.
- ⚠️ Dependency risk: If WikiDeal relies heavily on Wikidata, any change to Wikidata's API, data model, or availability could disrupt WikiDeal's operations.
- ⚠️ Data sovereignty concerns: Personal data (user profiles, contract terms, sensitive community information) cannot be stored in Wikidata because it is a public database. WikiDeal must maintain its own private data layer.
- ⚠️ Vandalism risk: Wikidata items can be edited by anyone. Critical WikiDeal reference data could be vandalized or incorrectly modified.
- ⚠️ Latency and availability: Real-time applications (e.g., live session tracking for street fundraising) cannot depend on an external API with variable latency.
- ⚠️ Schema mismatch: Wikidata's general-purpose schema may not fit WikiDeal's specific contract and marketplace data structures.
- ⚠️ Write limitations: Creating new Wikidata items for every WikiDeal service or contract is impractical and contrary to Wikidata's notability guidelines.
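One common way to soften the dependency and latency risks listed above is to read Wikidata through a local cache that keeps serving the last known copy when the external service is slow or unreachable. The sketch below is one possible shape for such a cache, not a description of WikiDeal's architecture; the class name `ReferenceCache` and the injected `fetch` callable are hypothetical.

```python
import time


class ReferenceCache:
    """Local cache for Wikidata reference data with a stale-on-error fallback."""

    def __init__(self, fetch, ttl_seconds=86400, clock=time.time):
        self._fetch = fetch      # hypothetical callable: item_id -> data (the network call)
        self._ttl = ttl_seconds  # how long a cached copy counts as fresh
        self._clock = clock      # injectable clock, which makes the cache easy to test
        self._store = {}         # item_id -> (fetched_at, data)

    def get(self, item_id):
        now = self._clock()
        cached = self._store.get(item_id)
        if cached is not None and now - cached[0] < self._ttl:
            return cached[1]     # fresh copy: no external call at all
        try:
            data = self._fetch(item_id)
        except Exception:
            if cached is not None:
                return cached[1]  # Wikidata unreachable: serve the stale copy
            raise                 # nothing cached yet: surface the error
        self._store[item_id] = (now, data)
        return data
```

A cached copy also gives a natural review point for the vandalism risk: changes fetched from Wikidata can be compared with the stored version before they replace it.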
4. Comparison Table
| Criterion | Wikidata | WikiDeal own DB |
|---|---|---|
| Multilingual | ✅ Native, 200+ languages | Manual (custom implementation needed) |
| Linked to Wikipedia | ✅ Direct Q-item links | Via API integration only |
| Data sovereignty | ⚠️ Public, no private data possible | ✅ Full control |
| Community maintenance | ✅ 25,000+ active contributors | WikiDeal community only |
| SPARQL queries | ✅ Built-in query service | Requires custom query layer |
| Real-time performance | ⚠️ External dependency | ✅ Internal, optimizable |
| Contract data structures | ⚠️ Not designed for this | ✅ Custom-built |
| Governance control | ⚠️ Wikimedia community rules | ✅ WikiDeal community |
| Licence | ✅ CC0 (public domain) | AGPL v3 |
5. Proposed Conclusion
Hybrid approach proposed:
- Wikidata for reference data: Use Wikidata Q-items to identify organizations (NGOs, associations), geographic entities (cities, regions), service categories, and legal frameworks. Link WikiDeal items to Wikidata IDs where available.
- WikiDeal own database for operational data: All contract data, user profiles, Credits (WDC), session logs, and community governance data remain in WikiDeal's own AGPL v3 database.
- Wikidata contribution: Where WikiDeal creates new knowledge (e.g., new service categories, new contract types), contribute it back to Wikidata in line with Wikidata's notability and community guidelines.
This approach maximizes the benefits of linked data and multilingual support while maintaining data sovereignty and real-time performance. It is consistent with WikiDeal's libre-licence philosophy and avoids undue external dependency.
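In data-model terms, the hybrid approach could look like the sketch below: the record lives in WikiDeal's own database and carries an optional Wikidata item ID as a public reference link, nothing more. The `Organization` class and its fields are a hypothetical schema invented for illustration, and the Q71 value in the usage lines is an assumed ID for Geneva.

```python
import re
from dataclasses import dataclass
from typing import Optional

QID_PATTERN = re.compile(r"Q[1-9][0-9]*")


@dataclass(frozen=True)
class Organization:
    """Record stored in WikiDeal's own database (hypothetical schema)."""
    local_id: int
    name: str
    # Public reference link only; private or operational data never goes to Wikidata.
    wikidata_id: Optional[str] = None

    def __post_init__(self):
        # Reject malformed IDs early so broken links never reach the database.
        if self.wikidata_id is not None and not QID_PATTERN.fullmatch(self.wikidata_id):
            raise ValueError(f"not a Wikidata item ID: {self.wikidata_id!r}")

    @property
    def wikidata_url(self) -> Optional[str]:
        """Return the public item page URL, or None when no link exists."""
        if self.wikidata_id is None:
            return None
        return f"https://www.wikidata.org/wiki/{self.wikidata_id}"


linked = Organization(local_id=1, name="City of Geneva", wikidata_id="Q71")
unlinked = Organization(local_id=2, name="Local association without a Wikidata item")
print(linked.wikidata_url)
print(unlinked.wikidata_url)
```

Making the link optional reflects the "where available" wording above: most WikiDeal records will never meet Wikidata's notability guidelines and simply stay unlinked.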