This bipartite network denotes which languages are spoken in which countries. Nodes are countries and languages; edge weights denote the proportion (between zero and one) of the population of a given country speaking a given language. To quote the Unicode data description: "The main goal is to provide approximate figures for the literate, functional population for each language in each territory: that is, the population that is able to read and write each language, and is comfortable enough to use it with computers."
| Code | UL
| |
| Internal name | unicodelang
| |
| Name | Unicode languages | |
| Data source | http://www.unicode.org/cldr/charts/25/supplemental/territory_language_information.html | |
| Availability | Dataset is available for download | |
| Consistency check | Dataset passed all tests | |
| Category | Feature network | |
| Dataset timestamp | 2015 | |
| Node meaning | Country, language | |
| Edge meaning | Hosts | |
| Network format | Bipartite, undirected | |
| Edge type | Positive weights, no multiple edges | |
| Zero weights | Edges may have weight zero |
| [1] | Jérôme Kunegis. KONECT – The Koblenz Network Collection. In Proc. Int. Conf. on World Wide Web Companion, pages 1343–1350, 2013. [ http ] |