CLDF dataset derived from Mamta's "South Asian Numerals Database" from 2024

How to cite

If you use these data, please cite

- the original source:
  Mamta, K. (2024): South Asian Numerals Database (SAND). Leipzig: Max Planck Institute for Evolutionary Anthropology.
- the derived dataset, using the DOI of the particular released version you used.

Description

This dataset is licensed under CC-BY-4.0.

Available online at https://github.com/numeralbank/sand

Statistics

Varieties: 131 (linked to 129 different Glottocodes)
Concepts: 130 (linked to 119 different Concepticon concept sets)
Lexemes: 15,364
Sources: 9
Synonymy: 1.02
Invalid lexemes: 0
Tokens: 140,583
Segments: 127 (0 BIPA errors, 0 CLTS sound class errors, 127 CLTS modified)
Inventory size (avg): 27.77
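
To make the figures above easier to interpret, here is a minimal sketch of how comparable numbers could be derived from the rows of a CLDF form table. It is not the script that produced the statistics. It assumes the default CLDF column names (`Language_ID`, `Parameter_ID`, `Segments`), reads "Synonymy" as the average number of forms per filled variety/concept pair, reads "Inventory size (avg)" as the mean number of distinct segments per variety, and skips `+` and `_` as boundary markers. The function name `wordlist_stats` is hypothetical.

```python
from collections import defaultdict

# Assumption: these symbols mark boundaries and are not counted as sounds.
BOUNDARY_MARKERS = {"+", "_"}


def wordlist_stats(rows):
    """Summarise a list of form rows (dicts) in the style of the README statistics.

    Assumed definitions, not confirmed by the dataset authors:
    - synonymy: forms per filled (variety, concept) pair
    - inventory size (avg): mean count of distinct segments per variety
    """
    forms_per_slot = defaultdict(int)   # (variety, concept) -> number of forms
    inventories = defaultdict(set)      # variety -> distinct segments
    tokens = 0

    for row in rows:
        variety, concept = row["Language_ID"], row["Parameter_ID"]
        forms_per_slot[(variety, concept)] += 1
        # CLDF stores segmented forms as space-separated strings.
        segments = [s for s in row["Segments"].split() if s not in BOUNDARY_MARKERS]
        tokens += len(segments)
        inventories[variety].update(segments)

    all_segments = set().union(*inventories.values())
    n_varieties = len(inventories)
    n_slots = len(forms_per_slot)
    return {
        "varieties": n_varieties,
        "concepts": len({concept for _, concept in forms_per_slot}),
        "lexemes": len(rows),
        "synonymy": len(rows) / n_slots if n_slots else 0.0,
        "tokens": tokens,
        "segments": len(all_segments),
        "inventory_size_avg": (
            sum(len(inv) for inv in inventories.values()) / n_varieties
            if n_varieties else 0.0
        ),
    }
```

Under these assumptions, a synonymy value close to 1 means that most varieties have exactly one form per numeral concept.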

Possible Improvements:

Languages linked to bookkeeping languoids in Glottolog:
- Phangduwali phan1256

Contributors

Name	GitHub user	Description	Role
Mamta Kumari	@Mamta-Kum	Data Collection	Author
Johann-Mattis List	@LinguList	Prepared initial version of the CLDF data	Other
Christoph Rzymski	@chrzyki	Maintainer	Other

CLDF Datasets

The following CLDF datasets are available in cldf:

CLDF Wordlist at cldf/cldf-metadata.json
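
As a rough illustration of how the wordlist could be read without extra dependencies, the sketch below looks up the form table declared in the metadata file and loads it with the Python standard library. It assumes a local clone of the repository and relies on the CLDF convention that each table description carries a `dc:conformsTo` term and a `url` relative to the metadata file. The function name `read_forms` is hypothetical; dedicated CLDF tooling handles datatypes and validation that this sketch ignores.

```python
import csv
import json
from pathlib import Path

# CLDF ontology term that identifies the form table in the metadata.
FORM_TABLE = "http://cldf.clld.org/v1.0/terms.rdf#FormTable"


def read_forms(metadata_path):
    """Return the rows of the form table as a list of dicts (all values are strings)."""
    metadata_path = Path(metadata_path)
    with metadata_path.open(encoding="utf-8") as handle:
        metadata = json.load(handle)

    for table in metadata.get("tables", []):
        if table.get("dc:conformsTo") == FORM_TABLE:
            # Table URLs are resolved relative to the metadata file.
            csv_path = metadata_path.parent / table["url"]
            with csv_path.open(encoding="utf-8", newline="") as handle:
                return list(csv.DictReader(handle))

    raise ValueError("no FormTable declared in {}".format(metadata_path))


# Example call, assuming the repository has been cloned into the working directory:
# forms = read_forms("cldf/cldf-metadata.json")
```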

