Preview

Vavilov Journal of Genetics and Breeding

Advanced search

Reconstructing the genetic structure of the Kazakh from clan distribution data

https://doi.org/10.18699/VJ18.431

Abstract

Applying quasigenetic markers - non-biological traits which are nevertheless inherited in generations - is one of the research fields within human population genetics. For the West European, East European, and Caucasus populations, surnames are typical quasigenetic markers. For Central Asian populations, particularly Kazakh, the clan affiliation serves as a good marker: a set of papers demonstrated that many clans include mainly persons which biologically descent from a recent common ancestor. In this study, we analyzed a large (~4.2 million persons) dataset on quasigenetic markers - the geographic distribution of 50 Kazakh clans at the beginning of the 20th century, and compared the dataset with the direct data of the Y-chro-mosomal diversity in modern Kazakh populations. The analysis included three steps: the isonymy method, which is standard for quasigenetic markers, comparing frequencies of quasigenetic markers, and comparing the quasigenetic and genetic datasets. We constructed 50 maps of frequency of the distribution of each clan and revealed that these maps correlate with the maps of genetic distances. The Mantel test also demonstrated a significant correlation between geographic and quasigenetic distances (г = 0.60; p < 0.05). The analysis of inter-population variability revealed the largest diversity between geographic territories corresponding to the social-territorial groups of the Kazakh Khanate (zhuzes) rather than to other historical groups that existed on the territory of Kazakhstan in preceding and modern epochs. The same is evidenced by the principal components and multidimensional scaling plots, which grouped geographic populations into three clusters corresponding to three zhuzes. This indicates that the final structuring of the Kazakh gene pool might have occurred during the Kazakh Khanate period.

About the Authors

M. K. Zhabagin
National Center for Biotechnology; National Laboratory Astana, Nazarbayev University
Kazakhstan
Astana


О. Е. Balanovsky
Vavilov Institute of General Genetics, RAS; Research Centre for Medical Genetics; Biobank of North Eurasia
Russian Federation

Moscow



Zh. M. Sabitov
L.N. Gumilyov Eurasian National University
Kazakhstan
Astana


A. Z. Temirgaliyev

Russian Federation


A. T. Agdzhoyan
Vavilov Institute of General Genetics, RAS; Research Centre for Medical Genetics
Russian Federation

Moscow



S. M. Koshel
Lomonosov Moscow State University
Russian Federation


Е. М. Ramankulov
National Center for Biotechnology
Kazakhstan
Astana


E. V. Balanovska
Research Centre for Medical Genetics; Biobank of North Eurasia
Russian Federation

Moscow



References

1. Amanzholov S.A. The Questions of Dialectology and the History of the Kazakh Language. Almaty, 1959. (in Russian)

2. Balanovska E.V., Balanovsky O.P. The Russian Gene Pool on the East European Plain. Moscow: Luch Publ., 2007. (in Russian)

3. Balanovskaya E.V., Balanovskii O.P., Ginter E.K., Pocheshkhova E.A. Gene-geographic analysis of a subdivided population. II. Geography of random inbreeding based on surname frequencies in Adygs. Rus. J. Genetics. 2000;36(8):936-948.

4. Balanovska E.V., Romanov A.G., Balanovsky O.P. Namesakes or relatives? Approaches to investigating the relationship between Y-chromosomal haplogroups and surnames. Molekulyarnaya Biologiya = Molecular Biology (Moscow). 2011;45(3):430-441. DOI 10.1134/S0026893311030022.

5. Balanovska E.V., Solovyeva D.S., Balanovsky O.P., Churno-sov M.I., Sorokina I.N., Evseeva I.V., Abolmasov N.N., Pochesh-kova E.A., Sereyogin Y.A., Pshenichnov A.S. “Family portraits” of five regions of Russia. Meditsinskaya Genetika = Medical Genetics. 2005;1:2-10. (in Russian)

6. Balanovsky O.P., Buzhilova A.P., Balanovskaya E.V. The Russian gene pool: gene geography of surnames. Rus. J. Genet. 2001;73(7): 807-822. DOI 10.1023/A:1016755111586.

7. Bochkov N.P., Zakharov A.F., Ivanov A.I. Medical Genetics. Moscow: Meditsina Publ., 1984. (in Russian)

8. Vos-trov V.B., Mukanov M.S. The Tribal Structure and Settlement of the Kazakhs (late XIX - early XX). Alma-Ata, 1968. (in Russian)]

9. El’chi-nova G.I., Terekhovskaya I.G., Osetrova A.A., Poryadina O.A., Zinchenko R.A. Surname distribution and random inbreeding in Kirov oblast. Rus. J. Genet. 2009;45(10):1247-1255. DOI 10.1134/S1022795409100135.

10. Zhabagin M.K., Dibirova Kh.D., Frolova S.A., Sabitov Z.M., Yusu-pov Y.M., Utevskaya O.M., Tarlykov P.V., Tazhigulova I.M., Bala-ganskaya O.A., Nimadava P., Zakharov I.A., Balanovsky O.P. The relation between Y chromosome variation and the clan structure: the gene pool of the steppe aristocracy and the steppe clergy of the Kazakhs. Vestnik Moskovskogo Universiteta. Seria XXIII. Antro-pologia = Vestnik of Moscow University. Ser. XXIII. Antropology. 2014;1:96-101. (in Russian)

11. Zhabagin M.K., Sabitov Zh.M., Agdzhoyan A.A., Yusupov Y.M., Bogunov Y.V., Lavryashina M.B., Tazhigulova I.M., Akil’zhanova A.R., Zhumadilov Zh. Sh., Balanovsky O.P., Balanov-ska E.V. Genesis of Argyns, the largest tribal-clan group of Kazakhs, in the context of population genetics. Vestnik Moskovskogo Universiteta. Seria XXIII. Antropologia = Bulletin of Moscow University. Ser. XXIII. Antropology. 2016;4:59-68. (in Russian)

12. History of the Peoples of Uzbekistan. Tashkent, 1947. (in Russian)

13. Klyashtornyy S.G., Sultanov T.I. Kazakhstan: Record of Three Millennia. Almaty, 1992. (in Russian)

14. Kucher A.N., Danilova A.L., Koneva L.A., Nogovitsina A.N. Marriage structure of Yakut populations: ethnic composition and isonymy inbreeding. Russ. J. Genet. 2010;46(3):408-416. DOI 10.1134/S1022795410030142.

15. Lavryashina M.B., Ul’yanova M.V., Tolochko T.A., Balaganskaya O.A., Romanov A.G., Balanovska E.V. The Shors: similarities and differences among territorial groups according to the surname range and autosomal DNA markers. Vest-nik Moskovskogo Universiteta. Seria XXIII. Antropologia = Bulletin of Moscow University. Ser. XXIII. Antropology. 2011;2:66-77. (in Russian)

16. Pocheshkhova E.A., Balanovska E.V., Seregin Yu.A., Golubtsov V.I., Balanovsky O.P. Temporal dynamics of gene pool reconstructed from genealogical and surname data. Meditsinskaya Gene-tika = Medical Genetics. 2008;7(8):25-29. (in Russian)

17. Revazov A.A., Paradeeva G.M., Rusako-va G.I. Possibility of using Russian family names as a quasi-genetic marker. Genetika. 1986;22(4):699-703. (in Russian)

18. Sabitov Zh.M. Kazakh zhuzes and the Golden Horde clan system. Vestnik Evraziyskogo Natsionalnogo Universiteta im. L.N. Gumileva = Bulletin of L.N. Gumilyov Eurasian National University. 2014;3(100):201-207. (in Russian)

19. Sorokina I.N., Balanovska E.V., Churnosov M.I. The gene pool of the Belgorod oblast population. I. Differentiation of all district populations based on anthroponymic data. Russ. J. Genet. 2007; 43(6):697-704. DOI 10.1134/S1022795407060142.

20. Tasilova N. “Materials on Kyrgyz (Kazakh) Land Use ...“ - as a Source on the History of Kazakhstan (late XIX century - early XX century). Almaty, 2017. (in Russian)

21. Temirgaliev A. Volosts, uyezds... Kazakhs: with a Schematic Map of Lower Administrative and Territorial Divisions of the Kazakhs’ Residence in 1897-1915. Almaty, 2010. (in Russian)

22. Ul’yanova M.V., Lavryashina M.B., Nikolaev V.V., Oktyabr’skaya I.V., Druzhinin V.G. Native populations of the northern Altai: demographic processes of the late 19th - early 21st century as reflected in surname dynamics. Arkheologiya, etnografiya i antropologiya Evrazii = Archaeology, Ethnology, and Anthropology of Eurasia. 2014;42(3): 128-140. DOI 10.1016/j.aeae.2015.04.015. (in Russian)

23. Abilev S., Malyarchuk B., Derenko M., Wozniak M., Grzybowski T., Zakharov I. The Y-chromosome C3* star-cluster attributed to Genghis Khan’s descendants is present at high frequency in the Kerey clan from Kazakhstan. Hum. Biol. 2012;84(1):79-89. DOI 10.3378/027.084.0106.

24. Balanovsky O., Dibirova K., Dybo A., Mudrak O., Frolova S., Pocheshkhova E., Haber M., Platt D., Schurr T., Haak W., Kuznetsova M., Radzhabov M., Balaganskaya O., Romanov A., Zakharova T., Soria Hernanz D.F., Zalloua P., Koshel S., Ruhlen M., Renfrew C., Wells R.S., Tyler-Smith C., Balanovska E., Genographic C. Parallel evolution of genes and languages in the Caucasus region. Mol. Biol. Evol. 2011;28(10):2905-2920. DOI 10.1093/molbev/msr126.

25. Balanovsky O., Rootsi S., Pshenichnov A., Kivisild T., Churnosov M., Evseeva I., Pocheshkhova E., Boldyreva M., Yankovsky N., Balanovska E., Villems R. Two sources of the Russian patrilineal heritage in their Eurasian context. Am. J. Hum. Genet. 2008;82(1):236-250. DOI 10.1016/j.ajhg.2007.09.019.

26. Balanovsky O., Zhabagin M., Agdzhoyan A., Chukhryaeva M., Zapo-rozhchenko V., Utevska O., Highnam G., Sabitov Z., Greenspan E., Dibirova K., Skhalyakho R., Kuznetsova M., Koshel S., Yusupov Y., Nymadawa P., Zhumadilov Z., Pocheshkhova E., Haber M., Zalloua P.A., Yepiskoposyan L., Dybo A., Tyler-Smith C., Balanovska E. Deep phylogenetic analysis of haplogroup G1 provides estimates of SNP and STR mutation rates on the human Y-chromosome and reveals migrations of Iranic speakers. PLoS One. 2015;10(4): e0122968. DOI 10.1371/journal.pone.0122968.

27. Balaresque P., Poulet N., Cussat-Blanc S., Gerard P., Quintana-Mur-ci L., Heyer E., Jobling M.A. Y-chromosome descent clusters and male differential reproductive success: young lineage expansions dominate Asian pastoral nomadic populations. Eur. J. Hum. Genet. 2015;23(10):1413-1422. DOI 10.1038/ejhg.2014.285.

28. Barrai I., Barbujani G., Beretta M., Maestri I., Russo A., Formica G., Pinto-Cisternas J. Surnames in Ferrara: distribution, isonymy and levels of inbreeding. Ann. Hum. Biol. 1987;14(5):415-423.

29. Barrai I., Formica G., Barale R., Beretta M. Isonymy and migration distance. Ann. Hum. Genet. 1989;53(3):249-262.

30. Barrai I., Rodriguez-Larralde A., Dipierri J., Alfaro E., Acevedo N., Mamolini E., Sandri M., Carrieri A., Scapoli C. Surnames in Chile: a study of the population of Chile through isonymy. Am. J. Phys. Anthropol. 2012;147(3):380-388. DOI 10.1002/ajpa.22000.

31. Barrai I., Scapoli C., Beretta M., Nesti C., Mamolini E., Rodriguez-Larralde A. Isonymy and the genetic structure of Switzerland. I. The distributions of surnames. Ann. Hum. Biol. 1996;23(6):431-455.

32. Biro A.Z., Zalan A., Volgyi A., Pamjav H. A Y-chromosomal comparison of the Madjars (Kazakhstan) and the Magyars (Hungary). Am. J. Phys. Anthropol. 2009;139(3):305-310. DOI 10.1002/ajpa.20984.

33. Cavalli-Sforza L.L., Bodmer W.F. The genetics of human populations. San Francisco: W.H. Freeman and Co; 1971. XVI; 965.

34. Chaix R., Austerlitz F., Khegay T., Jacquesson S., Hammer M.F., Heyer E., Quintana-Murci L. The genetic or mythical ancestry of descent groups: Lessons from the Y chromosome. Am. J. Hum. Genet. 2004;75(6):1113-1116. DOI 10.1086/425938.

35. Crow J.F., Mange A.P. Measurement of inbreeding from the frequency of marriages between persons of the same surname. Eugen. Quart. 1965;12(4):199-203. DOI 10.1080/19485565.1965.9987630.

36. Dipierri J., Rodriguez-Larralde A., Alfaro E., Scapoli C., Mamolini E., Salvatorelli G., Caramori G., De Lorenzi S., Sandri M., Carrieri A., Barrai I. A study of the population of Paraguay through isonymy. Ann. Hum. Genet. 2011;75(6):678-687. DOI 10.1111/j.1469-1809.2011.00676.x.

37. Dipierri J.E., Rodriguez-Larralde A., Barrai I., Redomero E.G., Alonso-Rodriguez C., Alfaro E.L. Consanguinity by random isonymy and socioeconomic development in Argentina: a population study. J. Bio-soc. Sci. 2017;49(3):322-333. DOI 10.1017/S0021932016000444.

38. Excoffier L., Lischer H.E. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol. Ecol. Res. 2010;10(3):564-567. DOI 10.1111/j.1755-0998.2010.02847.x. PubMed PMID: 21565059.

39. Fisher R.A. The relation between the number of species and the number of individuals in a random sample of animal population. J. Anim. Ecol. 1943;12(1):42-58. DOI 10.2307/1411.

40. Herrera Paz E.F., Scapoli C., Mamolini E., Sandri M., Carrieri A., Rodriguez-Larralde A., Barrai I. Surnames in Honduras: A study of the population of Honduras through isonymy. Ann. Hum. Genet. 2014;78(3):165-177. DOI 10.1111/ahg.12057.

41. Karlin S., McGregor J. The number of mutant forms maintained in a population. Proc. 5th Berkeley Symp. Math., Stat. Prob. 1967;4: 415-438.

42. King T.E., Ballereau S.J., Schurer K.E., Jobling M.A. Genetic signatures of coancestry within surnames. Curr. Biol. 2006;16(4):384-388. DOI 10.1016/j.cub.2005.12.048.

43. King T.E., Jobling M.A. Founders, drift, and infidelity: the relationship between Y chromosome diversity and patrilineal surnames. Mol. Biol. Evol. 2009;26(5):1093-1102. Epub 2009/02/09. DOI 10.1093/molbev/msp022.

44. Koshel S.M. Geoinformation technologies in genogeography. Eds. I.K. Lure, V.I. Kravtsova. Modern Geographic Cartography. Moscow, 2012;158-166.

45. Martinez-Cadenas C., Blanco-Verea A., Hernando B., Busby G.B., Brion M., Carracedo A., Salas A., Capelli C. The relationship between surname frequency and Y chromosome variation in Spain. Eur. J. Hum. Genet. 2016;24(1):120-128. Epub 2015/04/22. DOI 10.1038/ejhg.2015.75.

46. Martrnez-Gonzalez L.J., Martmez-Esprn E., Alvarez J.C., Albar-daner F., Rickards O., Martrnez-Labarga C., Calafell F., Lorente J.A. Surname and Y chromosome in Southern Europe: a case study with Colom/Colombo. Eur. J. Hum. Genet. 2012;20(2):211-216. DOI 10.1038/ejhg.2011.162.

47. McEvoy B., Bradley D.G. Y-chromosomes and the extent of patrilineal ancestry in Irish surnames. Hum. Genet. 2006;119(1-2):212-219. DOI 10.1007/s00439-005-0131-8.

48. Mikerezi I., Xhina E., Scapoli C., Barbujani G., Mamolini E., Sandri M., Carrieri A., Rodriguez-Larralde A., Barrai I. Surnames in Albania: a study of the population of Albania through isonymy. Ann. Hum. Genet. 2013;77(3):232-243. DOI 10.1111/ahg.12015.

49. Nei M. Molecular Evolutionary Genetics. New York: Columbia University Press, 1987.

50. Piazza A., Mayr W.R., Contu L., Amoroso A., Borelli I., Curtoni E.S., Marcello C., Moroni A., Olivetti E., Richiardi P. Genetic and population structure of four Sardinian villages. Ann. Hum. Genet. 1985; 49(1):47-63.

51. Rodriguez-Larralde A., Dipierri J., Gomez E.A., Scapoli C., Mamo-lini E., Salvatorelli G., De Lorenzi S., Carrieri A., Barrai I. Surnames in Bolivia: a study of the population of Bolivia through isonymy. Am. J. Phys. Anthropol. 2011;144(2):177-184. Epub 2010/08/25. DOI 10.1002/ajpa.21379. PubMed PMID: 20740661.

52. Scapoli C., Mamolini E., Carrieri A., Rodriguez-Larralde A., Barrai I. Surnames in Western Europe: a comparison of the subcontinental populations through isonymy. Theor. Popul. Biol. 2007;71(1):37-48. DOI 10.1016/j.tpb.2006.06.010.

53. Sole-Morata N., Bertranpetit J., Comas D., Calafell F. Y-chromosome diversity in Catalan surname samples: insights into surname origin and frequency. Eur. J. Hum. Genet. 2015;23(11):1549-1557. Epub 2015/02/18. DOI 10.1038/ejhg.2015.14.

54. Tarskaia L., El’chinova G.I., Scapoli C., Mamolini E., Carrieri A., Rodriguez-Larralde A., Barrai I. Surnames in Siberia: a study of the population of Yakutia through isonymy. Am. J. Phys. Anthropol. 2009;138(2):190-198. DOI 10.1002/ajpa.20918.

55. Zei G., Guglielmino C.R., Siri E., Moroni A., Cavalli-Sforza L.L. Surnames as neutral alleles: observations in Sardinia. Hum. Biol. 1983; 55(2):357-365.

56. Zhabagin M., Balanovska E., Sabitov Z., Kuznetsova M., Agdzhoyan A., Balaganskaya O., Chukhryaeva M., Markina N., Romanov A., Skhalyakho R., Zaporozhchenko V, Saroyants L., Dalimova D., Davletchurin D., Turdikulova S., Yusupov Y., Tachigulova I., Akil-zhanova A., Tyler-Smith C., Balanovsky O. The connection of the genetic, cultural and geographic landscapes of Transoxiana. Sci. Rep. 2017;7. DOI 10.1038/s41598-017-03176-z.


Review

Views: 1855


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2500-3259 (Online)