Network of large pedigrees reveals social practices of Avar communities

Gnecchi-Ruscone, Guido Alberto; Rácz, Zsófia; Samu, Levente; Szeniczey, Tamás; Faragó, Norbert; Knipper, Corina; Friedrich, Ronny; Zlámalová, Denisa; Traverso, Luca; Liccardo, Salvatore; Wabnitz, Sandra; Popli, Divyaratan; Wang, Ke; Radzeviciute, Rita; Gulyás, Bence; Koncz, István; Balogh, Csilla; Lezsák, Gabriella M.; Mácsai, Viktor; Bunbury, Magdalena M. E.; Spekker, Olga; le Roux, Petrus; Szécsényi-Nagy, Anna; Mende, Balázs Gusztáv; Colleran, Heidi; Hajdu, Tamás; Geary, Patrick; Pohl, Walter; Vida, Tivadar; Krause, Johannes; Hofmanová, Zuzana

doi:10.1038/s41586-024-07312-4

Download PDF

Article
Open access
Published: 24 April 2024

Network of large pedigrees reveals social practices of Avar communities

Nature (2024)Cite this article

15k Accesses
1 Citations
515 Altmetric
Metrics details

Subjects

Abstract

From ad 567–568, at the onset of the Avar period, populations from the Eurasian Steppe settled in the Carpathian Basin for approximately 250 years¹. Extensive sampling for archaeogenomics (424 individuals) and isotopes, combined with archaeological, anthropological and historical contextualization of four Avar-period cemeteries, allowed for a detailed description of the genomic structure of these communities and their kinship and social practices. We present a set of large pedigrees, reconstructed using ancient DNA, spanning nine generations and comprising around 300 individuals. We uncover a strict patrilineal kinship system, in which patrilocality and female exogamy were the norm and multiple reproductive partnering and levirate unions were common. The absence of consanguinity indicates that this society maintained a detailed memory of ancestry over generations. These kinship practices correspond with previous evidence from historical sources and anthropological research on Eurasian Steppe societies². Network analyses of identity-by-descent DNA connections suggest that social cohesion between communities was maintained via female exogamy. Finally, despite the absence of major ancestry shifts, the level of resolution of our analyses allowed us to detect genetic discontinuity caused by the replacement of a community at one of the sites. This was paralleled with changes in the archaeological record and was probably a result of local political realignment.

Extensive pedigrees reveal the social organization of a Neolithic community

Article Open access 26 July 2023

Kinship practices at the early bronze age site of Leubingen in Central Germany

Article Open access 16 February 2024

Kinship practices in the early state El Argar society from Bronze Age Iberia

Article Open access 27 December 2022

Main

The kinship practices and social organization of past societies are hard to assess using only the fragmentary archaeological and historical information that has survived to modern times. Biological relatedness does not necessarily correspond to social kinship, but it can nevertheless provide a powerful tool to infer elements of past kinship practices. Ancient DNA has been used for pedigree inference^3,4,5, but being able to capture the extent of relationships in ancient populations requires a sampling approach that is focused on entire cemeteries of considerable size⁶. Only multiple observations of the same type of relatedness structure can exclude a random occurrence and indicate a reliable pattern. Archaeological contextualization adds social meaning and can disentangle the complex interplay between biological relatedness and human behaviour to help researchers to infer kinship practices on a larger scale.

From the late sixth century ad to the early ninth century, the Avars were the dominant power in eastern central Europe^1,7. Originating from eastern central Asia, probably from the Rouran khaganate destroyed by the Turks, the Avars’ core group of mounted steppe warriors and their families arrived north of the Caucasus in ad 557–558, where further groups joined the march into the Carpathian Basin in 567–568 (refs. ^1,7). This region became the centre of the Avar empire, where they settled among a diverse population derived from the previous Roman period followed by the Gepid and Longobard kingdoms^1,8. After extensive raids into the Byzantine Balkans ended in ad 626, the Avar society changed in many ways. The archaeological record indicates that a sedentary lifestyle in new, stable settlements emerged, with larger cemeteries containing hundreds of graves, and cultural expressions became more homogeneous⁹. The Avar realm persisted until it was overcome by the Frankish armies of Charlemagne in around ad 800. Turkic titles of rank (such as khagan, iugurrus, tudun and tarkhan) mentioned in written sources document that the central Asian character of the political structure was maintained until the end of Avar rule¹. In terms of social structure, patrilineal organization is the norm for Eurasian pastoralist steppe peoples², but we were unable to investigate the social practices of the Avars until now owing to a lack of historical sources (Supplementary Information).

By generating new genomic data (Supplementary Table 1) from the exhaustive sampling of four fully excavated cemeteries from present-day Hungary, combined with new isotope data and detailed archaeological and anthropological characterization, we aimed to investigate the population structure, kinship and social organization of these communities at a high level of resolution. We identified 298 biologically closely related individuals that allowed us to reconstruct extensive pedigrees and build networks of distant relatedness across the Great Hungarian Plain. We found striking evidence of recurrent patterns that allowed us to trace kinship and social practices, gain insights into the mobility of men and women and refine the chronology of the sites. In the largest cemetery, we were able to identify a community replacement associated with changes in the archaeological record and dietary habits, suggesting local political realignment. This replacement was not accompanied by an ancestry shift and was detected only by changes in the biological relatedness pattern.

Analysis of entire cemeteries

The Great Hungarian Plain was the main settlement area for steppe populations during the Avar period. We chose four cemeteries to cover equally its two main regions, divided by the river Tisza: the Transtisza region (TT) east of the river and the Danube–Tisza interfluve region (DTI) to the west (Fig. 1). The DTI was the power centre of the khaganate (the Avar empire), where burials of the highest Avar elite (for example, Kunbábony) were found, and these burials were also investigated in a previous genomic study¹⁰. From this region, we sampled the site of Kunpeszér (KUP; 33 burials), comprising an early Avar elite cemetery group with rich grave goods, exquisite gold- and silver-decorated swords, belts and jewellery, and a second cemetery group of poor late Avar burials¹¹. The site of Kunszállás (KFJ; 63 burials) from the same region, founded in the mid-seventh-century ad, already belongs to the more-uniform material culture of the middle and late Avar period¹². The TT region is well known for burial customs associated with the steppe, such as the placement of animals, animal skins or horse implements near the deceased. We chose Rákóczifalva (RK) because it is one of the region’s largest cemeteries (279 out of 308 graves sampled) that was continuously occupied from around ad 570 to the mid-ninth century¹³. From RK we additionally sequenced 56 individuals from the second to the sixth century to extend the available data on pre-Avar periods and to capture the Avar-period transition on a local scale. The cemetery of Hajdúnánás (HNJ; 18 burials) was selected to cover the northern section of the TT region^14,15 (Supplementary Information; Supplementary Table 2).

**Fig. 1: Map of the Great Hungarian Plain showing the locations of the four sites analysed in this study.**

After quality controls (Methods; Supplementary Table 1) we obtained genome-wide (around 1,240,000 single nucleotide polymorphisms (SNPs); Methods) data for 424 individuals with an average coverage of 2.6×. Furthermore, we produced new strontium, carbon and nitrogen isotope data (⁸⁷Sr/⁸⁶Sr; δ¹³C and δ¹⁵N; Supplementary Table 3) for 154 individuals from RK, KUP and KFJ, and 57 new radiocarbon dates for RK (Supplementary Tables 2 and 6).

Pedigrees: strict patriliny within sites

To reconstruct the pedigrees, we estimated close biological relatedness using recently published software, KIN (ref. ¹⁶), which was designed to identify first-, second- or third-degree related individuals (defined as close genetic relatedness) in low-coverage ancient DNA (Methods). We found no close genetic relatedness between sites, but most individuals in each site were closely related, constituting a total of 373 pairs of first-degree (235 parent–child and 138 siblings) and more than 500 pairs of second-degree relatives (Supplementary Table 4). Such a large number, especially of first-degree pairs, allowed us to reconstruct a total of 31 pedigrees of varying sizes, ranging from 2 to 146 individuals (Fig. 2a, Extended Data Figs. 1–3 and Supplementary Figs. 6, 10 and 15). These extended genealogies show a strict patrilineal descent with almost no exceptions. This finding provides compelling evidence for patrilocality and female exogamy, which explains the striking difference in Y-chromosome and mitochondrial DNA (mtDNA) diversity observed among related individuals (Extended Data Fig. 4).

**Fig. 2: The largest set of interconnected pedigrees reconstructed in RK and the cemetery map highlighting the burial location of related individuals.**

Within RK, 202 individuals had at least one close relative at the site and only 64 were unrelated. Among the related individuals, 146 formed an extended ‘macro’ pedigree spanning up to nine continuous generations. We divided this into five connected pedigrees (numbered 1 to 5; Methods) descended from 11 founding male individuals. A further 34 individuals formed 4 additional multigenerational pedigrees (numbered 6, 7, 8 and 12) chronologically dated to the early Avar period (Fig. 2a), and the rest formed smaller units (Supplementary Figs. 15 and 16).

**Fig. 3: Network analysis of ancIBD haplotype-IBD sharing between Avar-period individuals.**

Adults (more than 18 years old) represent 83% of the whole RK cemetery, with nearly equal numbers of male and female individuals (Table 1). However, the RK pedigrees contain twice as many male individuals as female ones. This male bias is due to a higher ratio of sons to daughters: we found 102 sons (77 adults and 25 subadults) and 20 daughters, mostly subadults (5 adults and 15 subadults). A strict patrilineality can be observed from the descent structure of the pedigrees. Only one adult daughter (RKC024) has offspring buried in the cemetery, and her son (RKC012) is second-degree related to other members of the pedigree through the missing father (Supplementary Table 4). Consequently, RKC024 and her missing partner are sixth-degree related. All the other mothers lack parents at the site and are considered exogamous partners. Instead, all the fathers are descendants of the founding male individual(s) of their respective pedigree, with no exceptions (Fig. 2a). The founding role in the life of the communities may have been especially important². In several cases, founding male individuals (or, in the case of brothers, one of the male individuals) were buried with valuable grave goods considered status symbols: horse harnesses and belt sets in the early Avar period and belt sets in the middle and late periods (Supplementary Fig. 10).

Table 1 RK Individuals with genetic data and age at death information

Full size table

The KFJ pedigree presents the same pattern of genetic relatedness (Extended Data Fig. 1). Here, of the 45 individuals forming the second-largest pedigree, 21 are sons (10 of whom are subadults) and 13 are daughters (11 subadults). We observe a similar tendency in the small KUP and HNJ cemeteries, although with fewer individuals (Extended Data Figs. 2 and 3).

These patterns are reflected in the striking difference between female lines (mtDNA haplogroups) and male lines (Y-chromosome haplogroups). Only two Y-chromosome lineages, J1a-Z2317 and J2b-CTS11760 (J1a and J2b hereafter), are found in pedigrees 1–8 and 12 in RK, compared with around 50 different mtDNA haplogroups; only one Y-chromosome lineage, N1a-Y16220 (N1a hereafter) is found in both KFJ and KUP among related and unrelated individuals (compared with around 20 mtDNA haplogroups); and another, Q1a-L715 (Q1a hereafter), is shared between all male individuals of HNJ pedigree 1 (Extended Data Fig. 4).

A comparison of pedigrees with the spatial arrangement of graves and grave groups allows us to assess how much biological and social relatedness correspond, and demonstrates that the concept of descent was central to the organization of the burial site. With few exceptions, all individuals from the same pedigree are found in the same burial cluster (Fig. 2b).

In terms of closer-descent units, we discovered that parents, infants, juveniles and even adult male siblings were buried near each other, forming clusters of close relatives (Supplementary Fig. 9). Within these groups we often find unrelated female individuals. In fact, there is a strong sex bias among the 64 unrelated individuals in RK, because 51 are female and only 13 are male. Most of these female individuals are young adults. Male individuals have a more balanced age distribution, and among related female individuals, older adults are more frequent (Supplementary Fig. 11). The age distribution, position in the pedigree clusters, chronology, burial customs and grave goods suggest that the unrelated female individuals are likely to be exogamous partners of lineage male individuals who had not yet reproduced, or whose children were not found at the site. Therefore, they are not detected as biologically related but could still be part of the social unit.

On the basis of the pedigrees, we can speculate that the beginning of the reproductive age for women was 18–20 years. The youngest mothers were 18–22 years old at death, whereas the youngest fathers were 24–29 years old at death. This is consistent with the observation that juveniles are buried next to their parents (female individuals of 16–19 and male ones of 18–22 years old at death), and lineage female individuals disappear from the pedigrees at late juvenile–early adult age.

Marriage strategies and levirate unions

Another consistent pattern between sites is that male and female individuals often had multiple reproductive partners. In RK only, we discovered 15 cases involving a male partner and 7 cases involving a female one (Supplementary Information). Male individuals had two partners in ten cases, three partners in four cases, and four partners in one case (RKF042); around 85% of these individuals are older men (aged 35–59). The young ages of female partners at death may indicate serial monogamy (RKC011), but the presence of older female partners in multiple partnerships suggest polygyny (RKF042 and RKF180). Multiple reproductive partners were also discovered in HNJ and KFJ (one and four cases, respectively). That means that polygyny might not have been restricted to the highest stratum of society that is known from the historical sources, but also occurred in the general population¹.

We also identified multiple cases (five in RK and two in KFJ) and, through indirect evidence, another case in KUP, of closely related male individuals having offspring with the same female partner: three pairs of fathers and sons, two pairs of full brothers, one pair of paternal half-brothers and one pair of paternal uncle and nephew (Extended Data Figs. 1 and 2). We assume that these unions were levirate matches (although in some cases concurrent polyandry cannot be excluded; Supplementary Information). Even though the word levirate has a biblical origin¹⁷, in historical and anthropological research the term has a wider application referring to marriages between a widow and an agnate of the deceased. Often found in pastoral societies, which are patrilocal, patrilineal and observe female exogamy, the levirate custom was common in Central Asia and the Caucasus until recent times^18,19. Although not mentioned for the Avars, levirate partnerships are attested to in contemporary written sources for several steppe peoples^20,21, which suggests that what we find in the pedigrees is probably formal levirate, not extramarital relationships.

According to the sources²², no levirate union could occur if the deceased’s agnate was related to the widow by blood. Indeed, we find no cases of biological consanguinity, based on the absence of long runs of homozygosity (ROH) segments in all analysed individuals (Extended Data Fig. 5). We do not even detect ROH patterns consistent with more-distant consanguineous unions, such as at the level of second-degree cousins, despite a high occurrence of levirate and multipartner unions. Among Eurasian steppe peoples, intermarriage within the paternal line was permitted only after a certain number of generations, which could range between five and nine^20,21. Such rules would explain the absence of even distant biological consanguinity. It is intriguing that the only case we detected of reproductive partners being related was to the sixth degree (which would still be consistent with such rules) and involves the only non-exogamous female individual in RK. This further suggests the uniqueness of this single case.

All the aforementioned phenomena lead us to assume that the segment of Avar society we investigated had a structure comparable to that of Eurasian pastoralist steppe people^2,21: the elementary social unit is the patrilineally organized family. Patrilineal genealogies are the constitutive elements of the society and, within them, descent lines are traced and ranked according to the birth order of the male founders. This concept results in a strictly hierarchical structure in the smaller, as well as in the larger, units of society, as evidenced in the archaeological material by various status indicators (Supplementary Fig. 10). We can consider as a contemporary parallel to Avar society the old Turkic kinship system that has been reconstructed on the basis of the Orkhon inscriptions, which date from the eighth century (Supplementary Information).

Community links through female exogamy

We observe that exogamous female individuals have a central role in connecting the different founding patrilines both within RK and between the sites. One unique case is represented by the female individual RKF140, who is part of two different levirate unions and had a total of four reproductive partners from two different pedigrees, linking the two large patrilineal units of the middle–late Avar period pedigrees (3 and 4–5). In fact, most of the large RK pedigrees are connected through female lines: one missing first-degree-related female individual (sister or mother) connects pedigrees 1 and 2, and two maternal second-degree relatives connect pedigrees 2 and 3.

The role of exogamous female individuals becomes even more evident when analysing the patterns of pairwise ancIBD haplotype-IBD (identical-by-descent) sharing within and between individuals from the four Avar sites (Fig. 3). In the network analyses of IBD sharing (Fig. 3a), we can observe tight clusters, reflecting the close genetic relatedness, expectedly within the large pedigrees of RK and KFJ. In the adults-only network (Fig. 3b), we estimate that many female individuals plot outside each site’s cluster and have significantly fewer IBD connections than do male individuals, reflected by the lower degree centrality distribution, corresponding to the number of connections each individual has (P < 0.05 using the Kolmogorov–Smirnov test after 1,000 permutations; Fig. 3c). Female individuals instead show significantly higher ratios of connections between sites (P < 0.05 using Welch’s t-test; Fig. 3c). Furthermore, we found seven cases of female individuals (and no male ones) who are unrelated within site presenting IBD connections with another site (Fig. 3a,b). Taken together, our evidence shows the existence of networks of communities centred tightly around a patriline and related to other communities by exogamous female individuals.

We included data from previously published Avar-period sites^10,23 in the IBD network, although there is a bias in sampling strategies between our entire-cemetery-sampling approach and previous sparse-sampling approaches in which only a few individuals per site were analysed, preventing us from observing the full extent of the connections (Extended Data Fig. 6a).

Nevertheless, it is possible to observe geographic structuring, because we find more intra- than inter-regional connections among the DTI and TT sites. Furthermore, the two nearby sites of Hortobágy-Árkus and HNJ (which are about 50 km apart) are particularly highly connected and share the same Y haplogroup (Extended Data Fig. 6a). We also observe that DTI male individuals with the N1a Y haplogroup tend to cluster together. This lineage is not only common between KUP and KFJ sites but is also shared among the early Avar-period DTI elite sites. Interestingly, the supposedly highest-status individual among these sites (the solitary burial of Kunbábony, which was interpreted as a possible khagan burial on the basis of rich status symbols found in the grave⁸) has the highest number of between-sites IBD connections among all of the new and previously published individuals analysed (Extended Data Fig. 6b).

Realignment of local power

Archaeologically, RK spans the whole Avar period. By incorporating the relative chronological framework provided by the generations of the pedigrees in the Bayesian modelling of ¹⁴C dates, which reduces the uncertainties of the dates by up to 60% (ref. ²⁴), we were able to refine the start and end events of three large pedigrees (with a maximum span of around 300 years) and place them in a relative order (Supplementary Fig. 43, Supplementary Table 6 and Supplementary Information). Integrating chronology and pedigrees allows us to observe a shift in the local community in the second half of the seventh century. First, ten smaller pedigrees are found in the early Avar period, but only three connected ones dominate in the middle and late phases (Fig. 2). Strikingly, the J1a male lineage is mostly found in the early pedigrees, whereas the J2b haplogroup appears and becomes the predominant male lineage in the later ones (pedigrees 3, 4 and 5 all carry haplogroup J2b). We can clearly pinpoint when this shift occurs: going from pedigree 2 to pedigree 3, through the connection between the two maternal half-brothers, with one from pedigree 2 carrying J1a and one from pedigree 3 carrying J2b. In fact, pedigree 2 is the only one spanning from the early to the late phase, continuing the only two remaining J1a descent lineages after this shift. The haplotype IBD network in RK shows an even clearer pattern, indicating the community shift, because all of the different J1a-carrying pedigrees and the J2b pedigrees 3, 4 and 5 share more IBDs within them than between them, forming two clearly distinct clusters separating J1a and J2b male individuals (Fig. 3a).

Interestingly, one generation above (generation 4, dating to the middle Avar period), in pedigree 2 there are 12 related male individuals, of whom only three had children buried at the site. All the remaining male individuals except two juveniles (aged 18–22 and 15–17) were adults with no children found in the cemetery. This evidence further supports the replacement of the patriline in the community buried in RK. No associated skeletal traumas were observed in these individuals, so the shift in the male lineage cannot be clearly attributed to an act of violence. In fact, all of them are buried close to unrelated female individuals, who were potentially their exogamous partners, suggesting that the change of community occurred in the following generation of their children not buried at site. Given the strong patrilineality observed in all the sites we analysed, this change must have had strong social implications.

This shift mirrors the archaeological evidence (Fig. 2b and Extended Data Fig. 7). First, the western grave group of the RK site is made up of the large pedigree 2 and the several smaller pedigrees from the early Avar period. This part of the burial place was abandoned, and it is conceivable that many of the descendants of pedigree 2 left at that time. Except for a few scattered early burials, the central part of the cemetery was then established in the middle Avar period by the founders of pedigrees 4–5, along with the eastern part, which is mostly composed of pedigree 3 and the latest group of individuals from pedigree 2, who are the ones biologically related to pedigree 3. New burial customs, such as wooden grave constructions, distinguish the graves of newly settled families, whereas the old ones, such as burial with a horse harness or a pot next to the deceased person’s head, were phased out (Fig. 2b and Extended Data Fig. 7).

In RK, we also found significantly higher δ¹³C and lower δ¹⁵N values in the early Avar period than in the subsequent phases (Supplementary Fig. 25). Especially during the early phase, the carbon isotope data revealed a gradual change in dietary composition from substantial contributions of a C₄ component, which was probably millet, a primary staple crop in Eastern Asia²⁵, to the predominance of C₃ plants. Although millet was also consumed in the subsequent phases, individuals with outstandingly high δ¹³C values are lacking in the middle and late Avar periods. The higher δ¹⁵N values in the later phases indicate an increase in the consumption of meat and dairy products. However, largely overlapping ranges indicate that this affected only some of the individuals (Supplementary Information and Supplementary Figs. 25 and 26). Starting in the early phase, but especially in the middle and late phases, we observe a number of burials of male individuals with outstandingly high δ¹⁵N values (Supplementary Figs. 25 and 26).

Taken together, these findings indicate that there was a replacement of the community buried, and thus likely to be living, in RK during the middle Avar period. Although the ancestry of the individuals and the descent system before and after the shift did not change (Fig. 4), the succeeding community differed in its burial customs and dietary habits. It is noteworthy that the HNJ and KFJ cemeteries were established in this later period, implying that larger transformations occurred in the Carpathian Basin in the second half of the seventh century^9,26.

**Fig. 4: PCA for newly sequenced Avar-period individuals.**

Steppe descent communities in Europe

Population-genomic analyses (Fig. 4, Extended Data Fig. 8, Supplementary Information and Supplementary Figs. 12–14) confirm that the four cemeteries belonged to communities with steppe descent. Most (88%) of the individuals carry portions of a northeast-Asian ancestry profile that is ultimately traceable to the eastern Eurasian Steppe^10,23,27 with varying degrees of admixture with western Eurasian sources. The northeast-Asian ancestry ranges from a median of about 100% in the DTI site of KUP to just 32% in the TT site of RK (Extended Data Fig. 8c and Supplementary Information). Independent evidence from the admixture modelling of qpWave/qpAdm and admixture dating of DATES reveals a process of continuous admixture between western and eastern sources over centuries that largely predates the Avar period, and therefore the arrival of these populations in the Carpathian Basin, and presumably took place in the steppe (Extended Data Figs. 8 and 9 and Supplementary Information). These analyses suggest that the post-arrival admixture with the local contemporary (post-sixth century) Carpathian Basin population was around 20% (Extended Data Figs. 8 and 9 and Supplementary Information).

Despite these clear patterns confirming a recent steppe origin for these populations, the strontium-isotope compositions (⁸⁷Sr/⁸⁶Sr) were largely similar, with values consistent with local and regional variations²⁸ (Supplementary Fig. 23). The datasets from KUP and KFJ were isotopically indistinguishable from one another, whereas the burials at RK yielded substantially more radiogenic strontium, indicating a small variation of the local baseline values (Supplementary Information). This homogeneity indicates that although local mobility (within the Great Hungarian Plain, for example) is plausible, migration between distant areas (such as across the Eurasian Steppe) is unlikely because they would always need to be isotopically indistinguishable. This implies that, with one potential exception (Supplementary Information and Supplementary Fig. 23), the first generation of migrants was not buried at the cemetery, and that there was high regional continuity across the Avar period.

The density of sampling allowed us to uncover a pattern of geographic structuring of genomic ancestry that went unnoticed in previous studies. This is most evident between the DTI sites (KUP and KFJ) and the TT site of RK, which cover the whole Avar period. However, their admixture profiles remain largely distinguishable and non-overlapping (Fig. 4b). In fact, although KFJ has high amounts of northeast-Asian ancestry in the late Avar period, RK individuals, even in the early phase, carry admixed ancestries, 95% of which were dated pre-arrival (Extended Data Fig. 9). This is in line with the observation of higher IBD sharing within DTI and TT than between the two regions (Extended Data Fig. 6a). These differences mirror cultural differences found in the archaeological record. Several features of the TT sites, especially at RK (Supplementary Information), are strikingly similar to those of nomadic burials of the sixth to seventh century in the Pontic Steppe, known as the Sivashovka horizon²⁹ (Supplementary Fig. 3). Instead, some cultural elements of the elite of the DTI area, which includes the early graves of the KUP site, can be traced back to the eastern Eurasian Steppe^30,31,32.

In conclusion, we confirm the arrival and establishment in the Carpathian Basin of entire communities of steppe descent. We reveal that genetically and culturally distinct steppe communities settled in the area and, despite some admixture with the local population, remained distinct during the course of the Avar period. This substantial post-arrival genetic continuity, together with striking isotopic homogeneity over time, poses a challenge to the long-lasting archaeological hypothesis^1,33 that there were successive large-scale migrations from the steppe, indicating instead a pattern of local, small-distance mobility once settled.

Conclusions

The reconstruction of extended multigenerational pedigrees from four Avar-period sites indicates a consistent reproductive strategy based on patrilineal descent, patrilocality, female exogamy, strict avoidance of consanguinity, and, in several cases, multiple reproductive partners and the practice of what seems to have been levirate unions. We found indications that social and biological relatedness overlapped to a large degree, because patterns of biological relatedness corresponded to the spatial distribution of the graves and grave goods. These social practices survived political changes, shifts in lifestyle reflected in material culture, dietary changes, and interactions with the local population from the late sixth century to the early ninth century ad. Descent units were strictly organized around patrilines but on a larger scale were connected by exogamous female individuals, and these connections may have been one of the main cohesive elements of Avar society. Mostly small pedigrees, of two to four generations, were found in the early phase, and larger ones, of four to seven generations, started in the mid-seventh century. This change reflects the increasing size of cemeteries and settlements since the middle Avar period and the development of the early medieval settlement system in the Carpathian Basin. The largest site we analysed (RK) experienced a community shift in the second half of the seventh century, which was probably caused by a realignment of local power, but it had no effect on the social organization or general ancestry patterns. Detecting this shift required the reconstruction of a biological-relatedness network of the entire cemetery and shows that genetic continuity at the level of ancestry might still conceal the replacement of whole communities.

Methods

Ancient-DNA laboratory analyses

For the archaeogenetic investigations, petrous bones and teeth were preferentially sampled (Supplementary Table 1). Samples were prepared in dedicated ancient-DNA laboratory facilities at the HUN-REN RCH Institute of Archaeogenomics in Budapest. Sample surfaces were decontaminated using UVC light and cleaned by mechanical removal. About 25–50 mg bone powder was obtained by drilling or powdering and transferred to MPI-EVA in Leipzig, Germany. DNA extraction and subsequent laboratory steps were done in the Ancient DNA Core Unit of the MPI-EVA. DNA was extracted from between 25 mg and 52 mg of powdered sample material using a silica-based method optimized for the recovery of short DNA fragments³⁵. Briefly, lysates were prepared by adding 1 ml extraction buffer (0.45 M EDTA, pH 8.0, 0.25 mg ml^–1 proteinase K, 0.05% Tween-20) to the sample material in 2.0-ml Eppendorf LoBind tubes and rotating the tubes at 37 °C for approximately 16 h^35,36. Using an automated liquid-handling system (Bravo NGS Workstation B, Agilent Technologies), DNA was purified from 150 µl lysate using silica-coated magnetic beads and binding buffer D, as described previously³⁶. Elution volume was 30 µl. Extraction blanks without sample material were carried alongside the samples during DNA extraction.

DNA libraries were prepared from 30 µl extract using an automated version of single-stranded DNA-library preparation³⁷ described in detail previously³⁸. Escherichia coli uracil–DNA–glycosylase (UDG) and E. coli endonuclease VIII were added during library preparation to remove uracils from the interior of molecules. Libraries were prepared from both the sample DNA extracts and the extraction blanks, and further negative controls (library blanks) were added. Library yields and efficiency of library preparation were determined using two quantitative PCR assays³⁸. Libraries were tagged with pairs of sample-specific indices by PCR extension using AccuPrime Pfx DNA polymerase as described previously³⁸. Indexed libraries were amplified and purified using SPRI (solid-phase reversible immobilization) technology³⁹ as described previously³⁸.

Sample and control libraries were enriched in solution for 1,237,207 informative SNPs (a method commonly used in the field and known as 1240k capture⁴⁰) targeting 394,577 SNPs first reported in ref. ⁴¹ (390k panel) and 842,630 SNPs first reported in ref. ⁴² (840k panel). Two consecutive rounds of 1240k capture were performed using the Bravo NGS workstation B. Up to 20 libraries were pooled together and sequenced single-read or pair-read on a HiSeq4000 sequencing platform (Illumina Technology). In total, 440 1240k-enriched libraries were sequenced and an average coverage of 2.6× (median 2.25×) for the 1,237,207 sites in the genome, corresponding to a median of 708,514 1240k SNPs covered at least once (Supplementary Table 1).

Ancient-DNA data process and quality controls

The raw sequenced read data (fastq files) were processed through a nf-core/eager v.2.3.2 pipeline⁴³ (https://nf-co.re/eager). To remove adaptors and short reads of less than 30 base pairs, AdapterRemoval v.2.3.1 was used⁴⁴. The reads were then mapped to the Human Reference Genome Hs37d5 using the bwa v0.7.17 aln/samse alignment algorithm⁴⁵ with the parameters -n and -l set to 0.01 and 1,024, respectively. The reads with phred mapping quality of less than 30 were then discarded using -q (q30-reads) in Samtools v1.9 (ref. ⁴⁶). We then used the Picard tools MarkDuplicates function (https://github.com/broadinstitute/picard) to remove PCR duplicates. To estimate the amount of cytosine-to-thymine taphonomic deamination at the ends of the mapped fragments, we used mapDamage v.2.0 (ref. ⁴⁷) run on a subset of 100,000 q30 reads. Exogenous human autosomal DNA contamination was estimated in male individuals by assessing X-chromosome heterozygosity levels using ANGSD v.0.910 (ref. ⁴⁸) and mtDNA contamination in male and female individuals was estimated using Schmutzi⁴⁹. Schmutzi was also used to reconstruct the consensus mitochondrial genome sequence of each individual used as input for HaploGrep2 (ref. ⁵⁰) to assign mitochondrial haplogroups. For the purpose of graphical representation in Extended Data Fig. 4, all the mitochondrial haplogroups were pruned to the first three characters. If two individuals had, respectively, a two- and three-characters resolution, both of their haplogroups were trimmed to the first two characters. Individuals with only a one-character resolution were excluded from the plot.

Y-chromosome haplogroups were inferred using two different methodologies and the results compared. The Y-chromosome variants were called from in the bam files from samples whose genetic sex was estimated to be male or unassigned using the Samtools v1.9⁴⁶ mpileup and PileupCaller (https://github.com/stschiff/sequenceTools) using the mode --majorityCall; Y-chromosome haplogroup assignment was performed using the software yHaplo (https://github.com/23andMe/yhaplo), with ISOGG panel v.11.349 as a reference (https://isogg.org/tree/; date of access: 2 February 2023). Y-chromosome haplogroups were also defined using the Y-Lineage-Tracker subcommand ‘classify’⁵¹, using as a reference panel the ISOGG Y-haplogroup tree v.15.73 (https://isogg.org/tree/); in this case the input files were genotypes from each individual, estimated using the allelePresence method from the ATLAS (https://bitbucket.org/wegmannlab/atlas/)⁵² call tool, accounting for post-mortem damage patterns and base-score recalibration patterns, estimated respectively with the ATLAS tools PMD and recal.

The results from the two methodologies were then compared, taking into account the differences between the two reference panels. In cases where the two methodologies yielded deeply diverging results (that is, to the first two ISOGG alphanumeric classification symbols) or were discordant with the estimated reciprocal genetic relatedness between individuals (described in the Biological relatedness section), the haplogroup assessment was further investigated using the software pathPhynder⁵³ with default options, using as reference the BigTree Y-chromosome dataset and the reference phylogenetic tree for sample placement provided by GitHub with the software and as input files the bam files filtered for phred mapping quality more than 30. In any other case, the conservative results from Y-LineageTracker (the column Key haplogroup) were considered reliable, given the more-stringent estimation of the genotypes and the updated ISOGG Y-chromosome phylogenetic tree version.

The results of the whole procedure can be found in Supplementary Table 1. PileupCaller (https://github.com/stschiff/sequenceTools) was used to carry out genotype calling from the q30 reads with the --randomHaploid flag that calls haploid genotypes by randomly choosing one high-quality base (phred base quality score ≥30) on the 1240k panel (pseudodiploid calls). We also used the --singleStrandMode, which removes only real cytosine-to-thymine deamination observed with single-stranded DNA libraries by ignoring cytosine–thymine polymorphisms at reads aligning to the forward strand and guanine–adenine polymorphisms at reads aligning to the reverse strand.

To produce the Y-chromosome haplogroup plots in Extended Data Fig. 4, all the haplogroup nomenclature was pruned to the first three characters; haplogroups with less than three characters of ISOGG notation were excluded from the plots. Complete Y-chromosome haplogroups can be found in Supplementary Table 1.

We found low mitochondrial contamination estimates (Supplementary Table 1). Most were less than 5% and only five samples had values between 5% and 10%. Of these we excluded one female individual (RKF048) with 7% contamination and one individual (KFJ019) with 5% contamination and ambiguous sex determination (an indirect sign of possible contamination); the remaining male individuals had low nuclear contamination and were therefore kept for nuclear genomic analyses. We also found low nuclear contamination estimates among the male individuals. We excluded four further individuals with values of more than 7%; RKF094 (15% contamination) was still counted among the related as showing high likelihood of close genetic relatedness with other individuals (Supplementary Table 4). We also excluded individuals with particularly low coverage (more than 20,000 SNPs) because they were not practically usable for further analyses (additional filtering for higher coverage thresholds is detailed for specific analyses in the following sections); these include two individuals also excluded for contamination and another 15 individuals still included as showing high likelihoods of close relatedness (RKF225, HNJ005, HNJ009 and RKF128). We kept 419 individuals for further analyses, 413 excluding one pair among the identical pairs found, and 424 including the previously published individuals from the KUP and KFJ sites¹⁰ (Supplementary Table 1). We then merged them with a reference genome-wide panel of 2,280 modern individuals genotyped with microarray technology using the commercial HumanOrigins chip^54,55,56 and previously published ancient-individuals’ genotypes sequenced with the same 1240k capture method or a 1240k SNPs subset from data obtained using whole-genome shotgun sequencing^{10,27,54,55,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74} downloaded from Poseidon (https://poseidon-framework.github.io). We produced two datasets, one including the modern data and the SNPs overlap between the 1240k sites and the HumanOrigins SNP chip (1240KHO dataset, around 600,000 SNPs), and one with ancient data and the whole 1240k panel (the 1240k dataset).

Genomic ancestry modelling with PCA, qpWave/qpAdm, DATES

We used principal component analysis (PCA) with smartpca v.16000 in the EIGENSOFT v.6.0.1 package⁷⁵ on the 1240KHO dataset using the lsqproject and the autoshrink parameters to project the genotypes of the ancient individuals (containing variable amounts of missing data) on top of the principal components calculated on the set of modern worldwide populations. For one PCA (Fig. 4a) we used a subset of Eurasian populations (the Eurasian PCA) as originally in reference⁵⁴ adapted as in reference²⁷, and for another PCA (Extended Data Fig. 8b) we used a standard subset of only west Eurasian populations (the west Eurasian PCA), as originally reported⁷⁶ and then adapted¹⁰.

We used the software qpWave/qpAdm (v.1520) of the ADMIXTOOLS package⁵⁶ to run the f₄-statistics-based ancestry analyses on the 1240k dataset^41,77. Standard errors for the computed f-statistics were estimated using a block jack-knife with a 5-cM block. We used the default allsnps: NO parameter, thereby calculating all the underlying f₄-statistics using the SNP overlap between all the groups for each test. We used a set of outgroups (or right populations) that are similar to those of a previous study¹⁰ that included representatives of ancient Eurasian lineages (European Mesolithic hunter-gatherers, European/Anatolia Neolithic, Levant Neolithic, Iranian Neolithic for western Eurasia, and ancient North Eurasian lineage (ANE⁷⁶), ANA, ancient Siberian and southern East Asia for eastern Eurasia, and key non-Eurasian ones (African, South Asian, Native American) when available, otherwise hte present-day proxies Mbuti.DG, Levant_N, Onge.DG, Iran_N, Iron_Gates_HG, EHG, Mixe.DG, Anatolia_N, DevilsCave_N.SG, Tarim_EMBA1, Kolyma_M.SG and YR_LN. The only difference with respect to ref. ¹⁰ is that we used Tarim_EMBA1 (ref. ⁷²) instead of the three Russia_Bolshoy individuals⁶⁵, which is a higher-coverage dataset of 12 individuals and a better representative of the ANE lineage⁷³ than any other high-ANE ancestry group available in the literature.

To select the sources (or left populations) to model the admixed ancestry of our newly sequenced individuals (the targets), we followed the following rationale. Among the data available from previous studies, we selected only ancient populations (of more than two individuals) that are either approximately contemporaneous or temporally preceding but are as close as possible to the time period of our target individuals, as suggested previously⁷⁸. In our selection, we also considered the findings from a previous genomic study of the Avar period¹⁰, as well as populations that are geographically, historically and archaeologically relevant. This led to a selection of 13 different source groups falling in 3 categories. (1) Sources representative of the east Eurasian Steppe ancestry that include ancient populations and cultures available from preceding time periods in the east Eurasian Steppe and surrounding areas in east Asia. (2) ‘Pre-Avar’ populations that are found in the Carpathian Basin in the first centuries ad, before the Avar period. (3) Relevant temporally preceding (first millennia bc and ad) populations available from across the Pontic- and central Asian Steppe (the ‘steppe’ sources).

Two- and three-way combinations of these sources led to a total of 190 different combinations being tested, all with qpWave P-values of much less than 0.05, which means that the sources are sufficiently differentiated with respect to the set of outgroups. They are therefore suitable sources to be tested⁷⁶ (Supplementary Table 5), applying the following rationale, which is the same as that used in a previous study¹⁰ based on suggestions discussed previously⁷⁸. We first tested two-way admixing sources using all combinations of eastern Eurasian Steppe groups plus the pre-Avar and steppe sources. If we could reject one but not the other, between the pre-Avar and steppe source models (if one had P < 0.05 we can reject; if the other had P > 0.05 we cannot reject), we considered the one we cannot reject (P > 0.05) as valid. If the two-way models did not significantly reject one or the other between the pre-Avar and steppe sources (both with P > 0.05) or produced no fitting results at all (both with P < 0.05), we proceeded by testing three-way competitive models, including the eastern Eurasian populations and contrasting directly the pre-Avar plus steppe sources as well as pre-Avar plus pre-Avar, accounting for the variability in ancestry and time period between the pre-Avar populations.

If the three-way models resulted in one of the two contrasting sources between pre-Avar plus steppe resetting the other (bringing its estimated admixture proportion to 0%), we considered these models. If the contrasting sources had intermediate admixture proportions, we considered as successful only those tests that could reject one of the two scenarios between either pre-Avar plus steppe or pre-Avar plus pre-Avar. The individuals who still had unresolved or non-fitting models between a pre-Avar or a steppe source were considered unsolved or failed and were not used for further meta-analyses or interpretations.

For the sake of simplicity and consistency, we chose one eastern Eurasian source to include in our plots and summary statistics: the genetically easternmost group of individuals from the early Avar period in the DTI region (DTI_EA_East; Fig. 4 and previously published¹⁰), to which we added data from unrelated individuals at the early-period site of KUP that presented the same genomic profile (Supplementary Fig. 12). We always used this eastern proxy, except in the few instances in which it did not produce fitting models, in favour of another one, suggesting an existing heterogeneity in the eastern component although much reduced with respect to the variability in the western sources (Supplementary Table 5). Nevertheless, it is important to note that although DTI_EA_East is the source that overall produced more fitting models, several other eastern sources (including lateXiongnu, AR_Xianbei_P_2c) resulted in many equally fitting models as well (Supplementary Table 5).

We used DATES v.753 (https://github.com/priyamoorjani/DATES) to date the average time of the east–west Eurasian ancestry admixture estimated for most of the Avar period individuals from the four sites analysed. This method is based on the same principle as many admixture dating methods^70,79; it assumes an admixture event between two admixing source populations, an east Asian and a west Eurasian one; in our case we used the unadmixed and high-SNP-covered LBA/IA group of the Ulaanzuukh_SlabGrave in Mongolia⁶³ or the same DTI_EA_East group used in the main qpAdm models as an ANA proxy and the pre-Avar Carpathian Basin ancient sources, Sarmatian¹⁰ and Longobard period⁵⁸ individuals, as a west Eurasian ancestry proxy. DATES calculates the decay of ancestry covariance coefficients between every pair of available overlapping SNPs between the test individuals and the source populations over increasing-genetic-distance windows⁷⁰. Population-genetic theory suggests that if admixture happens, an exponential function can be fitted to the decay of weighted ancestry covariance, and the number of generations since admixture can be derived from the parameters of such functions⁷⁹. The higher age limit of admixture events that would still produce detectable decays is theoretically considered to be around 4,000 years⁸⁰. In practice, recent admixture events (about one to three generations ago) are not properly detected because chromosomal recombination had insufficient generation time to start producing the expected decay pattern^81,82. To estimate the goodness of a fit, DATES calculates standard errors and Z-scores using a jack-knife approach, dropping a chromosome at a time. We set a maximum distance parameter of 0.5 cM, a bin size of 0.001 and a starting genetic distance of 0.45 cM. The integrated least-square function was used to estimate the number of generations since admixture parameter. If the raw data show no decay, the exponential function either cannot be fitted or is fitted with low Z-scores, much less than 2, and unreasonable dating estimates with negative values, or large numbers over the theoretical maximum of 4,000 years back in time. All samples showing such values were also inferred as non-admixed by PCA and qpAdm and were excluded from our inferences. For Extended Data Fig. 9, we also included dates with Z-scores of less than 2 (shown with a transparency factor) because in part they reflect the recent (for example, first or second generation) admixture events that we can observe directly in the pedigrees. These DATES estimates are mostly not significant because there is no decay pattern yet to fit an exponential function, but some still provide qualitatively correct recent admixture dates (Supplementary Table 1). We used a standard of 29 years per generation⁷⁰ to convert the generation times in years since admixture, and used the Avar-period chronological phase of the individuals as the date at death.

Biological relatedness

We used KIN¹⁶ as the primary method to assess biological relatedness between each pair of individuals from the four sites we investigated, although we validated the relatedness estimates with the independent methods of haplotype-IBD (detailed below) and BREAD (https://github.com/jonotuke/BREADR) (Supplementary Information). Given that single-stranded UDG-half treated libraries still preserve a roughly 10–30% proportion of C-to-T deamination at the last two base pairs of the mapped fragments, for this analyses we masked two base pairs at both ends of the q30 reads using the trimBam module of bamUtil v.1.0.13 (ref. ⁸³) and used these masked bam files as input data. KIN can confidently identify first- and second-degree relations while differentiating between parent–child and sibling relations¹⁶. Although the method does not explicitly differentiate relationships within the second degree, it outputs information about IBD sharing that can help to differentiate between avuncular and grandparent–grandchild relationships. We simulated avuncular, half-sibling and grandparent–grandchild pairs (Supplementary Information) to show that the length of IBD segments and the number of IBD segments can be used to differentiate between avuncular and grandparent–grandchild relationships, while half siblings overlap with both cases. Furthermore, KIN provides indications about third-degree relationships (with around 70% accuracy at 4× sequence coverage). Although these analyses are not sufficient to confidently identify within second-degree relationships, and may lack the power to identify third-degree relatives, they can be crucial when combined with other information, such as pedigree information from different pairs as well as from information about the skeletal age at death, the sex and the uniparental haplogroups (Y chromosome and mtDNA). Therefore, all this information was considered when building and cross-checking the pedigrees of biological relatedness (Supplementary Information). For clarity, we numbered the pedigrees that we found and we define one pedigree as a group of individuals who can be directly connected with close genetic relatedness and for whom a line of descent can be traced. In the case of the largest pedigree we reconstructed (146 individuals from RK), we divided it into five pedigrees descending from five different groups of 11 ‘founder male individuals’ (including multiple brothers as co-founders).

Simulations on second-degree relationships

We followed the methods section for KIN¹⁶ and simulated eight diploid individuals using msprime⁸⁴ with default parameters for the mutation rate (1 × 10^–8 per base per generation), the recombination rate r (1 × 10^–8 per base per generation) and an effective population size of 3,000. For each individual, we simulated 22 chromosomes with the same lengths as the GRCh38.p14 genome. To form a pedigree, we first simulated a recombined set of chromosomes for each parent and combined them to create the progeny. We obtained recombination points for each chromosome from the software Ped-sim⁸⁵. We matched the genotype density and the coverage of reads to that of our samples. We simulated 60 such pedigrees (see figure S9 in ref. ¹⁶ and Supplementary Figs. 17 and 18).

Consanguinity test (ROHs)

Consanguinity can be tested genetically by a straightforward approach: counting the length and number of long stretches of homozygous portions along the genome of an individual. This analysis is usually defined as ROHs. To estimate ROH, we applied a method called hapROH⁸⁶ that was designed to infer them on pseudo-haploid, lower-coverage and higher-missing-data ancient DNA samples; the method has also been shown empirically to be highly consistent with independent ROH estimates calculated on the same ancient imputed diploid genomes¹⁰. Specific patterns of long ROH (more than 4 cM) along the genome of an individual are typical of consanguineous unions between some of its recent ancestors (up to second-degree cousins⁸⁶). In Extended Data Fig. 5 we plotted ROH using the python package implemented in hapROH (https://pypi.org/project/hapROH/).

Genotype likelihood calls and imputation/phasing

Haplotype-based analyses (such as IBD described below) require information of the phase for each pair of paternal and maternal chromosomes of an individual, and this in turn requires there to be virtually no missing data along the genome. Obtaining such data from ancient genomes has been shown by recent studies^87,88 to be reliable in other similar contexts for coverage of more than 0.5–0.7×, and it has also been applied to 1240k capture data^10,89 through simultaneous statistical imputation and phasing. We used the ancient-DNA-specific genotype caller MLE function of ATLAS (https://bitbucket.org/wegmannlab/atlas/)⁵² to call genotype likelihoods. ATLAS can also calculate the base-quality recalibration (the recal function) that we performed in batches among libraries sequenced in the same sequencing run, accounting for specific sequencing errors. ATLAS recalibration also corrects the base qualities accounting for the empirical ancient DNA-damage pattern observed from the data and reduces the effect of reference bias introduced by genome mapping by relying on a list of 10 million highly conserved genomic positions across 88 mammal species downloaded from ensembl (https://grch37.ensembl.org/). We called genotype likelihoods on the whole 1,000-genomes SNPs panel of around 20 million SNPs and used these calls as input data for imputation with GLIMPSE⁹⁰, for which we used the phased 1,000 genomes phase-3 release data as reference haplotypes⁹¹. We ran GLIMPSE with the default parameters using sex-averaged genetic maps from HapMap, as suggested previously⁸⁸. The function GLIMPSE_phase was used to perform simultaneous imputation and phasing on genomic chunks of 2,000,000 base pairs with a buffer of 200,000 base pairs. We then used the integrated GLIMPSE_ligate and GLIMPSE_sample functions and bcftools v1.3 (refs. ^88,92) to obtain the final phase/imputed vcf files with the genotypes posterior probabilities at every 1240k position.

Haplotype IBD sharing analysis

We performed haplotype IBD analysis with ancIBD, a recently developed method that accounts for the high phasing errors of ancient DNA⁹³. This analysis searches for long haploid blocks along the genomes of two individuals that are identical by descent (IBD), meaning they have been inherited by a common ancestor at some time in the past. Therefore, it can detect close genetic relatives (first to third degrees of relation) as KIN does, but it can also detect more-distant relations, up to sixth degree, within ranges of biological stochasticity⁸⁵. However, it requires a much higher threshold of coverage, reducing the number of individuals analysed relative to KIN. We used imputed or phased data, including only those individuals with more than 450,000 SNPs obtained with our pseudo-haploid calls and SNPs with genotype posterior probabilities greater than 0.99 after imputation. We used the HapBLOCK function of ancIBD to perform the pairwise estimation with default parameters and only shared blocks of more than 8 cM containing more than 220 SNPs per centimorgan were considered. To further filter for possible false-positive hits, we considered only shared IBD segments longer than 12 cM, and if a pair of individuals had segments of less than 16 cM, we included them only if they had more than one such segment (Supplementary Table 4). We used Cytoscape v.3.9.1 (ref. ⁹⁴) to plot the networks of pairwise IBD relations.

Network analysis

For the IBD network analysis, only the Avar-period individuals were included. Because subadult individuals might be a confounding factor when assessing the sex-specific patterns of mobility and connectedness, we made an additional network that included only adults. The threshold of adulthood was set at 18 years of age, based on the lower limit of the estimated age of the youngest parent. The entire network consisted of 257 nodes, of which 195 represented adults (105 male and 90 female individuals) and 62 subadults (35 male and 27 female individuals) from four archaeological sites. The links of the network are represented by the IBD connections, which number 2,658 if the entire network is considered and 1,211 if only the adults are selected (Supplementary Table 4). In our analysis, we considered both unweighted and weighted networks. The unweighted network represents a configuration in which the found IBD relations define the presence or absence of links irrespective of their values. However, in the weighted network, the links are weighted by the maximum IBD values of the analysis, allowing the magnitude of relatedness to be evaluated. Both networks are undirected because sharing of IBD segments between two individuals has no directionality.

Degree centrality (k) is defined as the number of links held by the node. The average degree ⟨k⟩ of the Avar-period adults’ network is 18.07. Considering the assigned weights on the links, which in our case is the sum of the weights (max_IBD) of the links attached to each node, the mean strength ⟨w⟩ is 1,620.54. When sex is considered as a node attribute, the degree and the strength distributions are significantly different between male and female individuals (Fig. 3c and Supplementary Figs. 47 and 48). For male individuals, ⟨k⟩ is 27.39 and ⟨w⟩ is 2,392.37, whereas for female individuals, ⟨k⟩ is 7.21 and ⟨w⟩ is 720.08. The two-sample Kolmogorov–Smirnov test revealed significant differences between the male and female individuals’ degree and strength distribution (P < 0.05).

The degree centrality of a node can be partitioned into within-module (k_W) and between-module (k_B) links by considering the archaeological site of the burial as a module. The k_B/k ratio represents the ratio of between-module connections over the total connections, which can range between 0 and 1, with 0 indicating that related individuals are buried solely at the same site and 1 indicating that related individuals are buried only at a different site. To evaluate this ratio, the value of degree centrality must also be considered because individuals with small degree centrality may have a higher k_B/k ratio. The other results of the analysis are explained in Supplementary Figs. 44–48. The analysis was performed using R and the node measurements were calculated using customized R scripts with the igraph package⁹⁵.

Isotope analysis and ¹⁴C dating

¹⁴C dating and isotope analysis (δ¹³C, δ¹⁵N) was performed in the same bone material in the isotope and radiocarbon laboratories at the Curt Engelhorn Centre Archaeometry in Mannheim, Germany. Bone samples were cleaned, chemically treated and collagen extracted using a modified Longin method⁹⁶. For stable isotope analysis of carbon and nitrogen, triplicates of the resulting collagen were combusted in an elemental analyser (PYROcube, Elementar) and isotopic ratios were measured by isotope ratio mass spectrometry (precisION, Elementar). The same collagen extract was used for ¹⁴C dating. After ultrafiltration to remove short-chained macromolecules, the collagen was reduced to graphite using either a commercially available system (AGE3, IonPlus) or a custom-made system. A MICADAS-type accelerator mass spectrometer (IonPlus) was used to determine the conventional ¹⁴C ages⁹⁷. ¹⁴C dates were modelled in the software Oxcal v.4.4.4 (ref. ⁹⁸) and terrestrial samples were calibrated using IntCal20 (ref. ⁹⁹). Bayesian modelling of ¹⁴C dates include prior information of relative chronological information provided by pedigrees following methods outlined previously²⁴. Model results and detailed explanations are given in Supplementary Tables 2 and 3 and Supplementary Information.

For all strontium measurements, the tooth enamel was extracted in a laboratory at the Institute of Archaeogenomics in Budapest. The surface of the teeth was cleaned by a Dremel tool with an abrasion tip, then, after a ten-minute ultrasonic bath, the enamel was carefully powdered with a diamond-coated dental drill bit attached to the Dremel tool, until 25–50 mg was obtained. Strontium separation chemistry for all samples followed a previous method¹⁰⁰. Analyses were performed on a Nu Instruments NuPlasma HR at the MC-ICP-MS facility in the Department of Geological Sciences at the University of Cape Town in Rondebosch, South Africa, and followed the procedure and referencing values (SRM987 ⁸⁷Sr/⁸⁶Sr of 0.710255) described previously¹⁰¹. Past 4.11 software¹⁰² was used for the statistical analysis of the isotope data.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The sequence data have been deposited in the European Nucleotide Archive (ENA) with the accession number PRJEB72021. The haploid genotype data are available through the Poseidon framework via GitHub at https://github.com/poseidon-framework/community-archive/tree/master/2024_GnecchiRuscone_CarpathianBasinAvarPedigrees (ref. ¹⁰³). Geographic maps were plotted with R³⁴.

References

Pohl, W. The Avars: A Steppe Empire in Central Europe 567–822 (Cornell Univ. Press, 2018).
Krader, L. Principles and structures in the organization of the Asiatic steppe-pastoralists. Southwest. J. Anthropol. 11, 67–92 (1955).
Article Google Scholar
Fowler, C. et al. A high-resolution picture of kinship practices in an Early Neolithic tomb. Nature 601, 584–587 (2022).
Article ADS CAS PubMed Google Scholar
Mittnik, A. et al. Kinship-based social inequality in Bronze Age Europe. Science 366, 731–734 (2019).
Article ADS CAS PubMed Google Scholar
Rivollat, M. et al. Extensive pedigrees reveal the social organization of a Neolithic community. Nature 620, 600–606 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Pohl, W., Krause, J., Vida, T. & Geary, P. Integrating genetic, archaeological, and historical perspectives on eastern central Europe, 400–900 ad. Hist. Stud. Cent. Eur. 1, 213–228 (2021).
Curta, F. Ethnicity in the steppe lands of the northern Black Sea region during the early Byzantine Times. Archaeol. Bulgarica 23, 33–70 (2019).
Google Scholar
Vida, T. in The Other Europe in the Middle Ages: Avars, Bulgars, Khazars, and Cumans (ed. Curta, F.) 13–46 (Brill, 2008).
Daim, F. in The Transformation of the Roman World (eds Goetz, H.-W. et al.) 463–570 (Brill, 2003).
Gnecchi-Ruscone, G. A. et al. Ancient genomes reveal origin and rapid trans-Eurasian migration of 7th century Avar elites. Cell 185, 1402–1413 (2022).
Article CAS PubMed PubMed Central Google Scholar
Balogh, C. A Duna-Tisza Köze Avar Kori Betelepülésének Problémái. PhD thesis, Eötvös Loránd Univ. (2013).
Lezsák, G. M. Avarok a Herke-Tónál. A Kunszállás-Fülöpjakabi Avar Temető Története (Antológia, 2008).
Mácsai, V. A Rákóczifalva-Bagi-földek 8A avar temetőjének feldolgozása. Master thesis, Eötvös Loránd Univ. Budapest (2012).
Rácz, Z. & Szenthe, G. Avar temető Hajdúnánás határában. Commun. Archaeol. Hung. 2009, 309–335 (2009).
Hajdu, T., Guba, Z. & Pap, I. A hajdúnánási avar temető embertani leletei. Preprint at Commun. Archaeol. Hung. 2009, 339–358 (2009).
Popli, D., Peyrégne, S. & Peter, B. M. KIN: a method to infer relatedness from low-coverage ancient DNA. Genome Biol. 24, 10 (2023).
Article CAS PubMed PubMed Central Google Scholar
Weisberg, D. E. Levirate Marriage and the Family in Ancient Judaism (Brandeis Univ. Press, 2009).
Гмыря, Л. Б. Страна гуннов у Каспийских ворот: Прикаспийский Дагестан в эпоху Великого переселения народов (Dagestanskoe Knizhnoe Izdatel Stvo, 1995).
Commercio, M. E. ‘Don’t become a lost specimen!’: polygyny and motivational interconnectivity in Kyrgyzstan. Cent. Asian Surv. 39, 340–360 (2020).
Article Google Scholar
Holmgren, J. in Marriage and Inequality in Chinese Society (eds Watson, R. S. & Ebrey, P. B.) 58–96 (Univ. California Press, 1991).
Taşbaş, E. The Turkic kinship system. Acta Orient. 72, 245–258 (2019).
Article Google Scholar
Fadlan, A. I. Mission to the Volga (NYU Press, 2017).
Maróti, Z. et al. The genetic origin of Huns, Avars, and conquering Hungarians. Curr. Biol. 32, 2858–2870 (2022).
Article PubMed Google Scholar
Massy, K., Friedrich, R., Mittnik, A. & Stockhammer, P. W. Pedigree-based Bayesian modelling of radiocarbon dates. PLoS ONE 17, e0270374 (2022).
Article CAS PubMed PubMed Central Google Scholar
Ventresca Miller, A. R. & Makarewicz, C. A. Intensification in pastoralist cereal use coincides with the expansion of trans-regional networks in the Eurasian Steppe. Sci Rep. 9, 8363 (2019).
Article ADS PubMed PubMed Central Google Scholar
Szenthe, G. & Gáll, E. A (needle) case in point: transformations in the Carpathian Basin during the early Middle Ages (late Avar period, 8th−9th century ad). Eur. J. Archaeol. 24, 345–366 (2021).
Article Google Scholar
Gnecchi-Ruscone, G. A. et al. Ancient genomic time transect from the Central Asian Steppe unravels the history of the Scythians. Sci. Adv. 7, eabe4414 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Depaermentier, M. L. C., Kempf, M., Bánffy, E. & Alt, K. W. Tracing mobility patterns through the 6th–5th millennia BC in the Carpathian Basin with strontium and oxygen stable isotope analyses. PLoS ONE 15, e0242745 (2020).
Article CAS PubMed PubMed Central Google Scholar
Gulyás, B. Cultural connections between the Eastern European steppe region and the Carpathian Basin in the 5th–7th centuries AD: the origin of the Early Avar Period population of the Trans-Tisza region. Diss. Archaeol. 3, 701–756 (2024).
Stark, S. in From the Huns to the Turks: Mounted Warriors in Europe and Central Asia (eds Daim, F. et al.) 59–87 (Tagungen des Landesmuseums für Vorgeschichte Halle, 2021).
Csiky, G. in Crossing Boundaries: Mounted Nomads in Central Europe, their Eastern Roots and Connections (eds Daim, F. & Meller, H.) 33–44 (Tagungen des Landesmuseums für Vorgeschichte Halle, 2022).
Vida, T. in Crossing Boundaries: Mounted Nomads in Central Europe, their Eastern Roots and Connections (eds Daim, F. & Meller, H.) 260–275 (Tagungen des Landesmuseums für Vorgeschichte Halle, 2022).
Bóna, I. Avar lovassír Iváncsáról. Archaeol. Értesítő 97, 243–261 (1970).
Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing https://www.R-project.org/ (R Foundation for Statistical Computing, 2022).
Dabney, J. et al. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc. Natl. Acad. Sci. USA 110, 15758–15763 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Rohland, N., Glocke, I., Aximu-Petri, A. & Meyer, M. Extraction of highly degraded DNA from ancient bones, teeth and sediments for high-throughput sequencing. Nat. Protoc. 13, 2447–2461 (2018).
Article CAS PubMed Google Scholar
Gansauge, M.-T. et al. Single-stranded DNA library preparation from highly degraded DNA using T4 DNA ligase. Nucleic Acids Res. 45, e79 (2017).
CAS PubMed PubMed Central Google Scholar
Gansauge, M.-T., Aximu-Petri, A., Nagel, S. & Meyer, M. Manual and automated preparation of single-stranded DNA libraries for the sequencing of DNA from ancient biological remains and other sources of highly degraded DNA. Nat. Protoc. 15, 2279–2300 (2020).
Article CAS PubMed Google Scholar
DeAngelis, M. M., Wang, D. G. & Hawkins, T. L. Solid-phase reversible immobilization for the isolation of PCR products. Nucleic Acids Res. 23, 4742–4743 (1995).
Article CAS PubMed PubMed Central Google Scholar
Fu, Q. et al. DNA analysis of an early modern human from Tianyuan Cave, China. Proc. Natl Acad. Sci. USA 110, 2223–2227 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Haak, W. et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature 522, 207–211 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Fu, Q. et al. An early modern human from Romania with a recent Neanderthal ancestor. Nature 524, 216–219 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Fellows Yates, J. A. et al. Reproducible, portable, and efficient ancient genome reconstruction with nf–core/eager. PeerJ 9, e10947 (2021).
Article PubMed PubMed Central Google Scholar
Schubert, M., Lindgreen, S. & Orlando, L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res. Notes 9, 88 (2016).
Article PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
Jónsson, H., Ginolhac, A., Schubert, M., Johnson, P. L. F. & Orlando, L. mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics 29, 1682–1684 (2013).
Article PubMed PubMed Central Google Scholar
Korneliussen, T. S., Albrechtsen, A. & Nielsen, R. ANGSD: analysis of next generation sequencing data. BMC Bioinformatics 15, 356 (2014).
Article PubMed PubMed Central Google Scholar
Renaud, G., Slon, V., Duggan, A. T. & Kelso, J. Schmutzi: estimation of contamination and endogenous mitochondrial consensus calling for ancient DNA. Genome Biol. 16, 224 (2015).
Article PubMed PubMed Central Google Scholar
Kloss-Brandstätter, A. et al. HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups. Hum. Mutat. 32, 25–32 (2011).
Article PubMed Google Scholar
Chen, H., Lu, Y., Lu, D. & Xu, S. Y-LineageTracker: a high-throughput analysis framework for Y-chromosomal next-generation sequencing data. BMC Bioinformatics 22, 114 (2021).
Article PubMed PubMed Central Google Scholar
Link, V. et al. ATLAS: analysis tools for low-depth and ancient samples. Preprint at bioRxiv https://doi.org/10.1101/105346 (2017).
Martiniano, R., De Sanctis, B., Hallast, P. & Durbin, R. Placing ancient DNA sequences into reference phylogenies. Mol. Biol. Evol. 39, msac017 (2022).
Article CAS PubMed PubMed Central Google Scholar
Jeong, C. et al. The genetic history of admixture across inner Eurasia. Nat. Ecol. Evol. 3, 966–976 (2019).
Article PubMed PubMed Central Google Scholar
Lazaridis, I. et al. Genomic insights into the origin of farming in the ancient Near East. Nature 536, 419–424 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Patterson, N. et al. Ancient admixture in human history.Genetics 192, 1065–1093 (2012).
Article PubMed PubMed Central Google Scholar
Allentoft, M. E. et al. Population genomics of Bronze Age Eurasia. Nature 522, 167–172 (2015).
Article ADS CAS PubMed Google Scholar
Amorim, C. E. G. et al. Understanding 6th-century barbarian social organization and migration through paleogenomics. Nat. Commun. 9, 3547 (2018).
Article ADS PubMed PubMed Central Google Scholar
de Barros Damgaard, P. et al. The first horse herders and the impact of early Bronze Age steppe expansions into Asia. Science 360, eaar7711 (2018).
Article PubMed PubMed Central Google Scholar
de Barros Damgaard, P. et al. 137 ancient human genomes from across the Eurasian steppes. Nature 557, 369–374 (2018).
Article ADS Google Scholar
Fu, Q. et al. The genetic history of Ice Age Europe. Nature 534, 200–205 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Jeong, C. et al. Bronze Age population dynamics and the rise of dairy pastoralism on the eastern Eurasian steppe. Proc. Natl. Acad. Sci. USA 115, E11248–E11255 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Jeong, C. et al. A dynamic 6,000-year genetic history of Eurasia’s eastern steppe. Cell 183, 890–904 (2020).
Article CAS PubMed PubMed Central Google Scholar
Krzewińska, M. et al. Ancient genomes suggest the eastern Pontic-Caspian steppe as the source of western Iron Age nomads. Sci. Adv. 4, eaat4457 (2018).
Article ADS PubMed PubMed Central Google Scholar
Lamnidis, T. C. et al. Ancient Fennoscandian genomes reveal origin and spread of Siberian ancestry in Europe. Nat. Commun. 9, 5018 (2018).
Article ADS PubMed PubMed Central Google Scholar
Li, J. et al. The genome of an ancient Rouran individual reveals an important paternal lineage in the Donghu population. Am. J. Phys. Anthropol. 166, 895–905 (2018).
Article PubMed Google Scholar
Mathieson, I. et al. Genome-wide patterns of selection in 230 ancient Eurasians. Nature 528, 499–503 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Mathieson, I. et al. The genomic history of southeastern Europe. Nature 555, 197–203 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
McColl, H. et al. The prehistoric peopling of Southeast Asia. Science 361, 88–92 (2018).
Article ADS CAS PubMed Google Scholar
Narasimhan, V. M. et al. The formation of human populations in South and Central Asia. Science 365, eaat7487 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ning, C. et al. Ancient genomes from northern China suggest links between subsistence changes and human migration. Nat. Commun. 11, 2700 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, F. et al. The genomic origins of the Bronze Age Tarim Basin mummies. Nature 599, 256–261 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Raghavan, M. et al. Upper Palaeolithic Siberian genome reveals dual ancestry of Native Americans. Nature 505, 87–91 (2014).
Article ADS PubMed Google Scholar
Sikora, M. et al. The population history of northeastern Siberia since the Pleistocene. Nature 570, 182–188 (2019).
Article ADS CAS PubMed Google Scholar
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
Article PubMed PubMed Central Google Scholar
Lazaridis, I. et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature 513, 409–413 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Reich, D. et al. Reconstructing Native American population history. Nature 488, 370–374 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Harney, É., Patterson, N., Reich, D. & Wakeley, J. Assessing the performance of qpAdm: a statistical tool for studying population admixture. Genetics 217, iyaa045 (2021).
Article PubMed PubMed Central Google Scholar
Loh, P.-R. et al. Inferring admixture histories of human populations using linkage disequilibrium. Genetics 193, 1233–1254 (2013).
Article PubMed PubMed Central Google Scholar
Hellenthal, G. et al. A genetic atlas of human admixture history. Science 343, 747–751 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Liang, M. & Nielsen, R. The lengths of admixture tracts. Genetics 197, 953–967 (2014).
Article PubMed PubMed Central Google Scholar
Iasi, L. N. M., Ringbauer, H. & Peter, B. M. An extended admixture pulse model reveals the limitations to human–Neandertal introgression dating. Mol. Biol. Evol. 38, 5156–5174 (2021).
Article CAS PubMed PubMed Central Google Scholar
Jun, G., Wing, M. K., Abecasis, G. R. & Kang, H. M. An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data. Genome Res. 25, 918–925 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kelleher, J., Etheridge, A. M. & McVean, G. Efficient coalescent simulation and genealogical analysis for large sample sizes. PLoS Comput. Biol. 12, e1004842 (2016).
Article ADS PubMed PubMed Central Google Scholar
Caballero, M. et al. Crossover interference and sex-specific genetic maps shape identical by descent sharing in close relatives. PLoS Genet. 15, e1007979 (2019).
Article PubMed PubMed Central Google Scholar
Ringbauer, H., Novembre, J. & Steinrücken, M. Parental relatedness through time revealed by runs of homozygosity in ancient DNA. Nat. Commun. 12, 5425 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Hui, R., D’Atanasio, E., Cassidy, L. M., Scheib, C. L. & Kivisild, T. Evaluating genotype imputation pipeline for ultra-low coverage ancient genomes. Sci. Rep. 10, 18542 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
da Mota, B. S. et al. Imputation of ancient human genomes. Nat. Commun. 14, 3660 (2023).
Childebayeva, A. et al. Population genetics and signatures of selection in early Neolithic European farmers. Mol. Biol. Evol. 39, msac108 (2022).
Article CAS PubMed PubMed Central Google Scholar
Rubinacci, S., Ribeiro, D. M., Hofmeister, R. J. & Delaneau, O. Efficient phasing and imputation of low-coverage sequencing data using large reference panels. Nat. Genet. 53, 120–126 (2021).
Article CAS PubMed Google Scholar
The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article Google Scholar
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ringbauer, H. et al. Accurate detection of identity-by-descent segments in human ancient DNA. Nat. Genet. 56, 143–151 (2023).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
Csárdi, G. & Nepusz, T. The igraph software package for complex network research. InterJournal Complex Systems 1695, 1–9 (2006).
Google Scholar
Brown, T. A., Nelson, D. E., Vogel, J. S. & Southon, J. R. Improved collagen extraction by modified Longin method. Radiocarbon 30, 171–177 (1988).
Article CAS Google Scholar
Kromer, B., Lindauer, S., Synal, H. A. & Wacker, L. MAMS – a new AMS facility at the Curt-Engelhorn-Centre for Achaeometry, Mannheim, Germany. Nucl. Instrum. Methods Phys. Res. B 294, 11–13 (2013).
Article ADS CAS Google Scholar
Bronk Ramsey, C. Bayesian analysis of radiocarbon dates. Radiocarbon 51, 337–360 (2009).
Article Google Scholar
Reimer, P. J. et al. The IntCal20 Northern Hemisphere radiocarbon age calibration curve (0–55 cal kBP). Radiocarbon 62, 725–757 (2020).
Article CAS Google Scholar
Pin, C., Briot, D., Bassin, C. & Poitrasson, F. Concomitant separation of strontium and samarium-neodymium for isotopic analysis in silicate samples, based on specific extraction chromatography. Anal. Chim. Acta 298, 209–217 (1994).
Article CAS Google Scholar
Copeland, S. R. et al. Strontium isotope investigation of ungulate movement patterns on the Pleistocene Paleo-Agulhas Plain of the Greater Cape Floristic Region, South Africa. Quat. Sci. Rev. 141, 65–84 (2016).
Article ADS Google Scholar
Hammer, Ø., Harper, D. A. T. & Ryan, P. D. PAST: paleontological statistics software package for education and data analysis. Palaeontol. Electron. 4, 4 (2001).
Google Scholar
Gnecchi-Ruscone, G. A. 2024_GnecchiRuscone_CarpathianBasinAvarPedigrees. GitHub https://github.com/poseidon-framework/community-archive/tree/master/2024_GnecchiRuscone_CarpathianBasinAvarPedigrees (2024)

Download references

Acknowledgements

We thank G. Csíky for work on the archaeological dataset; K. Sebők for help with the excavation documentation of the Rákóczifalva cemetery; A. Ben Rohrlach for advice on network statistical analyses; H. Ringbauer for sharing data, feedback and discussion on IBD analyses; and I. Rainer, P. Hofman and A. Plonka for graphical support. Data were produced by the Ancient DNA Core Unit of the Max Planck Institute for Evolutionary Anthropology, which is funded by the Max Planck Society. M. Spross, H. Beigzad, L. Schwarz, S. Lindauer, E. Dimitrakopoulos, E. Podolskaja, M. Hänisch and J. Wintel contributed to stable isotope analyses and radiocarbon dating at CEZA. S. Gábriel, D. Pokker, V. Bódis and K. Kerestély contributed to the DNA sample preparation in the HUN-REN RCH Institute of Archaeogenomics in Budapest. We thank the Hungarian Natural History Museum and the Department of Biological Anthropology of the University of Szeged for access to samples. This project received funding from the European Research Council under the European Union’s Horizon 2020 research and innovation programme (grant 856453 ERC-2019-SyG), the Czech Grant Agency (GACR 21-17092X), the Czech Ministry of Education, Youth and Sports (CZ.02.01.01/00/22_008/0004593) and the Max Planck Society. T.S. was supported by the ÚNKP-22-4 New National Excellence Program of the Ministry for Culture and Innovation from the National Research, Development and Innovation Fund. The analysis of the pre-Avar radiocarbon data was supported by the Hungarian National Research, Development and Innovation Fund project 128035 led by Z.R.

Funding

Open access funding provided by Max Planck Society.

Author information

These authors contributed equally: Guido Alberto Gnecchi-Ruscone, Zsófia Rácz

Authors and Affiliations

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Guido Alberto Gnecchi-Ruscone, Luca Traverso, Ke Wang, Rita Radzeviciute, Johannes Krause & Zuzana Hofmanová
Institute of Archaeological Sciences, ELTE - Eötvös Loránd University, Budapest, Hungary
Zsófia Rácz, Levente Samu, Norbert Faragó, István Koncz, Viktor Mácsai, Olga Spekker & Tivadar Vida
Department of Biological Anthropology, ELTE - Eötvös Loránd University, Budapest, Hungary
Tamás Szeniczey & Tamás Hajdu
Curt Engelhorn Center for Archaeometry gGmbH, Mannheim, Germany
Corina Knipper & Ronny Friedrich
Department of Archaeology and Museology, Faculty of Arts, Masaryk University, Brno, Czechia
Denisa Zlámalová & Zuzana Hofmanová
Department of History, University of Vienna, Vienna, Austria
Salvatore Liccardo, Sandra Wabnitz & Walter Pohl
Institute for Medieval Research, Austrian Academy of Sciences, Vienna, Austria
Salvatore Liccardo, Sandra Wabnitz & Walter Pohl
Department of Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Divyaratan Popli
MOE Key Laboratory of Contemporary Anthropology, Department of Anthropology and Human Genetics, School of Life Sciences, Fudan University, Shanghai, China
Ke Wang
Hungarian National Museum, Budapest, Hungary
Bence Gulyás
Department of Art History, Istanbul Medeniyet University, Istanbul, Turkey
Csilla Balogh
Institute of History, HUN-REN Research Centre for the Humanities, Budapest, Hungary
Gabriella M. Lezsák
ARC Centre of Excellence for Australian Biodiversity and Heritage, College of Arts, Society and Education, James Cook University, Cairns, Queensland, Australia
Magdalena M. E. Bunbury
Department of Biological Anthropology, University of Szeged, Szeged, Hungary
Olga Spekker
Department of Geological Sciences, University of Cape Town, Rondebosch, South Africa
Petrus le Roux
Institute of Archaeogenomics, HUN-REN Research Centre for the Humanities, Budapest, Hungary
Anna Szécsényi-Nagy & Balázs Gusztáv Mende
BirthRites Lise Meitner Research Group, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Heidi Colleran
Department of Human Behavior, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Heidi Colleran
Institute for Advanced Study, Princeton, NJ, USA
Patrick Geary
Institute of Archaeology, HUN-REN Research Centre for the Humanities, Budapest, Hungary
Tivadar Vida

Authors

Guido Alberto Gnecchi-Ruscone
View author publications
You can also search for this author in PubMed Google Scholar
Zsófia Rácz
View author publications
You can also search for this author in PubMed Google Scholar
Levente Samu
View author publications
You can also search for this author in PubMed Google Scholar
Tamás Szeniczey
View author publications
You can also search for this author in PubMed Google Scholar
Norbert Faragó
View author publications
You can also search for this author in PubMed Google Scholar
Corina Knipper
View author publications
You can also search for this author in PubMed Google Scholar
Ronny Friedrich
View author publications
You can also search for this author in PubMed Google Scholar
Denisa Zlámalová
View author publications
You can also search for this author in PubMed Google Scholar
Luca Traverso
View author publications
You can also search for this author in PubMed Google Scholar
Salvatore Liccardo
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Wabnitz
View author publications
You can also search for this author in PubMed Google Scholar
Divyaratan Popli
View author publications
You can also search for this author in PubMed Google Scholar
Ke Wang
View author publications
You can also search for this author in PubMed Google Scholar
Rita Radzeviciute
View author publications
You can also search for this author in PubMed Google Scholar
Bence Gulyás
View author publications
You can also search for this author in PubMed Google Scholar
István Koncz
View author publications
You can also search for this author in PubMed Google Scholar
Csilla Balogh
View author publications
You can also search for this author in PubMed Google Scholar
Gabriella M. Lezsák
View author publications
You can also search for this author in PubMed Google Scholar
Viktor Mácsai
View author publications
You can also search for this author in PubMed Google Scholar
Magdalena M. E. Bunbury
View author publications
You can also search for this author in PubMed Google Scholar
Olga Spekker
View author publications
You can also search for this author in PubMed Google Scholar
Petrus le Roux
View author publications
You can also search for this author in PubMed Google Scholar
Anna Szécsényi-Nagy
View author publications
You can also search for this author in PubMed Google Scholar
Balázs Gusztáv Mende
View author publications
You can also search for this author in PubMed Google Scholar
Heidi Colleran
View author publications
You can also search for this author in PubMed Google Scholar
Tamás Hajdu
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Geary
View author publications
You can also search for this author in PubMed Google Scholar
Walter Pohl
View author publications
You can also search for this author in PubMed Google Scholar
Tivadar Vida
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Krause
View author publications
You can also search for this author in PubMed Google Scholar
Zuzana Hofmanová
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and led by: Z.H., J.K., T.V., W.P., P.G., G.A.G.-R., and Z.R. Formal analyses: G.A.G.-R., Z.R., L.S., T.S., N.F., C.K., R.F., L.T., D.Z., D.P., K.W. Sample preparation and laboratory work: R.R., B.G., I.K., C.B., G.M.L., V.M., O.S., M.M.E.B., P.R., A.S.-N., B.G.M., T.H. Visualization: G.A.G.-R., L.S., Z.R., T.S., N.F., C.K., R.F., L.T., D.Z., D.P. Writing, original draft: G.A.G.-R., Z.R., Z.H., W.P., L.S., S.L., S.W., T.S., N.F., C.K., R.F., L.T. Writing, reviewing and editing: J.K., T.V., P.G., H.C. with contributions from all authors.

Corresponding authors

Correspondence to Guido Alberto Gnecchi-Ruscone, Tivadar Vida, Johannes Krause or Zuzana Hofmanová.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature thanks Daniel Ziemann and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Pedigree and cemetery map of the individuals analyzed in the site of Kunszállás (KFJ).

a) pedigree highlighting the father-son levirate union discovered. b) cemetery map showing the burial location of the related and unrelated individuals in Kunszállás.

Extended Data Fig. 2 Pedigrees and cemetery map of the individuals analyzed in the site of Kunpeszér (KUP).

a) the unconnected early (left) and late (right) Avar period pedigrees highlighting the possible levirate union reconstructed for pedigree 1. b) cemetery map showing the burial location of the related and unrelated individuals in Kunpeszér.

Extended Data Fig. 3 Pedigrees and cemetery map of the individuals analyzed in the site of Hajdúnánás (HNJ).

a) the unconnected admixed ancestry pedigree 1 and European ancestry pedigree 2. b) cemetery map showing the burial location of the related and unrelated individuals in Hajdúnánás.

Extended Data Fig. 4 Pie-charts showing the frequency of the Y-chromosome and mtDNA haplogroups in the four Avar-period sites.

The four sites are dominated by one predominant Y-chromosome lineage (or two in case of RK) and the remaining ones are mostly restricted to outlier, unrelated individuals or smaller pedigrees not genetically related to the main ones whose patterns are analyzed in this article. While the mtDN-haplogroup diversity is much higher and more uniformly distributed.

Extended Data Fig. 5 Runs of homozygosity (ROH), test for consanguinity.

all the Avar period individuals from the 4 sites are shown in 4 panels. The only individual that shows a pattern of long ROH consistent with its parents being relatively close relatives (possible 1^st cousins) is a European ancestry individual found in the RK site, unrelated and unconnected through IBDs to the main extended pedigrees described in the article.

Extended Data Fig. 6 Network analysis of the ancIBD haplotype-IBD sharing between the Avar-period individuals analyzed in the study and previously published.

a) Visualization of the network of IBD connections (edges) between the ancient individuals (nodes) colored coded as in Fig. 3. The published individuals were retrieved from a number of different sites and are color coded to the regions where the site is located, DTI or TT. b) box plot showing the number of IBD connections for all the individuals from the network in a) having at least one IBD link with individuals from a site other than its own (237 individuals meeting this requirement). The plot shows all individual data points as well as the median (black line), upper and lower quartiles (the contour of the box) and the whiskers are the minimum and maximum values as calculated by the standard boxplot() function in R. c) map showing the sites with individuals connected to Kunbábony though IBDs. The thickness of the lines corresponds to the number of individuals connected. The base geographic map is from https://www.naturalearthdata.com and plotted with R³⁴.

Extended Data Fig. 7 Summary of main burial customs and cemetery map of Rákóczifalva 8 and 8 A.

On the top, the main characteristics of burial customs in the early to middle and the middle to late Avar period in the Rákóczifalva (RK) cemetery. In the early phase, several people were buried with horse tools, pots, and animal skin (partial animal), while in the late phase, wooden grave structures were the dominant feature of burial customs. The changes in burial customs correspond to the community shift and the spatial organization of the cemetery. On the bottom, the cemetery map shows the distribution of early, middle and late Avar-period graves. The left part of the cemetery is where early to middle Avar period graves are found, while the middle and right part contain predominantly middle to late Avar-period graves.

Extended Data Fig. 8 Evaluation of the sources of admixture with Western or Central Eurasian ancestries.

a) Map showing the location of the published data used as reference for our ancestry modeling. b) “Eurasian PCA” (left) and “West Eurasian PCA” (right), showing the ancient genomes from the sites in a) and unpublished pre-Avar individuals from the RK site. c) Summary of the best working qpAdm models for the newly sequenced individuals from the 4 sites. Gray color represents unspecified West Eurasian sources and the pie charts at the bottom show the sites’ average for the specific West Eurasian sources tested (full data in Supplementary Table 5). Gray slices in the pie charts represent models with unresolved West Eurasian sources. The base geographic map is from https://www.naturalearthdata.com.

Extended Data Fig. 9 Results of DATES admixture dates for all the newly sequenced individuals.

a) Individuals are colored according to their chronological category and b) colored according to their site. On the x axis are reported the Eurasian PCA Euclidean distances of each individual to the Rouran genome, used as a proxy for a non-admixed East Eurasian Steppe ancestry¹⁰. A transparency factor is added to the admixture dates with Z-score <2. Standard errors (SE) and Z-scores are obtained using a standard jackknife approach of 23 independent runs, dropping a chromosome at the time (Methods).

Supplementary information

Supplementary Information

This file contains Supplementary text and data, including Supplementary Figs. 1–50, Supplementary Tables 7–11 and Supplementary references.

Reporting Summary

Peer Review File

Supplementary Table 1

Metadata for all the individuals processed for genomic sequencing and individual-based summary of best qpAdm ancestry deconvolution models and individual-based DATES admixture dating.

Supplementary Table 2

Archaeological and anthropological data of the Avar-period cemeteries of Rakoczifalva-Bagi-foldek sites 8 and 8A (RK), Kunszallas-Fulopjakab (KFJ), Kunpeszer-Felsőpeszeri ut (KUP) and Hajdunanas-Furj-halom-jaras site 41A (HNJ).

Supplementary Table 3

New stable isotope data.

Supplementary Table 4

Pairwise genetic relatedness estimates obtained with the various methods described in the text and their summary statistics: KIN analyses on TT and DTI sites; haplotype IBD analyses run with ancIBD; concordance between BREADR and KIN.

Supplementary Table 5

All the RK-site individual-based qpWave/qpAdm models tested in the study.

Supplementary Table 6

New ¹⁴C dates produced for the study.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gnecchi-Ruscone, G.A., Rácz, Z., Samu, L. et al. Network of large pedigrees reveals social practices of Avar communities. Nature (2024). https://doi.org/10.1038/s41586-024-07312-4

Download citation

Received: 13 April 2023
Accepted: 13 March 2024
Published: 24 April 2024
DOI: https://doi.org/10.1038/s41586-024-07312-4

This article is cited by

DNA from ancient graves reveals the culture of a mysterious nomadic people
- Michael Eisenstein
Nature (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.