Rajkumar Revathi - Genetic Structure of Four Socio-culturally Diversified Caste Populations of Southwest India, 1 ...
[ Pobierz całość w formacie PDF ]
BMC Genetics
Research article
Genetic structure of four socio-culturally diversified caste
populations of southwest India and their affinity with related Indian
and global groups
Revathi Rajkumar and VK Kashyap*
Address: DNA Typing Unit, Central Forensic Science Laboratory, 30 Gorachand Road, Kolkata, India-700014
Email: Revathi Rajkumar - revathi_77@rediffmail.com; VK Kashyap* - vkk2k@hotmail.com
* Corresponding author
Published: 19 August 2004
BMC Genetics
2004,
5
:23
Received: 17 January 2004
Accepted: 19 August 2004
doi:10.1186/1471-2156-5-23
© 2004 Rajkumar and Kashyap; licensee BioMed Central Ltd.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (
),
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract
Background:
A large number of microsatellites have been extensively used to comprehend the
genetic diversity of different global groups. This paper entails polymorphism at 15 STR in four
predominant and endogamous populations representing Karnataka, located on the southwest coast
of India. The populations residing in this region are believed to have received gene flow from south
Indian populations and world migrants, hence, we carried out a detailed study on populations
inhabiting this region to understand their genetic structure, diversity related to geography and
linguistic affiliation and relatedness to other Indian and global migrant populations.
Results:
Various statistical analyses were performed on the microsatellite data to accomplish the
objectives of the paper. The heretozygosity was moderately high and similar across the loci, with
low average G
ST
value. Iyengar and Lyngayat were placed above the regression line in the R-matrix
analysis as opposed to the Gowda and Muslim. AMOVA indicated that majority of variation was
confined to individuals within a population, with geographic grouping demonstrating lesser genetic
differentiation as compared to linguistic clustering. D
A
distances show the genetic affinity among
the southern populations, with Iyengar, Lyngayat and Vanniyar displaying some affinity with
northern Brahmins and global migrant groups from East Asia and Europe.
Conclusion:
The microsatellite study divulges a common ancestry for the four diverse populations
of Karnataka, with the overall genetic differentiation among them being largely confined to intra-
population variation. The practice of consanguineous marriages might have attributed to the
relatively lower gene flow displayed by Gowda and Muslim as compared to Iyengar and Lyngayat.
The various statistical analyses strongly suggest that the studied populations could not be
differentiated on the basis of caste or spatial location, although, linguistic affinity was reflected
among the southern populations, distinguishing them from the northern groups. Our study also
indicates a heterogeneous origin for Lyngayat and Iyengar owing to their genetic proximity with
southern populations and northern Brahmins. The high-ranking communities, in particular, Iyengar,
Lyngayat, Vanniyar and northern Brahmins might have experienced genetic admixture from East
Asian and European ethnic groups.
Page 1 of 8
(page number not for citation purposes)
BMC Genetics
2004,
5
:23
Background
The Indian subcontinent is regarded as a natural genetic
laboratory, owing to the co-existence and interaction of
socio-culturally, linguistically, ethnically and genetically
diversified endogamous populations in a geographical
terrain. It is believed that the earliest humans leaving
Africa for Eurasia might have taken a coastal route across
Saudi Arabia, through Iraq, Iran, to Pakistan and finally
entered India along the coastlines [1]. A second wave of
migration (~10,000 years ago) brought in Proto-Dravid-
ian Neolithic farmers from Afghanistan, who were later
displaced southwards by a large influx of Indo-European
speakers ~3500 years ago in to the subcontinent [2,3]. The
origin and settlement of the Indian people still remains
intriguing, fascinating scientists to explore the impact of
these past and modern migrations on the genetic diversity
and structure of contemporary populations [4-6].
could not be drawn due to low discriminatory power of
these markers. Recently, microsatellite markers have
gained immense popularity in precisely defining popula-
tion structure, diversity, affinities, gene flow and other
crucial aspects associated with population genetics [15-
21] because of the relative expediency, with which a large
number of loci and alleles can be typed, facilitating the
accumulation of vast data sets that can be readily analyzed
with an extensive array of statistical tools [22,23]. These
markers also demonstrate high heterozygosity [24], ren-
dering them highly suitable for carrying out the present
study.
Among the different caste and tribal groups inhabiting the
southwest coast of India, we have selected four predomi-
nant Dravidian-speaking communities from Karnataka:
Iyengar Brahmin, Lyngayat, Gowda and Muslim, they not
only belong to dissimilar groups of the Indian caste hier-
archy but also have varied migration histories, conferring
them uniqueness and significance from a genetic perspec-
tive. The present microsatellite study primarily attempts to
understand the genetic structure of the four selected pop-
ulations and to determine their genetic relationship with
other linguistically and ethnically similar groups of south-
ern India and Brahmin groups of northern India. It has
been suggested that that despite the linguistic homogene-
ity in southern India, these populations have remained
genetically diversified [25]. Hence, we sought to deter-
mine the role played by geographical location and linguis-
tic affiliation in genetically differentiating Indian
populations. Also, as mentioned earlier, the western coast
has witnessed colonization from different world popula-
tions, we aim to divulge the impact of these past migra-
tions on the gene pool of the present southern
populations by discerning their relationship with histori-
cally acclaimed and established migrant groups, ethni-
cally represented by European, Hispanic, East Asian and
African populations.
Anthropologically, southern and northern populations
are distinct and these differences are further substantiated
by (i) the presence of Neolithic sites in this region suggests
that Neolithic people of southern India came from north
by land and the west-coast by sea [7], (ii) the southern
megaliths resemble closely with those of the Mediterra-
nean and western-Europe, while those from northern
India are similar to megaliths found in Iran and Baluch-
isthan [8], and (iii) the predominance of Dravidian lan-
guage in this region as opposed to their secluded
occurrence in central Asia and other parts of India, sug-
gests that the Dravidian languages might have originated
within India [9]. It is, thus, of considerable genetic interest
to understand the genetic structuring and relationships of
southern populations.
The present study was carried out on one of the largest
southern states, Karnataka, positioned on the southwest
coast of India, with a dwelling of about 50 million people.
This expanse has been a rich source of prehistoric discov-
eries dating back to the Paleolithic era that are akin to
those seen in Europe [7]. Karnataka has received continu-
ous gene flow from different caste and linguistic groups
residing in the adjoining areas of Maharashtra, Andhra
Pradesh and Tamil Nadu [10], resulting in the congrega-
tion of a large number of diverse endogamous groups
within this region. Its large coastline of about 400 Km also
attracted the Portugese, Dutch and French traders, who
were seeking more profitable ventures on the southern
coast at large [2]. Southwest India is, thus, one of the most
disparate terrains, with extensive colonization in the past
and justifies an in-depth genetic study.
Results
Allele frequency at 15 STR was used to compute the heter-
ozygosity (observed) for the four studied populations,
which varied for each locus, and population but reflected
similar values, ranging between 0.724 and 0.797 (Table
1). An average G
ST
value of 0.009 elucidates the low degree
of genetic differentiation in them. However, the G
ST
value
for the pooled Indian and global populations demon-
strated a high value at 2.3% (data not shown). Genetic
relationship of studied populations with other similar
southern groups; Vanniyar, Gounder, Pallar and Tanjore
Kallar [26,27], northern Brahmins belonging to Orissa
[28] and Bihar [29], and four relevant global ethnic
groups: European, Hispanic, African [30] and East Asian
[31] was divulged by computing DA distances (Table 2)
and represented using NJ tree (Fig. 1). Among the four
A few studies utilizing classical markers have been carried
out on southern populations [5,11,12], including few
communities of Karnataka [13,14]. However, sound infer-
ences relating to their genetic structuring and diversity
Page 2 of 8
(page number not for citation purposes)
BMC Genetics
2004,
5
:23
Table 1: Average heterozygosity and G
ST
values for 15 loci in the four studied populations.
OBSERVED HETEROZYGOSITY
LOCUS
G
ST
BRAHMIN
LINGAYAT
GOWDA
MUSLIM
TPOX
0.707
0.581
0.542
0.555
0.010
D3S1358
0.661
0.793
0.559
0.488
0.036
THO1
0.815
0.785
0.678
0.688
0.005
D21S11
0.876
0.857
0.779
0.733
0.005
D18S51
0.907
0.938
0.779
0.888
0.006
PENTA E
0.921
0.876
0.864
0.800
0.014
D5S818
0.692
0.724
0.525
0.733
0.007
D13S317
0.753
0.714
0.745
0.733
0.007
D7S820
0.723
0.734
0.754
0.800
0.005
D16S539
0.861
0.846
0.830
0.777
0.010
CSF1PO
0.723
0.734
0.745
0.733
0.009
PENTA D
0.815
0.755
0.741
0.933
0.006
vWA
0.784
0.734
0.779
0.688
0.002
D8S1179
0.861
0.822
0.745
0.755
0.007
FGA
0.861
0.894
0.803
0.911
0.017
Average
0.797
0.785
0.724
0.747
0.009
Table 2: D
A
distance matrix between ten Indian and four global groups based on allele frequency at 15 microsatellites.
op
S
E
O
A
BIB LY
O
U
B L
N
K
D
HS
NE
0.079
MO
0.049
0.123
CA
0.029
0.086
0.07
OB
0.044
0.092
0.052
0.04
IB
0.044
0.091
0.041
0.04
0.03
LY
0.047
0.096
0.047
0.04
0.03
0.026
GO
0.066
0.122
0.072
0.07
0.055
0.036
0.047
MU
0.076
0.118
0.078
0.07
0.066
0.051
0.056
0.054
BB
0.043
0.101
0.052
0.05
0.038
0.031
0.037
0.054
0.068
PL
0.061
0.11
0.067
0.07
0.064
0.05
0.056
0.075
0.077
0.063
VN
0.045
0.105
0.042
0.04
0.034
0.023
0.028
0.039
0.053
0.037
0.049
TK
0.047
0.096
0.053
0.05
0.044
0.028
0.029
0.052
0.062
0.044
0.052
0.032
GD
0.064
0.112
0.064
0.06
0.051
0.036
0.043
0.057
0.073
0.054
0.059
0.032
0.043
Abbreviations used in Table-
Hispanic – HS, African – AF, Asian – AS, European – EU, OriyaBrahmin – OB, Iyengar Brahmin – IB, Lyngayat –
LY, Gowda – GO, Muslim – MU, Bihar Brahmin – BB, Pallar – PL, Vanniyar – VN, Tanjore Kallar – TK, Goundar – GD.
studied populations, Iyengar, Gowda and Muslim formed
a distinct cluster. Although NJ tree clearly depicts the clus-
tering of southern populations, D
A
distances indicate that
among these groups, Iyengar, Lyngayat and Vanniyar are
more similar to the northern Brahmins (0.030). Further-
more, genetic distances emphasize the affinity of Lyngayat
with Tanjore Kallar (0.029), Iyengar (0.026) and Vanniyar
(0.028). Estimation of relatedness between the southern
and global populations shows that all the southern com-
munities formed a separate cluster, nevertheless, genetic
distances disclose the affinity of upper caste Indian com-
munities; Iyengar, Lyngayat, Vanniyar, Bihar and Oriya
Brahmin with Europeans and East Asians. The Indian
populations were most distant to Africans.
The regression model (Fig. 2), of mean per locus heterozy-
gosity against distance from centroid assumes that when a
population experiences same amount of gene flow from a
homogenous source, a linear relationship exists between
the expected and observed heterozygosity. A change in
gene flow directly affects this linear relationship. The R-
matrix when applied to the Indian populations assists in
Page 3 of 8
(page number not for citation purposes)
BMC Genetics
2004,
5
:23
0.82
Go wda
PL
OB
Musl i m
0.81
IyengerBrahmin
Lyngayat
Tanjorekallar
IB
0.8
0.79
VN
LY
KL
0.78
Pallar
Vanniyar
0.77
Goundar
East Asian
0.76
GD
0.75
BiharBrahmin
Or i yaBr ahmi n
0.74
MU
0.73
Hi spani c
European
BB
0.72
GO
African
0.71
0.01
0
0.05
0.1
0.15
0.2
groups based on 15 STR markers
Neighbor-joining tree depicting the genetic relationship of
Karnataka populations with related Indian and global ethnic
Figure 1
Neighbor-joining tree depicting the genetic relationship of
Karnataka populations with related Indian and global ethnic
groups based on 15 STR markers.
Distance from centroid (rii)
levels in high-ranking populations of India
Regression plot demonstrating the relatively higher gene flow
Figure 2
Regression plot demonstrating the relatively higher gene flow
levels in high-ranking populations of India.
Abbreviations
used in figure:
OB-Oriya Brahmin, PL-Pallar, IB-Iyngar
Brahmin, VN-Vanniyar, LY-Lyngayat, TK-Tanjorekallar, GD-
Goundar, Mu-Muslim, BB-Bihar Brahmin, GO-Gowda.
understanding the influence of external gene flow and
admixture among populations. The higher observed than
expected heterozygosity of Iyengar and Lyngayat, placed
above the theoretical regression line helps infer that these
populations have received more than average external
gene flow, which was also observed in Vanniyar, Pallar
and Oriya Brahmin. The Gowda and Muslim groups
exhibit lower than expected heterozygosity values and fall
below the regression line, suggesting lesser admixture in
them.
India have played a major role in formation of the Indian
gene pool, however, very few genetic studies have been
carried out on them. The present study utilizes 15 STRs to
provide comprehensive genetic information on four pre-
dominant communities inhabiting the southwest coast of
India, which may significantly help in understanding the
genetic composition of southern populations.
The microsatellite diversity computed using AMOVA
revealed that the genetic variation observed in Indian
populations was mainly confined to variation amongst
individuals (~98%), irrespective of their geographic or
linguistic grouping (Table 3). The geographical clustering
of populations into three regions: north, southwest (Kar-
nataka) and southeast (Tamil Nadu) demonstrated a low
variance of 0.29%, p = 0.010 (Table 3a). As compared to
geographical grouping, the linguistic clustering (Indo-
Caucasian and Dravidian) exhibited a noticeable increase
in the molecular variance between the two groups, 0.65%
(p = 0.06, Table 3b). The genetic diversity among popula-
tions within each group remained almost similar at both
levels of analysis.
Genetic structure of Karnataka populations
The most distinctive feature revealed by the fifteen micro-
satellites was the considerable genetic homogeneity
amongst the four diverse caste groups residing in south-
west India. The presence of an almost similar allele fre-
quency pattern [34], suggests that these populations
might have a common ancestry or probably experienced
very high gene flow during the period of their coexistence.
The above finding is further supported by the low genetic
differentiation of 1.0% among the studied groups irre-
spective of their caste and migration histories. The high
heterozygosity and rii values in Lyngayat reflect the
admixture and stochastic processes experienced by it. The
genetic affinity of Lyngayat with other related southern
caste populations, like, Iyengar, Vanniyar and Tanjore
Kallar reiterates its heterogeneous past. It is noteworthy
that although the southern populations exhibited higher
affinity amongst each other, the high-ranking popula-
tions, like, Iyengar, Lyngayat and Vanniyar also displayed
Discussion
In recent years, population genetics has witnessed exten-
sive use of microsatellite markers to understand and evo-
lutionary histories of contemporary human populations
[17,32-34]. Though, the populations inhabiting south
Page 4 of 8
(page number not for citation purposes)
BMC Genetics
2004,
5
:23
Table 3: Genetic differentiation of Indian populations based on AMOVA
(a) Geographical grouping
Groups in set 1
Source of Variation
Percentage Variation
Group 1 – North: Bihar and Orissa populations
Among groups
0.29
Group 2 – South-west: Karnataka populations
Among populations in groups
0.97
Group 3 – South-east: Tamil Nadu Populations
Within populations
98.74
(b) Linguistic grouping
Groups in set 2
Source of Variation
Percentage Variation
Group 1 – Indo-European: Orissa and Bihar
Among groups
0.69
Group 2 – Dravidian: Southern populations
Among populations in groups
0.94
Within populations
98.40
some genetic similarity to Brahmins from Bihar and
Orissa, indicating that the gene pool of Iyengar and Lyn-
gayat probably consists of genetic inputs from both south-
ern and northern groups. However, strong conclusions
cannot be drawn due to low genetic differentiation among
the studied populations. Though the Gowda is known to
have moved in to Karnataka from the adjoining area of
Tamil Nadu, our study reveals that Gowda cluster with the
studied populations and not with Tamil groups. The low
hetetozygosity and high rii values of Gowda implies that
it might have differentiated as a result of stochastic proc-
esses. Furthermore, the relatively lower heterozygosity
and admixture levels of Gowda and Muslim might be
attributed to the socio-cultural practice of consanguine-
ous marriages in them. The Muslim group was found to be
genetically similar to local populations. Regional conver-
sions from diverse castes that occurred during the period
of Islamic dominance might elucidate the more or less
identical genetic relationship between Muslims and other
studied groups. The microsatellite study emphasizes the
genetic similarity among the Karnataka populations, with
the lack of any strong caste or religious bias in them.
viz., Dravidian and Indo-European. Our finding provides
evidence to the strong linguistic affinity prevailing
amongst the Dravidian speaking populations and imparts
them genetic distinctness from the Indo-European lin-
guistic group. Even though prior studies have indicated
that genetic clusters often correspond closely to
predefined regional and linguistic groups [34], AMOVA
suggests that caste system along with geographical conti-
guity are not ideal platforms for differentiating the ana-
lyzed Indian populations. It must, however, be
acknowledged that use of less number of polymorphisms
in this study might plausibly have led to the greater influ-
ence of linguistic affiliation on these populations rather
than geographical proximity.
Genetic affinity with global populations
The genetic differentiation of the studied populations
with relevant global migrant groups was estimated to be
2.3%, relatively lower than the 9% observed in another
similar study [16], which had used a different set of mic-
rosatellite markers. Sampling from a confined area, as
well as the use of lesser number of loci might have con-
tributed to this apparent difference in the results. The
southern populations formed a separate cluster from the
world populations. Molecular studies on Indian
populations using diverse markers (nuclear, mtDNA and
Y-chromosome) have demonstrated that the upper caste
populations have higher semblance with Europeans than
Asians [26]. Intriguingly, in the present study, communi-
ties belonging to the upper strata of the Hindu caste hier-
archy, i.e., Iyengar, Lyngayat, Vanniyar and northern
Brahmins, displayed almost identical genetic affinity with
both Europeans and East Asians. Therefore, all though it
is believed that south India remained isolated and cush-
ioned from the foreign invasions, the southern popula-
tions, especially, the high-ranking groups might have
genetically admixed with migrant groups that entered via
the west coast and north. Further exploration of their rela-
Analysis of genetic variance
AMOVA test strongly suggests that genetic diversity
among the southern populations was mainly confined to
intra-population variation, further emphasizing the
genetic homogeneity in them. Analysis using different
genetic markers corroborate with our finding that the
genetic diversity in human populations can be mainly
attributed
to
variation
within
populations
[4,17,19,34,36,37].
An exploration of the genetic differentiation based on
geographical grouping of populations discloses the
genetic similarity among populations residing in a region.
Nevertheless, the geographic affinity was comparably
lesser to that observed within the two linguistic families,
Page 5 of 8
(page number not for citation purposes)
[ Pobierz całość w formacie PDF ]