Evolution of human genes encoding cell surface receptors involved in the regulation of appetite: an analysis based on the phylostratigraphic age and divergence indexes

Genes encoding cell surface receptors make up a significant portion of the human genome (more than a thousand genes) and play an important role in gene networks. Cell surface receptors are transmembrane proteins that interact with molecules (ligands) located outside the cell. This interaction activates signal transduction pathways in the cell. A large number of exogenous ligands of various origins, including drugs, are known for cell surface receptors, which accounts for interest in them from biomedical researchers. Appetite (the desire of the animal organism to consume food) is one of the most primitive instincts that contribute to survival. However, when the supply of nutrients is stable, the mechanism of adaptation to adverse factors acquired in the course of evolution turned out to be excessive, and therefore obesity has become one of the most serious public health problems of the twenty-first century. Pathological human conditions characterized by appetite violations include both hyperphagia, which inevitably leads to obesity, and anorexia nervosa induced by psychosocial stimuli, as well as decreased appetite caused by neurodegeneration, inflammation or cancer. Understanding the evolutionary mechanisms of human diseases, especially those related to lifestyle changes that have occurred over the past 100–200 years, is of fundamental and applied importance. It is also very important to identify relationships between the evolutionary characteristics of genes in gene networks and the resistance of these networks to changes caused by mutations. The aim of the current study is to identify the distinctive features of human genes encoding cell surface receptors involved in appetite regulation using the phylostratigraphic age index (PAI) and divergence index (DI). The values of PAI and DI were analyzed for 64 human genes encoding cell surface receptors, the orthologs of which were involved in the regulation of appetite in model animal species. It turned out that the set of genes under consideration contains an increased number of genes with the same phylostratigraphic age (PAI = 5, the stage of vertebrate divergence), and almost all of these genes (28 out of 31) belong to the superfamily of G-protein coupled receptors. Apparently, the synchronized evolution of such a large group of genes (31 genes out of 64) is associated with the development of the brain as a separate organ in the first vertebrates. When studying the distribution of genes from the same set by DI values, a significant enrichment with genes having a low DIs was revealed: eight genes (GPR26, NPY1R, GHSR, ADIPOR1, DRD1, NPY2R, GPR171, NPBWR1) had extremely low DIs (less than 0.05). Such low DI values indicate that most likely these genes are subjected to stabilizing selection. It was also found that the group of genes with low DIs was enriched with genes that had brain-specific patterns of expression. In particular, GPR26, which had the lowest DI, is in the group of brain-specific genes. Because the endogenous ligand for the GPR26 receptor has not yet been identified, this gene seems to be an extremely interesting object for further theoretical and experimental research. We believe that the features of the genes encoding cell surface receptors we have identified using the evolutionary metrics PAI and DI can be a starting point for further evolutionary analysis of the gene network regulating appetite.


Introduction
Appetite (the desire of the animal organism to consume food) is a physiological mechanism (feeling) that regulates the intake of nutrients.The desire to consume food is one of the most primitive instincts that contribute to survival.This instinct has been formed over millions of years of evolution of living beings and has provided powerful mechanisms for adaptation and response to periods of nutrient shortage (Yeo, Heisler, 2012).The ability to consume excessive amounts of food during periods of its availability significantly affected the survival of individuals both in human populations and in populations of other animal species.
With the development of human civilization, the human populations living in developed countries faced the problem of adaptation to the abundance of food combined with a decrease in physical activity, making obesity one of the most serious public health problems of the twenty-first century (Kaidar-Person et al., 2011).Thus, the mechanism of adaptation to

831
ЭВОЛЮЦИОННАЯ КОМПЬЮТЕРНАЯ БИОЛОГИЯ / EVOLUTIONARY COMPUTATIONAL BIOLOGY unfavourable factors acquired during evolution in conditions of stable nutrient supply turned out to be excessive (Yeo, Heisler, 2012).
In humans and other animal species, the physiological system that regulates appetite functions with the participation of protein products of genes expressed both in the brain (Olszewski et al., 2008) and in peripheral organs and tissues: stomach, intestine, pancreas, adipose tissue.Neurons involved in the regulation of the motivational drive to obtain food are located in different parts of the brain (hypothalamic nuclei, amygdala, dorsal raphe nucleus, nuclei of the solitary tract, ventral tegmental area, prefrontal cortex, etc.).They integrate signals received from the sensory organs (olfactory, visual, taste sensations) as well as various interoceptive and humoral signals and control search for food and food consumption (Yeo, Heisler, 2012;Tremblay, Bellisle, 2015;Heisler, Lam, 2017).
Appetite can be induced by energy and nutrient shortages (in this case it is referred to by the term homeostatic appetite).However, even in the absence of apparent homeostatic needs, factors such as the sight, smell and taste of food, environmental cues, and the anticipation of new sensations that arise from eating can stimulate eating behavior, i. e. non-homeostatic appetite.The neuronal systems controlling homeostatic and non-homeostatic appetite function in close cooperation (Ahn et al., 2022).
Human pathological conditions characterized by appetite disorders are known.A pathological increase in body weight (obesity) can be caused by such a condition as hyperphagia (bulimia).A catastrophic decrease in appetite is seen in anorexia nervosa, which is extremely dangerous and increases the risk of death in young people tenfold (Fichter, Quadflieg, 2016).Reduced appetite can accompany chronic inflammatory and autoimmune processes, cancer and neurodegenerative diseases (Grossberg et al., 2010).In this context, any new knowledge about the system of genes regulating appetite is of particular importance.
Previously, we performed a functional analysis of genes involved in the regulation of appetite and body weight (Ignatieva et al., 2014(Ignatieva et al., , 2016)).When analyzing a set of 105 genes involved in appetite regulation, a statistically significant over-enrichment of genes specifically expressed in the brain was found.It was also revealed that a substantial proportion of genes (~45 %) in this set were genes encoding cell surface receptors.Many of these receptors belonged to the superfamily of G-protein-coupled receptors (GPCRs).
The GPCRs superfamily includes proteins that have a similar structure (all of them contain 7 transmembrane domains).These proteins can be found on the cell membranes of almost all eukaryotes (New, Wong, 1998;Yang et al., 2021).Analysis of the DNA sequence of the human genome made it possible to predict about 800 genes encoding proteins of this superfamily (including 388 genes encoding olfactory receptors) (Bjarnadóttir et al., 2006).GPCRs mediate the response of cells to extracellular signaling molecules of different nature -proteins, peptides, low molecular weight substances (odorous and taste stimuli, hormones), as well as light-sensitive compounds.In turn, these receptors activate signal transduction pathways in cells, providing fundamental physiological processes (vision, perception of taste and olfactory signals, neuronal functioning, endocrine regulation and reproduction processes) (Katritch et al., 2013).Some of the best known receptors from the GPCR superfamily, which we have previously classified as appetite-regulating genes (Igatieva et al., 2016), include, for example, GHSR (growth hormone secretagogue receptor), MC3R (melanocortin 3 receptor), MC4R (melanocortin 4 receptor), CCKAR (cholecystokinin A receptor), CCKBR (cholecystokinin B receptor) and GCGR (glucagon).
Understanding the evolutionary mechanisms of human diseases, especially those associated with lifestyle changes that have occurred over the last 100-200 years (and the above-mentioned diseases associated with appetite dysregulation are just such diseases), is of great fundamental and applied significance.It is also very important to find interdependence between the evolutionary characteristics of genes in gene networks and the resistance of these networks to disruptions of genes themselves (through mutations) and to alterations in gene expression patterns caused by genetic variability of regulatory regions.Phylogenetic and population analysis of genes and gene networks involved in the relevant biological processes may be useful in developing new scenarios for personalized prevention and targeted drug therapy of diseases.
The aim of this work was to identify the evolutionary features of human cell surface receptor genes involved in appetite regulation using phylostratigraphic age index (PAI) and divergence index (DI).To achieve this goal, at the first stage, a set of human receptor genes the orthologues of which were involved in appetite regulation in model organisms was formed based on the analysis of scientific publications.Next, the distributions of human genes according to PAI and DI values were examined.The characteristic features of these distributions were identified by comparison with the distributions obtained for all human protein-coding genes, as well as for genes encoding GPCRs.

Materials and methods
Collecting the list of genes involved in appetite regulation and encoding cell surface receptors.The list of genes was taken from (Ignatieva et al., 2016) and expanded based on a PubMed search (https://pubmed.ncbi.nlm.nih.gov/) using the keywords listed in Supplementary Material 1 1 .Only genes from experimental studies were considered; reviews were excluded.In most studies, the role of genes in the 832 Вавиловский журнал генетики и селекции / Vavilov Journal of Genetics and Breeding • 2023 • 27 • 7 Evolution of human genes encoding cell surface receptors involved in the regulation of appetite regulation of food consumption was established using model organisms (mice, rats, etc.).Therefore, the list of human genes controlling appetite included orthologues of those genes that were identified in experiments on other animal species.Indication that the product of a gene is a cell surface receptor was obtained from the text field "Summary" of the EntrezGene database (https://www.ncbi.nlm.nih.gov/gene).
Control sets of genes.The human gene sets listed in Table 1 were also used in the analyses.The list of human genes encoding receptors and controlling appetite was named Receptors_64.
The set containing all human protein-coding genes (allCDS_19,566) included 19,566 protein-coding genes for which PAI and DI values were identified.
The set containing human genes encoding GPCRs (allGPCR_389) was formed on the basis of the GPCRdb (https://gpcrdb.org)(Pandy-Szekeres et al., 2023).Genes encoding olfactory receptors were not included in this set because the set of appetite-controlling cell surface receptor genes (Receptors_64) did not contain genes encoding receptors of this type.
The set containing genes encoding GPCRs that control appetite (appGPCR_45) was obtained by the intersection of two sets -Receptors_64 and allGPCR_389.
Analysis of the evolutionary characteristics of genes.The analysis was performed using PAI (phylostratigraphic age index) and DI (divergence index).
Phylostratigraphic age index (PAI) shows to what extent the taxon reflecting the age of the gene is distant from the root of the phylogenetic tree.The taxon reflecting the age of the gene is understood as the taxon at which the divergence of the studied species from the most distant related taxon, in which the orthologue of the gene in question was found, occurred (Table 2).The greater the PAI value of the gene under study, the younger this gene is supposed to be.PAI values were calculated in the Orthoscape tool based on the KEGG Orthology service as described in (Mustafin et al., 2021).We used PAI values calculated at a similarity level of 0.5.
Divergence index (DI) is an index of evolutionary variability of a gene.DI is calculated based on the dN/dS ratio, where dN is the proportion of nonsynonymous substitutions in DNA sequences of the studied gene and its orthologue; dS is the proportion of synonymous substitutions.The value of this index was calculated based on the comparison of human genes with genes of closely related organisms from the Hominidae family, as described in (Mustafin et al., 2021).Thus, DI can be determined only for protein-coding genes and indicates the type of selection acting on the gene.DI value in the range from 0 to 1 shows that the gene is subjected to stabilizing selection, 1 -to neutral evolution, and more than 1 -to driving selection.
Analysis of tissue-specific characteristics of genes.We used the TSEA tool to find overrepresented groups of genes that had tissue-specific expression patterns identified for a certain organ or tissue (Wells et al., 2015).The TSEA tool (http://genetics.wustl.edu/jdlab/tsea/)uses data on tissue-specific gene expression patterns detected in 25 different human organs and tissues.The TSEA tool identifies groups of tissuespecific genes in the analyzed list of genes, the size of which

833
ЭВОЛЮЦИОННАЯ КОМПЬЮТЕРНАЯ БИОЛОГИЯ / EVOLUTIONARY COMPUTATIONAL BIOLOGY significantly exceeds the expected one for random reasons.
The TSEA tool uses data on specificity indices (SI) of gene expression products and their corresponding p-values (pSI).These values were calculated for each organ or tissue and for each transcript based on the analysis of data obtained by whole transcriptome profiling (GTEx Consortium, 2015).If the pSI value was < 0.01, the transcript was considered as tissue-specific for a given tissue.Statistical analysis.The significance of differences between the observed and expected numbers of genes in subgroups was evaluated using the Chi-Square test.

Genes encoding cell surface receptors and their functional characteristics
As a result of queries to PubMed, experimental data on genes of model organisms (mice, rats, etc.) involved in the regulation of food intake were found.Using this information, as well as information from EntrezGene, we found 64 human genes orthologous to genes identified in model organisms and encoding cell surface receptors (in Table 1 this set of genes is presented as Receptors_64).See Supplementary Material 2 for the full list of genes.
We compared these 64 genes with those accumulated in GPCRdb and found that 45 (70.3 %) out of 64 genes encoded G-protein-coupled receptors (Fig. 1, a).As is shown in Table 1, this subset of genes encoding receptors from the GPCR superfamily was named appGPCR_45 (Supplementary Material 2 contains data on whether the gene belongs to the GPCR superfamily).The remaining 19 genes (29.7 %) encoded receptors from the other superfamilies (this subset is represented in Table 1 as app_not_GPCR_19).
Analysis of the gene list using the TSEA tool (Wells et al., 2015) revealed that the Receptors_64 set was enriched in genes that have brain-specific expression pattern.Approximately one fifth of the genes (12 genes, or 18.75 %) fall into this category (see Fig. 1, b, Supplementary Material 3).

Phylostratigraphic age of genes (PAI-based analysis).
At the first step, we have analyzed the distribution of PAI values for all human protein-coding genes (the allCDS_19,556 set).PAI values were found to be unevenly distributed (Fig. 2, a).One third of all genes (33 %) had a PAI equal to zero (cellular organisms, the root of the phylogenetic tree), and the proportions of genes that had PAI values equal to 5 (the stage of vertebrate divergence) and 6 (the stage of euteleostomi divergence) were 17 % and 14 %, respectively.When we examined the distribution of PAI values for a set of human genes encoding cell surface receptors and involved in appetite regulation (the Receptors_64 set, Supplementary Material 4), we found that 31 genes out of 64 (i.e.48 %) had PAI values equal to 5 (the stage of vertebrate divergence) (see Fig. 2, a).And this number was significantly higher (p < 0.001) than the expected number (10.898) calculated based on the distribution obtained for the allCDS_19,556 gene set (see Fig. 2, a, Supplementary Material 5).
As noted above, a large proportion of genes from the Receptors_64 set (45 genes out of 64) encode GPCRs (see Fig. 1, a).To find out whether the evolutionary features of genes from the Receptors_64 set are caused by the features of genes from the GPCRs superfamily, we analysed the distribution of PAI values for a set of 389 human genes encoding GPCRs represented in the GPCRdb database (https://gpcrdb.org)(allGPCR_389 set).This distribution was also found to be different from the distribution obtained for all human protein-coding genes (see Fig. 2, b).The number of genes in the allGPCR_389 set that had PAI values equal to 5 (the stage of vertebrate divergence) was 39 % (150 genes out of 389) and it was significantly higher than the expected number calculated based on the proportion of this group of genes in the allCDS_19,566 set (Supplementary Material 6).
Next, we compared the distribution of PAI values for 45 genes encoding GPCRs and regulating appetite (the appGPCR_45 set) with the distribution for the allGPCR_389 set (see Fig. 2, c).In the group of genes from the appGPCR_45 set, 28 genes were found to have a PAI equal to 5 (the stage of vertebrate divergence) (i.e. 64 %), which was significantly higher than the expected number (17.35) calculated based on the proportion of this group of genes in the allGPCR_389 set (Supplementary Material 7).
As mentioned earlier, 19 receptor genes controlling appetite did not belong to the G-protein-coupled receptor superfamily (the app_not_GPCR_19 set).When the distribution over PAI values for this group of genes was examined, it was also found to differ from the distribution over PAI values for all human protein-coding genes (see Fig. 2, d).However, in this case, a significant (p < 0.05) excess over the expected number of genes with a PAI equal to 6 (the stage of euteleostomi divergence) was observed.The observed number in the app_not_GPCR_19 set was six genes out of 19 (32 %), whereas in the allCDS_19,566 set, PAI value equal to 6 was detected for 2,769 genes (14 %).Thus, the expected number of genes with PAI equal to 6 in the app_not_GPCR_19 set was 2.69 (Supplementary Material 8).

Evolutionary variability of genes (DI-based analysis).
The analysis of the distribution of genes from the Receptors_64 set according to DI values (Fig. 3, a, Supplementary Material 9) showed that 47 % of genes (30 out of 64) had DI < 0.2, most a, The proportion of genes encoding GPCRs; b, the proportion of genes that have brain-specific expression pattern (tissue-specific genes were identified using the TSEA tool).
Evolution of human genes encoding cell surface receptors involved in the regulation of appetite genes (63 out of 64, i. e. ~98 %) had DI < 1, and only one gene (QRFPR) had DI > 1, indicating that most of the genes are subjected to stabilizing selection.
Comparison of the distribution of genes from the Receptors_64 set by DI values with the distribution obtained for all human protein-coding genes (allCDS_19,566 set) showed that the Receptors_64 set is characterised by an increased content of genes with low DI values (see Fig. 3, a).The majority of genes from the Receptors_64 set (61 genes out of 64, i. e. 95 %) had DI < 0.6.And this number was significantly (p < 0.01) higher than the expected number (51.95) calculated using the distribution obtained for all human protein-coding genes (see Fig. 3, a, Supplementary Material 10).
When comparing the distribution over DI values for a set of all receptors from the GPCRs superfamily (allGPCR_389) with the distribution for all protein-coding genes (allCDS_19,566), no significant differences were found (see Fig. 3, b).
Comparison of the distribution over DI values for the appGPCR_45 set with the distribution for all receptors from the GPCRs superfamily (allGPCR_389) showed that the number of genes with low DI (DI ≤ 0.6) in the appGPCR_45 set (42 genes) was significantly (p 0.05) higher than the expected number of genes (37.018) calculated from the DI distribution for all genes encoding GPCRs (see Fig. 3, c, Supplementary Material 11).
As indicated above, approximately one-fifth (18.75 %) of genes from the Receptors_64 set are brain-specific.We have determined the content of genes that had brain-specific expression patterns in two subgroups of genes: (1) a subgroup of genes with low DI (DI ≤ 0.2); (2) a subgroup including all other genes (they had DI values between 0.2 and 1.3).It turned out that the number of brain-specific genes in these subgroups differs significantly from the expected number calculated based on random distribution: in the subgroup of genes with low DI, the content of brain-specific genes was increased (Fig. 4, Supplementary Material 12).

Discussion
Cell surface receptor genes constitute a substantial proportion (more than a thousand genes) of the human genome (Bausch-Fluck et al., 2018).The interest in the study of cell surface receptors is due to their important role in the cell.These transmembrane proteins interact with various molecules (ligands) located in the extracellular space and activate signal transduction pathways in the cell (Bausch-Fluck et al., 2018;Yang et al., 2021).A lot of substances and biochemical compounds (in particular, drugs) that affect the activity of cell surface Fig. 2. Distributions by PAI values obtained for the sets of human protein-coding genes presented in Table 1.a: all human protein-coding genes (allCDS_19,566) as a control set and the human appetite-regulating genes encoding receptors (Receptors_64); b: all human protein-coding genes (allCDS_19,566) as a control set and genes encoding GPCRs (allGPCR_389); c: genes encoding GPCRs as a control set (allGPCR_389) and genes encoding GPCRs controling appetite (appGPCR_45); d: all human protein-coding genes (allCDS_19,566) as a control set and genes controlling appetite but not belonging to the GPCRs superfamily (app_not_GPCR_19).

835
ЭВОЛЮЦИОННАЯ КОМПЬЮТЕРНАЯ БИОЛОГИЯ / EVOLUTIONARY COMPUTATIONAL BIOLOGY receptors (so-called agonists and antagonists) are known.Therefore, cell surface receptors are also of great interest from a biomedical point of view -for example, these proteins are targets for 66 % of drugs registered in the DrugBank database (Bausch-Fluck et al., 2018).This paper presents a set of 64 human genes encoding cell surface receptors, the orthologs of which are involved in food intake regulation in model organisms.The data are highly reliable, because this gene set was created on the basis of manual analysis of scientific publications.Finding such an impressive number of receptor genes involved in appetite regulation fits well with the idea of the complex nature of food motivation.
As mentioned above, appetite can satisfy both the basic needs of the organism for food (homeostatic appetite, which provides compensation of energy expenditure) and the needs for sensations associated with food (non-homeostatic appetite, aimed at obtaining positive emotions) (Johnson, 2013;Rebello, Greenway, 2016;Ahn et al., 2022).It is also known that food motivation can be adjusted depending on the life situation or psycho-emotional state of an individual (fright, depression, boredom, chronic stress, for animals -threat from predators, territory protection, mating behavior, etc.) (Lindén et al., 1987;Braden et al., 2018Braden et al., , 2023;;Hadjieconomou et al., 2020;Siegal et al., 2022).Such correction of food motivation is performed because the brain processes information received from the sensory organs and integrates it with signals about the state of various physiological systems of the body (Tomé et al., 2009;Holtmann, Talley, 2014;Spetter et al., 2014;Tremblay, Bellisle, 2015).And this process involves nerve cells with diverse specialization expressing a wide range of receptors on their surface (Yeo, Heisler, 2012;Heisler, Lam, 2017).
Examination of the distributions of genes by PAI values revealed that: (1) the Receptors_64 set has a significantly increased content of genes with the same phylostratigraphic age (PAI = 5, the stage of vertebrate divergence) than all proteincoding genes; (2) the subset of genes that encode GPCRs and are involved in appetite regulation (appGPCR_45) also contains an increased number of genes with the same phylostratigraphic age (PAI = 5, the stage of vertebrate divergence) than what would be expected based on the distribution of PAI values for all genes encoding GPCRs.
Thus, we found that the gene set composed of genes encoding cell surface receptors controlling appetite contains an increased number of genes with the same phylostratigraphic age (PAI = 5, the stage of vertebrate divergence).Apparently, the synchronised evolution of such a large group of genes (31 genes have PAIs equal to five) is associated with the formation of the brain as a separate organ in the first vertebrates (Sarnat, Netsky, 2002).It is noteworthy that almost all of these genes with PAI equal to five (28 out of 31) encode GPCRs, which agrees well with the fact that receptors of this genes encoding GPCRs with a PAI of 5 (the stage of vertebrate divergence) includes, in particular, genes encoding receptors for neuropeptide Y (NPY1R, NPY2R, NPY4R, NPY5R) and alpha-melanocyte stimulating hormone (MC3R and MC4R).
Neuropeptide Y and alpha-melanocyte stimulating hormone are signalling molecules secreted by neurons of the arcuate nucleus of the hypothalamus, a brain structure that acts as a central regulator of feeding behaviour (Yeo, Heisler, 2012;Heisler, Lam, 2017).
Another peculiarity was revealed for a subset of genes involved in appetite regulation but not encoding GPCRs (app_not_GPCR_19): it contains an increased number of genes with PAI equal to 6 (the stage of euteleostomi divergence).Notably, four genes from this group encode receptors involved in the regulation of immunity.These are GHR and LEPR encoding proteins from the type I cytokine receptor family and TLR2 and TLR4 encoding proteins from the Tolllike receptor family.
The PAI-based analysis has shown that the so-called "ancient" genes (i. e., genes with PAI equal to 0 (cellular organisms, the root of the taxonomic tree)) are also involved in the regulation of food intake.This group includes, for example, genes encoding (1) the insulin receptor (INSR), which, in particular, regulates secretion of neuropeptide Y and alphamelanocyte stimulating hormone by neurons of the arcuate nucleus of the hypothalamus (Leibowitz, Wortley, 2004), and (2) NTRK2, the receptor for BDNF (brain-derived neurotrophic factor), which mediates the anorexigenic effects of BDNF produced in the paraventricular nucleus of the hypothalamus (An et al., 2015;Chu et al., 2023).Both genes are expressed in different tissues and organs (Escandón et al., 1994;Federici et al., 1997), indicating that at early stages of evolution, ancestral forms of INSR and NTRK2 could be involved in the regulation of various biological processes and joined the system of genes regulating food intake at the evolutionary stage corresponding to the formation of specialised brain structures.
When considering PAI values, a group of relatively "young" genes was identified (PAI values of 6 and 7, the stages of euteleostomi and mammalia divergence).Five genes from this group encode receptors relevant to the immune system: these are the four genes mentioned earlier (GHR, LEPR, TLR2, and TLR4), as well as IL1R1.The detection of these genes in a group of relatively "young" genes is in good agreement with the known data on the adaptive immunity system having begun to form relatively recently in the course of evolution (Ward, Rosenthal, 2014).
When examining the distribution of genes from the Recep-tors_64 set by DI values, a significant enrichment of this group with genes subjected to stabilizing selection was revealed.It turned out that the subgroup of appetite-regulating genes encoding GPCRs (appGPCR_45) also contained an increased number of genes with low DI values.
An extremely low DI value (<0.005) was found for the GPR26 gene.GPR26 encodes a receptor from the GPCRs superfamily, the endogenous ligand of which has not yet been identified.Targeted inactivation of GPR26 in mice causes hyperphagia leading to early onset of diet-induced obesity (Chen et al., 2012).In addition, according to behavioural tests, Gpr26-deficient mice display increased anxiety-and depression-like behavior, and prefer ethanol to a greater extent than mice with normal genotype (Zhang et al., 2011).According to the TSEA tool, GPR26 has brain-specific expression pattern.In humans, GPR26 is expressed in the amygdala, hippocampus, and thalamus (Jones et al., 2007).The function of the GPR26 gene is evolutionarily conserved.In C. elegans, the Y5H2B gene with similarity to GPR26 was found.Ashrafi K. and co-workers used RNA-mediated interference to disrupt the expression of genes and found that Y5H2B increases fat content (Ashrafi et al., 2003).The functions of other genes that had extremely low DI values (<0.05) are described in Supplementary Ma terial 13.
Only one gene (QRFPR) among the genes from the Re-ceptors_64 set had a DI > 1 (see Supplementary Material 9), indicating that this gene is probably subjected to driving selection.QRFPR encodes the receptor for the orexigenic neuropeptide QRFP (pyroglutamylated RFamide peptide) (Cook et al., 2022).According to EntrezGene and UniProt databases, human QRFPR is expressed in different parts of the brain and in peripheral tissues (heart, kidney, stomach, testes, and thyroid gland).The mouse, rat, and hamster genomes are known to contain at least two genes encoding the neuropeptide receptor QRFP (Cook et al., 2022).No data like that are available for the human genome; however, it can be assumed that human QRFPR is not subjected to stabilizing selection, since the human genome also contains more than one gene encoding proteins with QRFPR-like activity.
We also found that the group of genes with low DI, i. e. most likely to be subject to stabilizing selection, is enriched in genes that have brain-specific expression patterns.This result agrees well with the finding made by G. Dumas et al. who examined the set of almost all human protein-coding genes (N = 11,667) and revealed that genes encoding brain-related proteins are among the most strongly conserved protein-coding genes in the human genome (Dumas et al., 2021).Among the genes that have low DI and brain-specific expression pattern, the previously mentioned GPR26 gene was found.Due to the fact that this gene has an extremely low DI and its endogenous ligand is still unknown (Chen et al., 2012), GPR26 seems to be an extremely interesting object for further theoretical and experimental studies.

Conclusion
In this paper, we analyzed the distributions of PAI and DI values for a group of human cell surface receptor genes, the orthologues of which were involved in appetite regulation in model organisms.It was found that the gene set under consideration contains an increased number of genes with the same phylostratigraphic age (PAI = 5, the stage of vertebrate divergence), which is apparently associated with the formation of the brain as a separate organ in the first vertebrates.A significant enrichment of this group of genes in genes with low DI values was also revealed, indicating a significant susceptibility of these genes to stabilizing selection.At the same time, the group of genes with low DI is enriched with genes that have brain-specific expression pattern.The

837
ЭВОЛЮЦИОННАЯ КОМПЬЮТЕРНАЯ БИОЛОГИЯ / EVOLUTIONARY COMPUTATIONAL BIOLOGY characteristic features of the cell surface receptor genes distribution according to the evolutionary indices PAI and DI revealed in this study are a starting point for further analyses of the evolutionary characteristics of the entire gene network controlling appetite.

Fig. 1 .
Fig. 1.Functional characteristics of human genes encoding cell surface receptors and involved in appetite regulation (genes from the Recep-tors_64 set).
PAI values were calculated at a threshold of 0.5 for the level of similarity between the DNA sequences of the orthologous genes.Asterisks indicate differences between the number of genes with a PAI equal to 5 (the stage of vertebrate divergence) (a-c) or a PAI equal to 6 (the stage of euteleostomi divergence) (d) and their expected numbers calculated based on the distributions in the control sets.*** p < 0.001, * p < 0.05.See Supplementary Materials 5

Table 1 .
Gene sets for which the distributions of PAI and DI values were analysed This set includes human genes orthologous to genes of other animal species, the role of which in appetite regulation has been studied experimentally. *

Table 2 .
PAI values and taxonomic units dating the corresponding phylostratigraphic age of genes superfamily are involved in processing signals from sensory organs, as well as signals transmitted by hormones and neuromediators (Pandy-Szekeres et al., 2023).Thus, the group of Distributions of genes from the sets presented in Table1according to the DI index.Distribution of genes from the Receptors_64 set by DI values.Shown are the proportions of genes that have brain-specific expression patterns according to the TSEA tool.The observed number of brain-specific genes differ from the expected number, * p < 0.05.(The numbers of genes in four subgroups are given in Supplementary Material 12.)