Type distribution of human papillomaviruses in ThinPrep cytology samples and HPV16/18 E6 gene variations in FFPE cervical cancer specimens in Fars province, Iran

Background There exists strong evidence that human papillomavirus (HPV) is associated with cervical cancer (CC). HPV E6 is a major oncogene whose sequence variations may be associated with the development of CC. There is not sufficient data on the distribution of HPV types in ThinPrep cytology specimens and HPV 16/18 E6 gene variations among CC patients in the southwest of Iran. This study was conducted to contribute to HPV screening and vaccination in Iran. Methods A total of 648 women screened for cervicitis, intraepithelial neoplasia or CC were included in the study. All participants underwent ThinPrep cytology testing, single-step HPV DNA detection and allele-specific reverse hybridization assays. Moreover, a total of 96 specimens previously tested positive for single infection with HPV16 or 18 were included for variant analysis. HPV16/18 lineages and sublineages were determined by PCR assays followed by sequencing the E6 gene and the construction of neighbor-joining phylogenetic trees. Results Overall, HPV DNA was detected in 62.19% of all the screened subjects. The detection rates of HPV DNA among individuals with normal, ASC-US, ASC-H, LSIL, and HSIL cervical cytology were 48.9%, 93.6%, 100%, 100%, and 100%, respectively. Low-risk HPVs were detected more frequently (46.9%) than high-risk (38.9%) and possible high-risk types (11.1%). Of 403 HPV-positive subjects, 172 (42.7%) had single HPV infections while the remaining 231 (57.3%) were infected with multiple types of HPV. Our results indicated a remarkable growth of high-risk HPV66 and 68 and low-risk HPV81 which have rarely been reported in Iran and HPV90 and 87 that are reported for the first time in the country. In addition, 3 lineages (A, D, and C) and 6 sublineages (A1, A2, A4, C1, D1, and D2) of HPV16, and one lineage and 4 sublineages (A1, A3, A4, and A5) of HPV18 were identified. The studied HPV16 and 18 variants mainly belonged to the D1 and A4 sublineages, respectively. Conclusion The present study suggests that the prevalence of HPV infection in women of all age groups with or without premalignant lesions in the southwestern Iran is high and the predominant HPV types in the southwest of Iran may differ from those detected in other parts of the country. This study also highlights the necessity of not only initiating HPV vaccination for the general population but also developing new vaccines that confer immunity against the prevalent HPV types in the area and national cervical screening programs using a combination of thinPrep cytology test and HPV detection assays in order to improve the accuracy of the screening. Supplementary Information The online version contains supplementary material available at 10.1186/s12935-023-03011-8.

immunity against the prevalent HPV types in the area and national cervical screening programs using a combination of thinPrep cytology test and HPV detection assays in order to improve the accuracy of the screening.Keywords HPV, ThinPrep, Phylogenetic analysis, Lineage, HPV16, HPV18, Cervical cancer Background Known as the fourth most common gynecological malignancy, cervical cancer (CC) was reported to account for about 604,000 new cases and approximately 342,000 deaths across the world in the year 2020 [1].The incidence rate of CC is increasing in Iran and has been estimated to be 4.5 per 100,000 women [2].Annually, one out of every 123 women develops CC and nine out of every 100,000 women die of this disease [3].Persistent human papillomavirus (HPV) infection is known to be the primary cause of CC as well as the preceding premalignant lesions referred to as cervical intraepithelial neoplasia (CIN) [4].The development of CC from CIN takes place through a long reversible process.Timely diagnosis and intervention during this process can reduce the risk of CC occurrence [5].There exist 51 recognized HPV types that are classified into three distinct risk groups based on their association with CC.High-risk HPV (HR-HPV) types (16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, and 68) are known to have great carcinogenic potential.Possible high-risk (pHR-HPV) types (26,53,67,70,73, and 82) are suspected of carcinogenesis but whether they are indeed associated with CC or not is yet to be proved.Low-risk HPV (LR-HPV) types (6, 7, 11, 13, 30, 32, 34, 40, 42, 43, 44, 54, 61, 62, 69, 71, 72, 74, 81, 83, 84, 85, 86, 87, 89, 90, 91, 97, 102, 106, and 114), on the other hand, are generally responsible for non-cancerous genital lesions [6][7][8].
Proper management of CC relies on early diagnosis and effective prophylactic vaccination [9].There are HPV vaccines, such as GARDASIL ® 9, that target most of the HR-HPVs.However, not only has not HPV vaccination been included in the national vaccination program in Iran, but also the vaccines that are available in the market are only protective against HPV16 and 18.Furthermore, since the prevalence of HPV types varies by geographic region [10,11], the distribution pattern of HPV types should be considered to select the most effective HPV vaccine for different populations as well as different age groups [12].
HPV types are classified into different lineages provided that there is a difference of 1%-10% in complete genomic nucleotide sequences, and further into different sublineages in case of 0.5%-1% sequence variation, allowing better identification of viral heterogeneity [13].HPV16 variants have been classified into four principal lineages based on sequences published by Burk et al. [13], and also presented in the Papillomavirus Episteme database (http:// pave.niaid.nih.gov).These lineages are as follows; lineage A, including sublineages A1-A3 (previously known as European variants) and A4 (previously known as Asian variant); lineage B, comprising sublineages B1, B2 (formerly known as African-1 variants), B3, and B4; lineage C, with sublineage C1 (formerly African-2 variant), C2, C3, and C4; and lineage D, consisting of sublineages D1 (previously North American variant), D2, D3 (Asian-American variants), and D4.Moreover, HPV18 variants have been classified into three main lineages: lineage A, including sublineages A1, A2 (previously known as Asian Amerindian variants), and A3-A5 (formerly known as European variants); lineage B, with sublineages B1-B3, and lineage C with the only sub-lineage named C1 (all previously known as African variants) [13].According to epidemiological findings, HPV16 variants pose the risk of persistent infection, progression to pre-cancer, and cancer.Furthermore, HPV16 sublineages A3, A4, and D have been reported to be associated with higher risks of CC [14][15][16].Conversely, most studies do not support any of HPV18 lineages or sublineages carrying a higher risk of cancer compared to others [17,18].However, these findings were not replicated globally and the inconsistent results highlight the necessity of further studies on the distribution of HPV16/18 lineages and sublineages in different regions and their oncogenicity considering ethnicity.Undoubtedly, regional data on the prevalence and type distribution of HPVs are of great importance to evaluate the potential of currently available HPV vaccines to prevent CC.In addition, the knowledge of HPV type distribution in each country is pivotal to vaccine development and national vaccination programming.Although there are many reports on the distribution of different HPV types and variants worldwide, there seem to be few regional studies investigating the distribution pattern of HPV types [19,20] and HPV16/18 variants in the Iranian population [21][22][23][24].Previous studies in Iran have reported the prevalence of HPV in different cervical specimens to range from 5.5% to 9.4% in normal cytology specimens [25][26][27]; between 61.7% and 65.3% in CIN I-III samples [26,28,29], and between 75.2% and 87% in CC specimens [30][31][32].However, regardless of cervical cytology result, the overall prevalence of HPV infection was reported to be 52.25% in female outpatients referred to the laboratories of Tehran, the capital of Iran, between 2019 and 2021.The rate of HR-HPV and LR-HPV types among these patients were 42.1% and 57.9%, respectively [33].Herein, we have examined the type-specific prevalence of HPVs with a large sample size and further investigated the E6 gene-based genetic variability of HPV16/18 lineages and sublineages according to the severity of the cervical lesions in the southwest of Iran.

Study subjects
A total of 722 women who attended the outpatient office of Shahid Motahari Gynecology Clinic, a reference center affiliated with Shiraz University of Medical Sciences, Shiraz, Iran to be screened for cervicitis, intraepithelial neoplasia or CC from February 2018 to November 2021 were considered for inclusion in the study.After being informed of the research goals, the participants voluntarily completed a series of examinations consisting of ThinPrep cytology test and HPV DNA detection and typing assay.All participants were aged between 16 and 75 years old and met the inclusion criterion of having a history of sexual intercourse.Further, (1) pregnant women or those who had terminated their pregnancy within 3 months prior to the study, (2) those with mental disorders, (3) vaginal, cervical or uterine hemorrhage, (4) acute infections of lower genital tract, vulva, vagina or cervix, (5) concurrent sexually transmitted diseases, and (6) full or partial HPV vaccination were excluded from the study.Finally, 648 participants were identified as qualified for the study.Moreover, a total of 96 specimens previously screened for HPV infection, including 48 with HPV16 single infection and 48 with HPV18 single infection, were included in the study for variant analysis.Each group consisted of 12 normal, 12 low-grade squamous intraepithelial lesion (LSIL), 12 high-grade squamous intraepithelial lesion (HSIL), and 12 CC specimens.Premalignant/malignant samples in the mentioned groups were formalin-fixed paraffin-embedded (FFPE) tissue biopsies whereas the normal ones were ThinPrep Pap Test specimens.Hematoxylin-eosin staining was performed for all specimens and the diagnoses were confirmed by experienced pathologist.

ThinPrep cytology testing
For cytology examination, ThinPrep liquid-based cytology samples were referred by physicians to the laboratory based on standard CC screening methodology.Briefly, a sampling brush was used to collect exfoliated cells at the cervical canal and the external aperture of the cervix.Collected cells were stored in a preservation solution.Using a ThinPrep 2000 system, a thin-layer cell smear was prepared for Pap examination.Grading was carried out based on the Bethesda criteria [34] as follows: (I) no intraepithelial lesion or malignancy (NILM); (II) abnormality of squamous epithelial cell including: a, atypical squamous cells (ASCs), consisting of ASCs of undetermined significance (ASC-US) and ASCs that cannot exclude high-grade squamous intraepithelial lesion (ASC-H); b, low-grade squamous intraepithelial lesion (LSIL); c, high-grade squamous intraepithelial lesion (HSIL); and d, squamous cell carcinoma (SCC); 3) glandular epithelial cell abnormality including: a, atypical glandular cells (AGCs), consisting of AGC not otherwise specified (AGC-NOS) and AGC suspicious for neoplasia (AGC-N); b, cervical adenocarcinoma in situ of the cervical canal (AIS); and c, adenocarcinoma (ADC); and 4) other malignant tumors.

DNA extraction
ThinPrep media containing suspended cells were mixed by inversion and 500 µl-aliquots were separated for DNA extraction using QIAamp DNA mini kit (Qiagen, Hilden, Germany).In the case of FFPE tissue specimens, four 5-μm-thick slices were cut and collected in autoclaved Eppendorf microtubes for each patient.Only one case was sectioned at a time; the microtome blade was changed and the workplace was cleaned with ethanol thoroughly along with the microtome between every two cases to prevent contamination.Furthermore, paraffin blocks without tissue were cut after every real specimen and served as negative controls of DNA extraction process.DNA was extracted from FFPE tissue specimens using QIAamp DNA FFPE Tissue Kit (Qiagen, Hilden, Germany) according to the manufacturer's instruction.The concentration of the extracted DNA was measured by a NanoDrop (ND-1000) spectrophotometer (peQLab Biotechnologie, Erlangen, Germany).All DNA specimens were stored at − 70 °C until required.

Determination of HPV16 and 18 lineages and sublineages and phylogenetic analysis
Primer sets HPV16-E6-F/HPV16-E6-R and HPV18-E6-F/HPV18-E6-R were used for the amplification of full-length HPV16/18 E6 gene in the DNA samples extracted from the FFPE tissue specimens that had previously tested positive for single infections with either HPV16 or 18 (Additional file 1: Table S1).The amplification of HPV16/18 E6 gene was performed in a 50-μl reaction mixture containing 500 ng of DNA template, 0.5 µM of each primer, and TEMPase Hot Start DNA Polymerase 2 × Master Mix (Ampliqon, Odense, Denmark).PCR amplification cycles included an initial 15-min denaturation at 95 °C, followed by 40 cycles of 95 °C for 45 s, 55 °C for 1 min, and 72 °C for 1 min, and a final elongation at 72 °C for 5 min.A reaction mixture without template DNA, as a negative control, was included in every PCR run.Plasmids containing HPV16 and HPV18 DNA cloned in pBluescript (Manassas, VA, USA) and pBR322 (Manassas, VA), respectively, which were available from a previous study [35], were used as positive controls.
Since using HPV16 E6 G433 and A532 nucleotide sequences could not distinguish between D1 and D4 sublineages, all HPV16 isolates which belonged to the D1/D4 sublineages were further analyzed by PCR assay using 16LCR-F/16LCR-R primer set for the amplification of HPV16 long control region (LCR) according to the previously published protocol [36] (Additional file 1: Table S1).The single nucleotide polymorphism (SNP) C7781T was considered as a diagnostic criterion for differentiation between D1 and D4 sublineages.The specific detection of D1 sublineage was achieved by observing the following six SNP variations: G145T, T286A, A289G, C335T, T350G, and C7781T.
Following visualization on 1.5% agarose gel, the bands of generated amplicons were excised and purified using GF-1 PCR Clean-Up Kit (Vivantis, Malaysia) and were subsequently subjected to sequencing (Sequetech Corp.,Mountain View, CA, USA) in both directions.The obtained sequences are available at http:// www.ncbi.nlm.nih.gov/ with GenBank accession numbers from OP572427 to OP572522.All the sequences were analyzed using BLAST software program (http:// www.blast.ncbi.nlm.nih.gov/ blast/ html) and classified into lineages and sublineages according to the prototype reference sequences given in the Papillomavirus Episteme database (http:// pave.niaid.nih.gov).Phylogenetic trees were generated using maximum-likelihood method Mega software version 7 [37], and the sublineages were identified based on 0.5-1.0%differences between isolate genomes.

Statistical analysis
Data were analyzed using SPSS version 21.0 (SPSS Institute, Chicago, IL, USA).Chi-square test or two-sided Fisher's exact test was used to analyze the potential association of HPV PCR results, HPV types, and lineages with age, cervical pre-neoplastic lesions, CC, and other categorical factors, where appropriate.p-values less than 0.05 were considered to be statistically significant.
The detection rates of HPV testing among individuals with normal, ASC-US, ASC-H, LSIL, and HSIL cervical cytology were 48.9%, 93.6%, 100%, 100% and 100%, respectively (Table 1).The frequency of HPV infection was significantly higher among participants with abnormal cervical cytology than that among individuals with normal cervical cytology (p < 0.0001).The distribution of HR-HPV, pHR-HPV, and LR-HPV types in different cervical cytology groups is presented in Fig. 2.
Of 403 HPV-positive cases, 172 (42.7%) were found to be infected with a single HPV type while the remaining 231 (57.3%) cases were infected with multiple types of HPV.Multiple HPV infections were significantly more frequent among participants with abnormal cervical cytology than those with normal cervical cytology (p < 0.0001).Moreover, the rate of multiple HPV infections was significantly higher among participants aged above 30 years compared to those with ≤ 30 years of age (p = 0.001).The distribution of single and multiple HPV infections in the groups of different cervical cytology  grades and age ranges is presented in Tables 1 and 2, respectively.
Examining the frequency of HPV types in different cytology groups, we found that individuals infected with at least one high-risk or possible high-risk HPV type comprised a significantly larger proportion of abnormal cervical cytology group compared to the normal cervical cytology group (p < 0.0001).Moreover, among the LR-HPV types, HPV6 was found to be the most frequent in all of the cytology groups having infected 36.2%,33.3%, 32.1%, and 43.8% of individuals in the ASC-US, ASC-H, LSIL, and HSIL group, respectively.Among the HR-HPV types, HPV16 was the most prevalent type in the ASC-US, ASC-H, and LSIL cytology group with the infection rates of 27.7%, 23.8%, and 46.4%, respectively.Interestingly, in the case of the HSIL group, HPV56 was the most frequent HR-HPV type with an infection rate of 43.75%.
With regard to the frequency of HPV types among single and multiple infection cases, HPV6 was the most common LR-HPV type accounting for 47.1% of single infections and present in 44.6% of multiple infection cases.Furthermore, while HPV16, 18, and 52 were the most frequent HR-HPV types responsible for single HPV infections (9.3%, 4.7%, and 4.7% of single infections respectively), HPV16, 56, and 66 were the most frequently detected HR-HPV types among multiple HPV infection cases (present in 23.8%, 20.8%, and 19.5% of multiple infections respectively).HPV53 was found to be the most common pHR-HPV type among both single (6.4%) and multiple infection cases (14.3%).

Genetic variations of HPV16 and 18 E6 gene regions
Forty-eight HPV16 and 48 HPV18 isolates of the 96 previously screened cases were sequenced across the E6 gene (nt: 83-559 and nt: 105-581) and compared with the corresponding HPV16 and HPV18 E6 reference sequences of each lineage and sublineage (Additional file 1: Tables S2 and S3) Additional file 1: Figs.S2-S7 present DNA Sanger sequencing chromatograms showing sequence polymorphisms throughout the E6 gene region of HPV16 and 18.The sublineage analysis of HPV16 isolates could not distinguish between D1 and D4 based on the HPV16 E6 gene sequences.Therefore, 22 HPV16 isolates were further analyzed by PCR assay for the amplification of HPV16 LCR and subsequent Sanger sequencing of the PCR products.Since C7781T SNP was not detected in any of these isolates, they were all sorted into sublineage D1.Overall, 3 lineages (A, D, and C) and 6 sublineages  The numbers refer to the positions of the nucleotides according to the reference sequence (GenBank accession number K02718).Nucleotide positions in E6 are presented at the top of the table according to the reference sequence.Nucleotide changes are shown by the corresponding letters.Dashes indicate positions at which no variation was found.Amino acid sequence variations are shown at the bottom; amino acid changes whose codons contain more than one nucleotide replacement are marked with /.Asterisks represent the new SNPs detected in the studied HPV16 isolates ADC adenocarcinoma/adenosquamous carcinoma, CIN cervical intraepithelial neoplasia, LSIL low-grade squamous intraepithelial lesion, HSIL high-grade squamous intraepithelial lesion, SCC squamous cell carcinoma, HPV human papillomavirus

Discussion
This is the first investigation of the presence of HPV types in a large-scale screen of liquid-based cytology samples from Iranian population in the southwest.The study population was comprised of gynecological outpatients including symptomatic and asymptomatic women.The vast majority of Iranian published data include FFPE tissue specimens in hospitals and research centers.However, the current study focused on ThinPrep cytology samples from patients in Shiraz, the capital city of Fars Province.In this study, the overall HPV-positive rate was Fig. 3 continued 62.19%, a rate similar to but slightly higher than that in a recent cross-sectional retrospective study (52.25%) by Rezaee Azhar et al. on female outpatients referred to the medical laboratories of Tehran Metropolitan, Iran [33].Previously, in the largest Iranian study including 10,266 samples collected from 31 Iranian provinces, Mobini Kesheh et al. found 45.9% (n = 8351) of women positive for HPV DNA [20].Reports from different parts of the world indicate an overall HPV prevalence ranging from 9.9 to 49.1% [38,39].Furthermore, consistent with previously published reports from Iran, the prevalence of HPV has been reported to range from 5.5 to 9.4% in normal cytology specimens, 61.7 to 65.3% in CIN I-III specimens, and 75.2 to 87% in CC specimens with various study populations and methodologies [25][26][27][28][29][30][31][32].In our study, HPV prevalence in normal, ASC-US, ASC-H, LSIL, and HSIL cervical cytology samples were 48.9%, 93.6%, 100%, 100%, and 100%, respectively which appeared to be higher than not only the overall prevalence previously reported in Iran, but also previous reports from regional countries and most of other countries worldwide [40][41][42][43].On the other hand, the prevalence of HPV infection in the present study is consistent with a study by Schmit et al. [8] reporting the presence of any of 51 investigated genital HPV types in 33.3%, 83.1%, 98.2%, and 100% of normal, ASC-US, LSIL, and HSIL cervical cytology samples, respectively.Taken together, the present study obviously shows that the prevalence of HPV infection in the southwest of Iran is high.This may be due to the fact that this study was cross-sectional and since HPV infections can be transient and cleared up by the immune system, the prevalence of HPV may accordingly change over time.Furthermore, there is evidence that the distribution of HPV types varies by region and ethnicity [44,45] which might be another explanation for the current finding.In addition, many of the differences observed in the prevalence of HPV infections among different studies can be attributable to the methodological differences in the PCR-based assays used including the size of the PCR product, primer sets, reaction conditions, the efficiency of the polymerase enzyme, the potential of the HPV DNA spectrum amplified to detect multiple types, and even the type (frozen/FFPE/cytology specimens) and quality of the clinical specimens, causing variations in the specificity and sensitivity of the assays [46].Herein, we have used methodological assays that are among the few ones capable of detecting 40 HPV types including HR-HPVs, pHR-HPVs, and LR-HPVs, and are also more sensitive than those used in previous studies in the region, allowing the identification of HPV types in specimens with low viral loads.HPV infection is responsible for almost 100% of cervical SCCs.The underestimation of HPV prevalence in most reports is attributable to the technical limitations of the corresponding studies [47].However, the possibility of bias in the estimation of HPV prevalence should be taken into account since to date, Iran has not had an organized national and regular CC screening program and the cervical samples evaluated in this study were collected only from the referred volunteer outpatients attending routine gynecological visits for cytologybased screening which could not have reflected the real virus epidemiology among the general population.
In the present study, the five most prevalent HR-HPV types in ThinPrep cytology samples were HPV16 (17.6%), 56 (12.6%), 66 (12%), 52 (9.9%), and 18 (8.7%)which comprised 61% of the total HR-HPV-positive samples.In addition, HPV6 (45.6%), 81 (9.2%), and 11 (7.4%) were found to be the most dominant LR-HPV types.In the largest study ever conducted in Iran, the most common HPV types reported by Mobini Kesheh et al. were HPV6 (43.3%),HPV16 (16.6%),HPV11 (11.4%), and HPV52 (9.6%) [20].Another study reported the five most common types to be HPV6, HPV11, HPV16, HPV51, and HPV53 [48].In a study by Bitarafan et al. on 12,076 Iranian women, the five most common HR-HPV types were as follows: HPV16 (16.98%),HPV52 (8.8%), HPV18 (7.69%), HPV39 (7.63%), and HPV31 (7.45%) [19].The most similar results to our study reported from Iran were those by Rezaee Azhar et al. revealing HPV16 (12%), 66 (7%), 18 (6%), 31 (5%), and 52 (5%) as the five most prevalent HR-HPV types and HPV6 (32%) and 11 (6%) as the most prevalent LR-HPV types [33].Interestingly, we found HPV81, a rarely reported type from Iran, to be the second most prevalent LR-HPV type and detected HPV90 and 87, first reported types in Iran, in women Unlike single infections, multiple HPV infections have been reported to be associated with increased risk of high-grade lesions and cancer [52].Therefore, investigating the prevalence and patterns of multiple HPV infections can provide a better understanding of their role in carcinogenesis and the prognosis of patients with persistent infection.A study by Lee et al. has reported an association between multiple HPV infections and an increased risk of CC [53].Further, Schmitt et al. have reported that HPV co-infection lengthens the course of infection [54].In line with previously reported studies [41,55], patients with cytological findings of ASC-US, ASC-H, LSIL, and HSIL showed higher multiple HR-HPV-positive rates (89.1%)than women with normal cytological results (66.8%).Cytological findings have shown high rates of multiple HR-HPV infections in low-grade as well as high-grade lesions, suggesting that multiple HR-HPV infections are associated with all stages of cervical lesions.These results are consistent with those of other cross-sectional and prospective studies [55][56][57].
Data on HPV variants are of value in HPV diagnosis, developing vaccines, and therapeutic approaches to control virus-induced pathological damage.The oncogenicity of different HPV variants may vary among different populations with distinct distribution of human leukocyte antigens (HLA) alleles [58,59].To our knowledge, this is the first study characterizing genetic variations in the E6 gene region of HPV16 and 18 variants simultaneously, in normal, LSIL, HSIL, and CC specimens from women living in the southwest of Iran.Given the critical role of E6 gene in cell immortalization and malignancy, it was selected for the classification of the intra-typic HPV16 and HPV18 variants.In the current work, D1 followed by A4 sublineages were found to be the major HPV16 variants which is in line with previous studies from other parts of Iran [21,24].Moreover, we were able to identify A1, A2, and C1 sublineages which were not previously detected simultaneously in either of the two previous studies from Iran.We also identified 10 new nucleotide substitutions in the sequence of HPV16 E6 gene which were not previously reported in the country and accordingly submitted to GenBank.When aligned with E6 gene sequences available in the database, we noticed that all of the detected substitutions were previously reported by other investigators especially from Asia.
There is strong evidence that HPV16 lineage D is associated with CIN3 + with threefold higher risk than lineage A. It is also associated with a higher risk of persistent infection, invasive and glandular high-grade lesions, and CC development than lineage A [60,61].In addition, HPV16 lineage D infection results in higher rates of genomic integration compared to other HPV16 lineages [62].Due to the high prevalence of HPV16 lineage D variants in the Iranian papulation, it seems that they are most likely at high risk of cancer development and progression which necessitates an immediate action for HPV vaccination.With regard to the distribution of HPV18 sublineages, A4 was the most frequent sublineage detected in the current study.This finding is consistent with the results of two previous studies which also reported HPV18 A4 sublineage to be the predominant variant in Iran [22,23].However, there is a single study from Iran reporting A3 as the most prevalent sublineage in the country [21].In general, our finding is in line with the global distribution pattern of HPV18 variants with a predominance of the A lineage in most parts of the world except sub-Saharan Africa.More specifically, A3 and A4 sublineages strongly predominate in South/ Central Asia, northern Africa, Europe, and South/Central America [17].While the majority of A5 isolates have been detected in Africa [17], this sublineage was also detected in our study and previously reported by another study in Iran [23] and a study in Saudi Arabia [63] which shares a maritime border with Iran.Previous studies have suggested that the distribution of HPV18 variants is different between ADC and SCC cases [64,65].However, due to the small number of HPV18-positive ADC cases included in our study, no significant difference in HPV18 variant distribution was observed between ADC and SCC cases.On the other hand, in a study with a larger sample size including 81 ADC cases, each matched with two SCC cases in terms of country and age, no difference in HPV18 variant distribution, either overall or in any of the regions, was found [17].In our population, eight substitutions were found in HPV18 variants which were not previously reported in the country.Following further examination, we found these mutations previously reported by Korean researchers [18], a finding that supports the geographical distribution of HPV lineages.Previous studies have found that HPV18 lineage B could be associated with a higher risk of CC than lineage A, and a higher risk of persistence and progression [60,66].In contrast, other studies have reported that no significant difference was observed in the risk of pre-invasive lesions among HPV18 lineages (A, B, and C) [17,67].Since we did not detect any non-A lineages of HPV18 in our study, it was not feasible to analyze the relationship between HPV18 lineages and the risk of cervical pre-invasive and invasive lesions.However, our study revealed no statistically significant association between HPV18 A lineage and cervical lesions in Iran.This result may have been affected by the small sample size of HPV18-related cervical lesions in our study.
Generally, the present work had a few limitations, the most important of which was recruiting patients from a single center.In order to confirm the results of this study, multicenter studies are required.Further, since the results from cross-sectional studies may be misleading, longitudinal studies with focus on persistent infections with each specific HPV type need to be performed to further investigate the role of each HPV type in CC development.In addition, the considerable difference observed in the frequency of different HPV types among former studies highlights the need for further investigations to provide additional information on the geographical distribution of HPV types and variants in Iran over time using standard methodological techniques for HPV detection and typing.Such data can help to decide upon the best diagnostic and therapeutic approaches, evaluate the efficiency of currently used vaccines, and develop new generations of them.

Conclusions
To sum up, the present study suggests that the prevalence of HPV infection in women of all age groups with or without premalignant lesions in the southwestern Iran is high.Our data also show that the predominant HPV types in the southwest of Iran may differ from those detected in other parts of the country.Findings from this study illustrate the necessity of initiating HPV vaccination for the general population and developing national cervical screening programs as well as targeted education to the younger population in order to encourage the application of infection control measures.Moreover, the identification of emerging HPV types that are not covered even by the new 9-valent HPV vaccine raises awareness about potentially important HPV variants.Regarding the approach to cervical screening for cancerous and precancerous lesions in the region, it is best to use a combination of thinPrep cytology test and HPV detection assays in order to improve the accuracy of the screening.Finally, accurate data on the geographic distribution of different HPV types and HPV16/18 variants can be beneficial for developing diagnostic probes and targeted HPV vaccines for Iranian populations.

Fig. 1
Fig. 1 Prevalence of HPV types among HPV-positive individuals

Table 3
Variations of HPV16 E6 gene from patients with different grades of cervical lesions

Fig. 3
Fig. 3 Phylogenetic analysis of HPV16 E6 (a) and HPV18 E6 (b) gene regions was conducted using the maximum-likelihood method based on the Kimura 2-parameter model with bootstrap resampling (1000 replicates) by the MEGA 6 package [37].Numbers above the branches indicate the bootstrap values.Ninety-six different nucleotide patterns of studied sequences were indicated by black triangles (GenBank accession numbers OP572427 through OP572474 for HPV16 and OP572475 through OP572522 for HPV18).The accession number of reference sequences of each sublineage used for phylogenetic analysis in this study was indicated by white triangles

Table 1
Distribution of HPV infection, HPV risk groups, and single/multiple HPV infections among individuals with different cervical cytology grades Fig. 2 Distribution of high-risk HPV types (a), possible high-risk HPV types (b), and low-risk HPV types (c) in HPV-positive groups of normal, ASC-US, ASC-H, LSIL, and HSIL cervical cytology

Table 2
Distribution of different cervical cytology diagnoses, HPV risk groups, and single/multiple HPV infections in different age groups

Table 4
Variations of HPV18 E6 gene from patients with different grades of cervical lesions

lesion Type of variant 104 149 153 232 287 317 377 382 485 549 554 Accession number Reference
).Nucleotide positions in E6 are presented at the top of the table according to the reference sequence.Nucleotide changes are shown by the corresponding letters.Dashes indicate positions at which no variation was found.Amino acid sequence variations are shown at the bottom; amino acid changes whose codons contain more than one nucleotide replacement are marked with /.Asterisks represent the new SNPs detected in the studied HPV18 isolates ADC adenocarcinoma/adenosquamous carcinoma, CIN cervical intraepithelial neoplasia, LSIL low-grade squamous intraepithelial lesion, HSIL high-grade squamous intraepithelial lesion, SCC squamous cell carcinoma, HPV human papillomavirus

Table 5
Distribution of HPV16 and HPV18 variants in different cervical cytology grades