Beyond the widespread mortality, it is estimated that more than 700 million individuals have been infected with COVID-19 globally by the end of 2024 [3, 4]. It is now well established that since the declaration of the pandemic in 2020, multiple COVID-19 variant strains emerged, as SARS-CoV-2 has consistently mutated throughout the pandemic [5, 6]. The original virus has been replaced by several variants over the past four years, but five major global variants of concern (VOC) have been identified so far (i.e., Alpha, Beta, Gamma, Delta, and Omicron). Additional variants of interest (VOI) include Epsilon (B.1.427 and B.1.429); Kappa (B.1.617.1); Zeta (P.2); Theta (P.3); Eta (B.1.525); Iota (B.1.526); Lambda (C.37) and Mu (B.1.621) [7, 8]. In the U.S, the variants that caused widespread surges in infection and became dominant variants during these surges were SARS-CoV-2 original, Alpha, Delta, Omicron (and the subvariants of omicron such as BA.2, BA.4, and BA.5) [8-10].
A major problem for survivors of COVID-19 infections now being discussed worldwide is Long-COVID [5, 8, 11-14]. As per the U.S. CDC, this condition can be broadly defined as signs, symptoms, and conditions that continue or develop after acute COVID-19 infection with features of these post-COVID conditions lasting for weeks, months, or even years [11]. The WHO defines Long-COVID more specifically as the continuation or development of new symptoms lasting for at least two months with no other explanation and beginning around three months after the initial SARS-CoV-2 infection [12]. Aiyegbusi and colleagues identified the 10 most common symptoms of Long-COVID as fatigue or myalgia, cough or dyspnea, joint or chest pain, altered smell or taste, headache, or diarrhea [13]. More recently, a comprehensive global review found that among 113 biomarkers from 28 studies, 69.9% (n=79) of the biomarkers were significantly increased, 25.7% (n=29) were decreased, and 4.4% (n=5) required further evaluation in those with Long-COVID [14]. The definitions, diagnostic criteria, and treatment guidelines for Long-COVID are still evolving [13-16]. Simultaneously, many healthcare facility-based studies and reviews have explored the association of Long-COVID symptoms with different variants of COVID-19 [15-18]. However, there is a lack of clarity among these studies on which variants could be associated with Long-COVID. Furthermore, as these are mostly healthcare facility-based studies of observations of individuals reporting Long-COVID symptoms, they are prone to biases (e.g. selection). Population-based studies from individuals experiencing Long-COVID symptoms without being hospitalized or actively being treated by a healthcare provider are needed to understand the broader scope of Long-COVID. Thus, the purpose of our analysis was to evaluate online information-seeking behavior among the general population of Americans on Long-COVID over time. Also, we investigated these Long-COVID-related online searches in proximation with the dominance of different variants of COVID-19 in the U.S. to understand if certain variant surges led to higher interest in the term Long-COVID. The technique used was Infodemiology, which has not yet been used to explore Long-COVID information-related interest among the general public in the U.S. Infodemiology is a technology-driven strategy for the assessment of the distribution of consumer information seeking in electronic mediums such as the Internet. Infodemiology methods explore consumer information-seeking behaviors on the internet to guide surveillance, inform public health practice, and develop public policies. During the COVID-19 pandemic, Infodemiology was used extensively to analyze the public's consumption of information from the Internet to monitor and predict public health issues and trends [6, 19, 20, 21].
Methods
For this investigation, we utilized the application programming interface of Google Health Trends (GHT-API). This is a novel approach to obtaining raw Google searches in a short search session without restrictions on the search volume index. Detailed methodological approaches to accessing data through GHT-API have already been published elsewhere [19-23]. From the Google Trends website, we collected multiple samples of ‘Long Covid + Post Covid’ daily search data in the U.S. from January 1, 2020, to January 10, 2023, and averaged them to obtain better estimates of their true values. The GHT-API provides an estimated probability scale of the search scale of 10 million for readability [20-22]. Data collected were stratified according to COVID-19 variant timelines provided by the CDC [1,9]. We analyzed the data by plotting a line chart to describe the ‘Long Covid’ Google search trends in the U.S. following the COVID-19 variant timelines. A conservative approach (i.e., the Jonckheere-Terpstra test) was used to determine whether there were statistically significant trends in ‘Long Covid’ Google searches over time. Python 3.11.0 was used to obtain GH-API data and SPSS software was used for data processing and analysis. Ethical clearance