Skip to main content

Association of activity-based food environment index with obesity-related cancer mortality in the US

Abstract

Background

Obesity and obesity-related cancers contribute to rising healthcare costs and declining life expectancy in the US and improving diet quality plays a crucial role in reversing such trends. Existing studies on the relationship between healthy food access and obesity-related cancer mortality present mixed findings, whereas food procurement activities are largely overlooked. The paper aims to construct a novel food environment index based on residents’ food retailer visits, and then compare it with the location-based food environment index regarding the strength of associations with obesity-related cancer mortality rates.

Methods

This cross-sectional ecologic study used business location data from InfoGroup and aggregated GPS-based food retailer visit data from SafeGraph in 2018–2019, and mortality data from the Centers for Disease Control and Prevention in 2015–2020. A total of 2925 counties or equivalents with complete information were included. Activity-based index was calculated as the percentage of visits to healthy food retailers out of total visits to all qualified food retailers for residents in each county. Location-based index was calculated as the percentage of healthy food retailers out of all qualified food retailers in each county. The main outcome is age-adjusted obesity-related cancer (13 types of cancer based on evidence from the International Agency for Research on Cancer) mortality rates, which were calculated for each county and counties were further categorized into high- and low-risk (≥ 60.2 and < 60.2 cases per 100,000 population) areas. Linear, non-linear, logistic, and spatial regression analyses were performed to examine the association between each food environment index and obesity-related cancer mortality rates.

Results

The activity-based index demonstrated significant negative association with the 2015–2020 obesity-related cancer mortality rates (coefficient [95% CI]: − 0.980 [− 1.385, − 0.575], P < 0.001), and each standard deviation increase in the activity-based index was associated with an 18% decrease in the odds of being in a high-risk area (odds ratio [95% CI]: 0.821 [0.749, 0.900], P < 0.001), while the location-based index showed much weaker and non-significant effects.

Conclusions

Our findings suggest that health policies and initiatives that combat obesity and obesity-related cancers should consider incorporating food retailer visits into policy formation.

Peer Review reports

Background

Obesity and obesity-related cancers are prevalent and are major drivers of rising healthcare costs, diminished health-related quality of life, and the recent decline in life expectancy in the US [1]. The most recent data shows that 41.9% and 73.6% of the adults aged 20 years and older in the US are obese and overweight [2, 3]. It is estimated that about 5% of new cancer cases in men and 10% in women are attributable to excess body weight every year [4], and the International Agency for Research on Cancer (IARC) has linked excess adiposity to 13 types of cancers, including esophageal cancer, gastric cardia cancer, colorectal cancer, liver cancer, gallbladder cancer, pancreatic cancer, laryngeal cancer, postmenopausal breast cancer, endometrial cancer, ovarian cancer, kidney cancer, thyroid cancer, and multiple myeloma [5].

While it is well recognized that diet quality is associated with obesity and obesity-related cancer incidence and mortality [6,7,8,9,10,11,12], the environmental determinants of diet quality on the individual level or the community level remain far from conclusive. It is widely recognized that a major environmental determinant is access to healthy/unhealthy food provisioning, which is usually measured by geographical models of spatial access (e.g., proximity, density, and variety) to relative health-promoting retailers (e.g., grocery stores) or health-damaging ones (e.g., fast-food restaurants) in one’s residential neighborhood [13]. Because of the presumed link between food environments and diet quality, many national food and health initiatives have incorporated food environment measures, including the United States Department of Agriculture (USDA) Food Access Research Atlas [14] and the Centers for Disease Control and Prevention (CDC) modified retail food environment index (mRFEI) [15], as evidence in designing policy interventions to improve diet-related health outcomes [16].

However, past literature presents mixed findings on the connection between such food environment measures and obesity or obesity-related cancers. For example, some studies identified a positive relationship between access to healthy/unhealthy foods and diet quality, obesity and obesity-related cancers [17,18,19,20,21,22], but others found negative [23], null [24], and mixed relationships [25,26,27,28]. While many factors could contribute to these inconsistencies, one important but often overlooked aspect is food-related human activity, such as food retailer visits [29]. Specifically, the underlying assumption of establishing these place-health connections is that food consumers are exposed to all food retailers within a predefined administrative unit (e.g., city, county, or census tract) and can alter their health behaviors, including their diets, due to such exclusive exposure. However, in practice, consumers often engage in food procurement activities that occur at multiple locations or span over multiple units [30], and their food exposure and behaviors may not accurately reflect the foodscape surrounding their residence due to various behavioral uncertainties, such as food culture, health education, and food perceptions [31,32,33]. For example, recent studies have shown that the majority of the food retailer visits occurred outside of residents’ immediate neighborhoods, and therefore, their patterns of food retailer visit differed significantly from those defined by their residential food environment [33, 34].

Several previous studies have linked food retailer visits to diet quality and obesity, mostly on the local scale [35,36,37,38,39,40]. To our knowledge, there is no study examining the association between food retailer visits and obesity-related cancer mortality on the national level. To fill the gap, this study leverages a large-scale Global Positioning System (GPS)-based human mobility dataset covering over 94 million aggregated visit records to roughly 359,000 food retailers across the US for 2 years. In this study, we have examined a novel index based on county-level food retailer visits and compared it with the widely used location-based food environment index in terms of the strength of associations with obesity-related cancer mortality rates.

Methods

Population and scale

This observational, cross-sectional study included all counties in 50 states and the District of Columbia, drawing on county-level obesity-related cancer mortality rates (2015–2020) from the CDC [41]. County-level covariates such as demographic and economic data were sourced from publicly available data sets (Additional file 1: Table S2). We constructed county-level location-based and activity-based indices of the retail food environment (RFE) using InfoGroup Historical Data and SafeGraph’s Core Places and Patterns datasets. Our final, main analytical sample included 2925 counties with complete information on RFE measures, obesity-related cancer mortality rates, and other demographic and economic variables. All data used in this study were de-identified and publicly available. We followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline.

Measures

Outcome

The main outcome was the 5-year average county-level obesity-related cancer mortality rates 2015–2020. Following previous literature [5, 18], we used the age-adjusted mortality rate (per 100,000 population) from the CDC that consisted of 13 obesity-related cancer types (see Additional file 1: Table S1). As a robustness check, we also considered the mortality rate for each year separately.

Retail food environment (RFE) measures

We constructed two county-level retail food environment (RFE) measures. First, following a similar approach to the modified retail food environment index (mRFEI) [15], we constructed a location-based index with up-to-date data (2018–2019) by calculating the percentage of healthy food retailers out of all qualified food retailers (N = 471,725). This update is necessary because the existing mRFEI is based on business listings of 2008–2009. The location-based index defines healthy and less healthy food retailers based on the 2012 North American Industry Classification System (NAICS) codes. Healthy food retailers include supermarkets and larger grocery stores (NAICS 445110), warehouse clubs (NAICS 452910), and fruit and vegetable markets (NAICS 445230). Less healthy food retailers include limited-service restaurants (NAICS 722513) and convenience stores (NAICS 445120). We also complemented other limited-service restaurants by including food carry-out (SIC code 5812–06) and pizza restaurants (SIC code 5812–22). The data was sourced from InfoGroup Historical Data (2018–2019) [42], which provides a comprehensive list of business establishments in the US. The number of included food establishments under each NAICS category aligned well with the total number of businesses listed in the 2021 NAICS association statistics [34].

Second, we constructed a county-level activity-based index based on residents’ visits to food retailers in 2018 and 2019 using SafeGraph’s Core Places and Patterns datasets [43]. In contrast to the location-based index, the activity-based index estimates the percentage of visits to healthy food retailers out of total visits to both healthy and less healthy food retailers for people living in the same county. The definitions of healthy and less healthy food retailers follow the ones used in the location-based index. The visits were based on anonymized GPS-tracking data aggregated from approximately 10% of all GPS-enabled mobile devices in the US. SafeGraph determined a device’s “home” by analyzing nighttime data (6:00 PM to 7:00 AM) over 6 weeks and assigning it to a Geohash-7 grid (153 × 153 m), which was then mapped to census block groups, census tracts, and counties. This index was formulated using 94,256,870 aggregated visit records to 359,365 food retailers in 2018 and 2019 across the US, where each record indicated the destination store, origin (the visitor’s home census tract), and the number of visits in each year. This activity-based index was previously validated, and additional details about this activity-based index can be found in our previous work [34]. Both the location-based index and activity-based index are expressed as proportions in the subsequent analyses (ranging 0–1, where smaller number indicates a less healthy food environment).

Covariates

To capture relevant demographic, economic, and social characteristics of different counties, we considered a series of covariates that have been widely recognized in the literature as influencing obesity-related cancer mortality [18, 43,44,45,46,47], including racial and ethnic composition, percentage of senior population (aged 65 +), median household income, poverty rate, urbanity, education level, and food desert status. In Additional file 1: Table S2, we provide details of how these variables were measured and the source of the data. The social vulnerability index (SVI) data from the US Centers for Disease Control and Prevention [48] were also extracted as additional covariates as validated measures of community vulnerability to natural or human-caused stressors [49,50,51]. It uses 16 variables from the 5-year American Community Survey (ACS) to identify communities that may require support before, during, or after disasters. These variables are grouped into four themes: socioeconomic status, household composition and disability, minority status and language, and housing and transportation. The percentile ranking of each theme for each county (ranging from 0 to 1, whereas 1 means the most socially vulnerable) was included as additional covariates for further analyses.

Statistical analysis

We conducted descriptive analyses, comparing two RFE measures and a range of demographic, economic, and social characteristics, stratified by whether the obesity-related cancer mortality rates were above or below the national median. Statistical comparisons were made using the Wilcoxon rank sum tests. The spatial distribution of the two RFE measures was also visualized to explore potential differences in the patterns they capture.

To investigate the associations between each RFE measure and obesity-related cancer mortality, we first examined the bivariate relationship using Pearson’s correlation to assess the linear trends. Next, we applied multivariable linear regression models, adjusting for demographic, economic, and social characteristics of each county. Furthermore, we dichotomized the obesity-related cancer mortality based on the national median (i.e., high-risk areas vs. low-risk areas), and conducted binary logistic regression analyses to assess its association with each RFE measure in separate analyses (with the same set of covariates as above). Both RFE measures were standardized before all regression analyses so that their coefficients can be compared.

We employed a series of additional analyses as a robustness check on our findings. First, we conducted the analyses using both the 5-year average mortality rate and the annual mortality rates from 2015 to 2020 as the outcome. We also assessed how results may change by adding SVIs as additional control variables to adjust for community-level social, economic, and demographic vulnerabilities. While the main model already includes a series of demographic, economic, and social characteristics of each county that substantially overlap with the SVI—they are not identical. Including SVI in the sensitivity analyses allowed us to assess the robustness of our findings while avoiding potential over-adjustment in the main model. In addition, we accounted for possible spatial autocorrelations in a series of spatial regression models. Generalized additive models (GAMs) were also employed to capture potential nonlinear effects between the obesity-related cancer mortality rates and each RFE measure. Lastly, we performed exploratory analyses to investigate potential heterogeneity in the associations between each RFE measure and obesity-related cancer mortality by adding interaction terms between each RFE measure and covariates, assessing whether the association between RFE and cancer mortality rates varies across different community contexts. For the covariates with the strongest interaction effects, we further performed stratified analyses (split at the median of the covariate) using separate multivariable linear regression models with the same specifications as the main analysis. All statistical tests were two-sided, with a significance level set at 5%. Statistical analyses were performed using R (version 4.4.1, R Foundation for Statistical Computing).

Results

In Table 1, we compare the two RFE measures between counties with obesity-related cancer mortality rates above and below the national median (60.2 per 100,000 population). There was no statistically significant difference in the location-based index (i.e., the new mRFEI) between the two groups (median [IQR]: 0.21 [0.16, 0.29] for low-risk counties and 0.21 [0.16, 0.30] for high-risk counties, P = 0.8). However, counties with below-median mortality rates had a significantly higher activity-based index compared to those with above-median rates (low-risk counties: 0.25 [0.20, 0.31]; high-risk counties: 0.22 [0.17, 0.26], P < 0.001).

Table 1 RFE measures, demographic, economic, and social characteristics for counties with above (high-risk) and below (low-risk) national median obesity-related cancer mortality rates

Additionally, significant differences were observed in other demographic, economic, and social characteristics between the high-risk and low-risk counties (Table 1). High-risk counties had higher proportions of African American residents (low-risk: 2% [1%, 9%], high-risk: 3% [1%, 15%]; P < 0.001), higher poverty rates (low-risk: 13.5% [10.1%, 17.2%], high-risk: 16.3% [12.7%, 20.8%], P < 0.001), lower median household income (low-risk: 53,306 [45,369, 62,497], high-risk: 46,610 [40,383, 53,354], P < 0.001), higher proportions of residents without a high school diploma (low-risk: 10.9% [7.8%, 15.4%], high-risk: 13.6% [10%, 18.7%], P < 0.001), and higher food desert prevalence (low-risk: 0.19 [0.00, 0.35]; high-risk: 0.25 [0.06, 0.41], P < 0.001).

Figure 1 depicts the spatial distributions of the two RFE measures and Fig. 2 shows the scatterplots of each RFE measure against the 5-year average obesity-related cancer mortality rates, with a fitted linear trend line. The correlation between the location-based index and the mortality rate is not statistically significant (correlation = 0.034, P = 0.067); however, there is a significant negative correlation between the activity-based index and mortality rate (correlation = − 0.197, P < 0.001), suggesting more visits to healthy food retailers are associated with a lower mortality rate.

Fig. 1
figure 1

Spatial distribution of a location-based index, b activity-based index, and c the obesity-related cancer mortality rates (2015–2020)

Fig. 2
figure 2

Scatterplots between standardized a location-based index and b activity-based index and 5-year average county-level obesity-related cancer mortality rates 2015–2020. Note. RFE measures are standardized in the scatterplots to make the two plots more comparable

Figure 3 presents the results from the multivariable analyses of the two RFE measures, controlling for demographic, economic, and social characteristics. Both RFE measures are standardized so that the results can be compared. The patterns align with the bivariate correlations: the activity-based index consistently negatively associates with the mortality rate across both the 5-year average rate and the annual mortality rate (coefficient [95% CI]; 2015–2020: − 0.980 [− 1.385, − 0.575], P < 0.001; see Additional file 1: Table S3 for more details). In contrast, the location-based index shows minimal effect, with the 5-year average approaching significance (− 0.472 [− 0.961, 0.017], P = 0.058). Figure 4 presents the binary logistic regression results using dichotomized obesity-related cancer mortality rates (high- vs. low-risk counties) as the outcome and shows a similar pattern: the location-based index is significant only in 2016, while the activity-based index remains consistently significant across most years, except 2015 and 2017 (see Additional file 1: Table S4 for detailed results). Notably, one standard deviation increase in the activity-based index is associated with about 18% decrease in the odds of being a high-risk area (odds ratio [95% CI] for 5-year average: 0.821 [0.749, 0.900], P < 0.001), while one standard deviation increase in the location-based index is only associated with about 10% decrease in the odds of being a high-risk area (odds ratio [95% CI] for 5-year average: 0.903 [0.811, 1.006], P = 0.064).

Fig. 3
figure 3

Forest plot of multivariable linear regression models predicting obesity-related cancer mortality rates using RFE measures (5-year average and annual mortality rates 2015–2020). Note. The forest plots present regression coefficients and 95% confidence intervals. RFE measures are standardized. All models are adjusted for the following control variables: population density, percentage of White, Black, and Hispanic populations, percentage of seniors, percentage of urban and suburban areas, percentage of the population living in poverty, median household income, percentage without a high school diploma, and the presence of food deserts. Detailed information on the construction of these variables is provided in Additional file 1: Table S2. Numerical results for these forest plots are available in Additional file 1: Table S3

Fig. 4
figure 4

Forest plot of multivariable binary logistic regression models predicting obesity-related cancer mortality rates using RFE measures (5-year average and annual mortality rates 2015–2020). Note. The obesity-related cancer mortality was dichotomized at the national median. The national medians for the 5-year average, 2015, 2016, 2017, 2018, 2019, and 2020 are 60.2, 62.8, 61.1, 60.5, 60.1, 59.65, and 60.2 per 100,000 population, respectively. The forest plots present odds ratios and corresponding 95% confidence intervals. RFE measures are standardized. All models are adjusted for the following control variables: population density, percentage of White, Black, and Hispanic populations, percentage of seniors, percentage of urban and suburban areas, percentage of the population living in poverty, median household income, percentage without a high school diploma, and the presence of food deserts. Detailed information on the construction of these variables is provided in Additional file 1: Table S2. Numerical results for these forest plots are available in Additional file 1: Table S4

Findings from the sensitivity analysis are presented in Additional file 1. Additional file 1: Tables S5 and S6 show results with SVIs added as additional covariates, and Additional file 1: Table S7 provides results from spatial regression models to account for the effect of spatial autocorrelation. Across all model specifications, a higher activity-based index remains consistently associated with a lower mortality rate (or lower odds of being a high-risk area), while the association between the location-based index and mortality rates is much weaker, if present at all. Finally, there are no significant non-linear relationships between each of the two RFE measures and the mortality rate, meaning that these non-linear effects do not distort the main results (Additional file 1: Fig. S1).

The exploratory analyses for heterogeneous associations are summarized in Additional file 1: Table S8 (moderation analyses) and Additional file 1: Table S9 (stratified results). Given the exploratory nature of these analyses, we highlight findings with the strongest statistical significance (P ≤ 0.001). For the location-based index, its association with obesity-related cancer mortality rates appears stronger in counties characterized by higher proportions of suburban areas (P < 0.001 for the interaction term) and minority populations (P = 0.001). Stratified analyses further revealed that the location-based index significantly predicted obesity-related cancer mortality only in communities with above-median suburban levels (− 1.53 [− 2.22, − 0.847], P < 0.001) and minority populations (− 1.34 [− 2.07, − 0.605], P < 0.001), while no significant association was observed in the below-median groups. In contrast, the activity-based index shows stronger associations with cancer mortality rates in counties with a higher proportion of Hispanic population (P = 0.001 for the interaction term) or lower vulnerability, as indicated by lower scores on SVI theme 1 (socioeconomic) (P < 0.001) and theme 2 (household composition and disability) (P < 0.001). Stratified analyses further indicated that the activity-based index significantly predicted mortality only in communities with above-median Hispanic population proportions (− 0.973 [− 1.55, − 0.395], P < 0.001) and in those with below-median vulnerability, as reflected by SVI theme 1 (− 1.41 [− 1.96, − 0.852], P < 0.001) and SVI theme 2 (− 1.33 [− 1.87, − 0.783], P < 0.001).

Discussion

Our study contributes valuable insights into new approaches to measuring the food environment and examining their relationships with obesity-related cancer outcomes. By utilizing a county-level index derived from food retailer visits in the US, we demonstrate that incorporating human activities in the retail food environment can better predict obesity-related cancer mortality compared to using the location-based food environment index. Across models, we found that the effect of the activity-based index is about twice as strong as that of the location-based index in terms of the strength of association with obesity-related cancer mortality rates. This result aligns with past literature regarding the limitations and assumptions underlying location-based indices, which often fail to capture the complexities of the food environment. These complexities include not only the physical availability of food but also factors such as consumer behavior, mobility patterns, and social and cultural influences on food choices, all of which play a significant role in diet-related health outcomes [13, 32, 52].

The activity-based index offers a more nuanced perspective by capturing the frequency and patterns of visits to food retailers, which reflect the behaviors and choices of local residents. While visits do not necessarily equate to food procurement, they provide valuable insight into residents’ engagement with the local food environment that shapes dietary habits. This finding aligns with contemporary theories that conceptualize the local food environment as a dynamic, complex, multidimensional system [13, 53,54,55]. This perspective goes beyond the mere availability of food outlets and can incorporate non-spatial factors, such as the quality and price of food offerings and their alignment with the preferences and needs of local residents [56]. In contrast, location-based indices rely on assumptions about food exposure within a defined geographic area and overlook food behaviors like shopping at stores outside of local boundaries or selecting culturally relevant food sources. By integrating the behavioral component in the formation of the food environment index, the study captures critical human–environment interactions that influence long-term health outcomes, such as the obesity-related cancers as discussed in the paper, which are driven by a multitude of factors such as food availability, price, quality, store services, and marketing, as well as personal factors such as financial resources, environmental perceptions, transportation availability, and nutrition education [53, 56].

Our findings support the robust association between the activity-based index and obesity-related cancer mortality rates. This finding contributes to the ongoing debate on the relationship between the food environment and obesity-related health outcomes, where existing findings remain mixed [17,18,19,20,21,22,23,24,25,26,27,28]. It is also consistent with a previous study demonstrating that the activity-based index is a better predictor of the prevalence of cardiometabolic diseases than the location-based index [34]. Importantly, our findings remain significant even after adjusting for a wide range of demographic, economic, and social factors, as well as SVI scores and the food desert measure. This consistent association highlights the power of the activity-based index to capture behavioral dimensions not fully reflected in traditional food environment measures [32, 52]. However, we acknowledge that visits to food retailers are an imperfect proxy for food procurement activities, as they do not capture the types of food purchased or consumed. Nevertheless, visits provide valuable insights into residents’ interactions with the food environment. Store visits reflect key behavioral aspects of food exposure and can shape dietary patterns and health outcomes over the long term. The significant negative association we found between the activity-based index and obesity-related cancer mortality further supports the notion that where and how individuals engage with the food environment matters.

These findings have important policy implications. In recent decades, many policies aimed at improving diet-related health outcomes have focused on improving food access in areas deprived of healthy food stores (e.g., opening a healthy food retailer in a food desert). However, a systematic review revealed that the majority of these geographic access interventions showed no effects in improving local diets [57]. Our findings offer an alternative explanation for this policy deficiency: ameliorating areas of low food access does not necessarily enhance the quality of food procurement for local residents, as they may shop elsewhere. The findings further call for the need for future policy initiatives to go beyond geographic-based measures (e.g., USDA “low-income, low-access areas”) [14] and consider behavioral factors, such as the frequency and nature of visits to food outlets, which more accurately reflect the local food environment’s impact on diets. Integrating food retailer visits into evidence-based interventions could provide a more comprehensive approach to promoting healthy food consumption.

Furthermore, our exploratory analyses showed that the association between the activity-based index and obesity-related cancer mortality rates may vary by community context. Specifically, the stronger predictive power of the activity-based index in communities with higher proportions of Hispanic residents and higher socioeconomic status (SES) may reflect its ability to capture behavior-driven interactions with the food environment, in contrast to the location-based index. In Hispanic communities, traditional dietary practices and preferences, such as shopping at culturally specific markets or purchasing ethnic food, may lead to more procurement of healthy foods and better diet quality in Hispanic communities, aligning with the “Hispanic Paradox” observed in previous research [58,59,60,61], which suggests that cultural factors can moderate the effects of food access. Similarly, higher SES residents are more likely to have the health awareness to actively seek out healthier food retailers and the financial means to purchase healthy foods [55, 62, 63]. These findings highlight the new activity-based index’s potential in capturing nuanced, behavior-driven dynamics in the food environment but also point to its complexities. Future research should further investigate how sociodemographic, cultural, and economic factors shape these interactions to elucidate how humans interact with the food environment.

Strengths and limitations

To the best of our knowledge, our findings are among the first to investigate the activity-based drivers of obesity-related cancers on a broad scale. Our investigation is based on valuable GPS-based longitudinal data (SafeGraph) on food visits across the US. Our findings are robust given all the sociodemographic and neighborhood characteristics controlled for analysis, as well as the various sensitivity analyses we conducted. Our study has several limitations: Firstly, being observational in nature, the study cannot infer causal relationships. Secondly, the translation of findings from an aggregate level to an individual level carries the risk of ecological fallacy, as the aggregated mobility data at the county level cannot capture unique food activity patterns at smaller geographic scales, such as blocks, households, and individuals. Another limitation is that the activity-based index focuses on visits to food outlets, which do not necessarily equate to actual food procurement or consumption. Future studies are needed to further explore how visits relate to other aspects of food behavior, such as purchasing decisions and dietary intake. Lastly, although SafeGraph sampling reflects a range of sociodemographic characteristics and aligns well with the census population, recent research suggests that specific groups, including Hispanic populations, low-income households, and individuals with lower educational attainment, may be underrepresented in this dataset [64]. This limitation is partly due to the reliance on location-based tracking systems, such as GPS data, which are subject to inherent biases in data collection. Factors like privacy concerns, differences in smartphone ownership and usage, and variability in app-based participation can result in incomplete or skewed representations of certain populations. Additionally, access to location-based data can be restricted by proprietary barriers, potentially limiting its availability for broader research applications. These technological and methodological constraints underscore the need for further studies to critically evaluate the biases in mobility data and to develop strategies for mitigating their impact. Addressing these issues is essential to improving the robustness and generalizability of activity-based indices in characterizing food environments and strengthening their health implications.

While activity-based food environment indices present a promising avenue for advancing policy frameworks, their application is not without challenges. The reliance on location-based tracking systems for data collection, such as GPS or mobile location services, introduces potential limitations, including incomplete or biased data, privacy concerns, technological disparities, and uneven data availability across geographic and socioeconomic contexts. To address these limitations and ensure broader applicability, future efforts should prioritize enhancing data quality, strengthening data ethics, and fully clarifying all data limitations in usage. Such efforts will be critical for building robust, inclusive datasets that accurately represent diverse populations and enable equitable, evidence-based policy interventions.

Conclusions

In a US nationwide study, we show that a novel activity-based food environment index, representing food retailer visits, had significantly stronger associations with obesity-related cancer mortality rates than that of the location-based food environment index. This new index provides an alternative approach to identifying communities in need of healthy food provisioning. It has the potential to guide more effective policy actions and resource allocations aimed at reducing obesity-related diseases and mortality.

Data availability

The datasets used and analyzed at the county level during the current study are available from the corresponding author on a reasonable request.

Abbreviations

CDC:

Centers for Disease Control and Prevention

GAMs:

Generalized additive models

GPS:

Global Positioning System

IARC:

International Agency for Research on Cancer

mRFEI:

Modified retail food environment index

NAICS:

North American Industry Classification System

RFE:

Retail food environment

SVI:

Social vulnerability index

USDA:

United States Department of Agriculture

References

  1. Fast facts: obesity and cancer. https://stop.publichealth.gwu.edu/sites/g/files/zaxdzs4356/files/2022-06/fast-facts-obesity-and-cancer.pdf.

  2. Stierman B, Afful J, Carroll MD, Chen TC, Davy O, Fink S, et al. National health and nutrition examination survey 2017–March 2020 prepandemic data files—development of files and prevalence estimates for selected health outcomes. https://www.cdc.gov/nchs/data/nhsr/nhsr158-508.pdf.

  3. Fryar CD, Carroll MD, Afful J. Prevalence of overweight, obesity, and severe obesity among adults aged 20 and over: United States, 1960–1962 through 2017–2018. 2020.

  4. Islami F, Goding Sauer A, Gapstur SM, Jemal A. Proportion of cancer cases attributable to excess body weight by US state, 2011–2015. JAMA Oncol. 2019;5:384.

    Article  PubMed  Google Scholar 

  5. Lauby-Secretan B, Scoccianti C, Loomis D, Grosse Y, Bianchini F, Straif K. Body fatness and cancer — viewpoint of the IARC Working Group. N Engl J Med. 2016;375:794–8.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Asghari G, Mirmiran P, Yuzbashian E, Azizi F. A systematic review of diet quality indices in relation to obesity. Br J Nutr. 2017;117:1055–65.

    Article  CAS  PubMed  Google Scholar 

  7. Wolongevicz DM, Zhu L, Pencina MJ, Kimokoti RW, Newby PK, D’Agostino RB, et al. Diet quality and obesity in women: the Framingham nutrition studies. Br J Nutr. 2010;103:1223–9.

    Article  CAS  PubMed  Google Scholar 

  8. Villarini A, Pasanisi P, Raimondi M, Gargano G, Bruno E, Morelli D, et al. Preventing weight gain during adjuvant chemotherapy for breast cancer: a dietary intervention study. Breast Cancer Res Treat. 2012;135:581–9.

    Article  CAS  PubMed  Google Scholar 

  9. Thomson CA, Stopeck AT, Bea JW, Cussler E, Nardi E, Frey G, et al. Changes in body weight and metabolic indexes in overweight breast cancer survivors enrolled in a randomized trial of low-fat vs. reduced carbohydrate diets. Nutr Cancer. 2010;62:1142–52.

    Article  CAS  PubMed  Google Scholar 

  10. Scott E, Daley AJ, Doll H, Woodroofe N, Coleman RE, Mutrie N, et al. Effects of an exercise and hypocaloric healthy eating program on biomarkers associated with long-term prognosis after early-stage breast cancer: a randomized controlled trial. Cancer Causes Control. 2013;24:181–91.

    Article  CAS  PubMed  Google Scholar 

  11. Kenfield SA, DuPre N, Richman EL, Stampfer MJ, Chan JM, Giovannucci EL. Mediterranean diet and prostate cancer risk and mortality in the health professionals follow-up study. Eur Urol. 2014;65:887–94.

    Article  PubMed  Google Scholar 

  12. Kohler LN, Garcia DO, Harris RB, Oren E, Roe DJ, Jacobs ET. Adherence to diet and physical activity cancer prevention guidelines and cancer outcomes: a systematic review. Cancer Epidemiol Biomark Prev. 2016;25:1018–28.

    Article  Google Scholar 

  13. Caspi CE, Sorensen G, Subramanian SV, Kawachi I. The local food environment and diet: a systematic review. Health Place. 2012;18:1172–87.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Economic Research Service U.S. Department of Agriculture. USDA ERS - food access research atlas. 2020. https://www.ers.usda.gov/data/fooddesert/. Accessed 23 Sep 2024.

  15. National Center for Chronic Disease Prevention and Health Promotion (U.S. Division of Nutrition, Physical Activity, and Obesity.). Census tract level state maps of the modified food environment index (mRFEI). 2012.

  16. U.S. Department of Agriculture (USDA). Retailer eligibility - clarification of criterion A and criterion B requirements. Food and Nutrition Service; 2020. https://www.fns.usda.gov/snap/retailer-eligibility-clarification-of-criterion. Accessed 17 Sep 2024.

  17. Fong AJ, Lafaro K, Ituarte PHG, Fong Y. Association of living in urban food deserts with mortality from breast and colorectal cancer. Ann Surg Oncol. 2021;28:1311–9.

    Article  PubMed  Google Scholar 

  18. Bevel MS, Tsai M-H, Parham A, Andrzejak SE, Jones S, Moore JX. Association of food deserts and food swamps with obesity-related cancer mortality in the US. JAMA Oncol. 2023;9:909–16.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Pruchno R, Wilson-Genderson M, Gupta AK. Neighborhood food environment and obesity in community-dwelling older adults: individual and neighborhood effects. Am J Public Health. 2014;104:924–9.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Li F, Harmer P, Cardinal BJ, Bosworth M, Johnson-Shelton D. Obesity and the built environment: does the density of neighborhood fast-food outlets matter? Am J Health Promot. 2009;23:203–9.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Xu Y, Wang F. Built environment and obesity by urbanicity in the U.S. Health and place. 2015;34:19.

    Article  PubMed  Google Scholar 

  22. Moore LV, Diez Roux AV, Nettleton JA, Jacobs DR. Associations of the local food environment with diet quality–a comparison of assessments based on surveys and geographic information systems: the multi-ethnic study of atherosclerosis. Am J Epidemiol. 2008;167:917–24.

    Article  PubMed  Google Scholar 

  23. Mazidi M, Speakman JR. Higher densities of fast-food and full-service restaurants are not associated with obesity prevalence. Am J Clin Nutr. 2017;106:603–13.

    Article  CAS  PubMed  Google Scholar 

  24. Paulitsch RG, Dumith SC. Is food environment associated with body mass index, overweight and obesity? A study with adults and elderly subjects from southern Brazil. Prev Med Rep. 2021;21: 101313.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Reitzel LR, Regan SD, Nguyen N, Cromley EK, Strong LL, Wetter DW, et al. Density and proximity of fast food restaurants and body mass index among African Americans. Am J Public Health. 2014;104:110–6.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Cummins S, Flint E, Matthews SA. New neighborhood grocery store increased awareness of food access but did not alter dietary habits or obesity. Health Aff (Millwood). 2014;33:283–91.

    Article  PubMed  Google Scholar 

  27. Dubowitz T, Ghosh-Dastidar M, Cohen DA, Beckman R, Steiner ED, Hunter GP, et al. Diet and perceptions change with supermarket introduction in a food desert, but not because of supermarket use. Health Aff (Millwood). 2015;34:1858–68.

    Article  PubMed  Google Scholar 

  28. Elbel B, Moran A, Dixon LB, Kiszko K, Cantor J, Abrams C, et al. Assessment of a government-subsidized supermarket in a high-need area on household food availability and children’s dietary intakes. Public Health Nutr. 2015;18:2881–90.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Holsten JE. Obesity and the community food environment: a systematic review. PHN. 2008;1. https://doiorg.publicaciones.saludcastillayleon.es/10.1017/s1368980008002267.

  30. Chen X. Take the edge off: a hybrid geographic food access measure. Appl Geogr. 2017;87:149–59.

    Article  Google Scholar 

  31. Burgoine T, Jones AP, Namenek Brouwer RJ, Benjamin Neelon SE. Associations between BMI and home, school and route environmental exposures estimated using GPS and GIS: do we see evidence of selective daily mobility bias in children? Int J Health Geogr. 2015;14: 8.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Chen X, Kwan M-P. Contextual uncertainties, human mobility, and perceived food environment: the uncertain geographic context problem in food access research. Am J Public Health. 2015;105:1734–7.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Jin A, Chen X, Huang X, Li Z, Caspi CE, Xu R. Selective daily mobility bias in the community food environment: case study of Greater Hartford, Connecticut. Nutrients. 2023;15:404.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Xu R, Huang X, Zhang K, Lyu W, Ghosh D, Li Z, et al. Integrating human activity into food environments can better predict cardiometabolic diseases in the United States. Nat Commun. 2023;14:7326.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Zenk SN, Schulz AJ, Matthews SA, Odoms-Young A, Wilbur J, Wegrzyn L, et al. Activity space environment and dietary and physical activity behaviors: a pilot study. Health Place. 2011;17:1150–61.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Chaix B, Méline J, Duncan S, Merrien C, Karusisi N, Perchoux C, et al. GPS tracking in neighborhood and health studies: a step forward for environmental exposure assessment, a step backward for causal inference? Health Place. 2013;21:46–51.

    Article  PubMed  Google Scholar 

  37. Gustafson A, Christian JW, Lewis S, Moore K, Jilcott S. Food venue choice, consumer food environment, but not food venue availability within daily travel patterns are associated with dietary intake among adults, Lexington Kentucky 2011. Nutr J. 2013;12: 17.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Kestens Y, Lebel A, Chaix B, Clary C, Daniel M, Pampalon R, et al. Association between activity space exposure to food establishments and individual risk of overweight. PLoS One. 2012;7: e41418.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Raskind IG, Kegler MC, Girard AW, Dunlop AL, Kramer MR. An activity space approach to understanding how food access is associated with dietary intake and BMI among urban, low-income African American women. Health Place. 2020;66: 102458.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Horn AL, Bell BM, Bulle Bueno BG, Bahrami M, Bozkaya B, Cui Y, et al. Population mobility data provides meaningful indicators of fast food intake and diet-related diseases in diverse populations. npj Digit Med. 2023;6:1–12.

    Article  Google Scholar 

  41. Centers for Disease Control and Prevention. CDC WONDER. https://wonder.cdc.gov/. Accessed 23 Sep 2024.

  42. Infogroup. Infogroup US historical business data.

  43. Williams DR, Mohammed SA. Discrimination and racial disparities in health: evidence and needed research. J Behav Med. 2009;32:20–47.

    Article  PubMed  Google Scholar 

  44. Ng M, Fleming T, Robinson M, Thomson B, Graetz N, Margono C, et al. Global, regional, and national prevalence of overweight and obesity in children and adults during 1980–2013: a systematic analysis for the global burden of disease study 2013. The Lancet. 2014;384:766–81.

    Article  Google Scholar 

  45. Diez Roux AV. Investigating neighborhood and area effects on health. Am J Public Health. 2001;91:1783–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Walker RE, Keane CR, Burke JG. Disparities and access to healthy food in the United States: a review of food deserts literature. Health Place. 2010;16:876–84.

    Article  PubMed  Google Scholar 

  47. Adler NE, Ostrove JM. Socioeconomic status and health: what we know and what we don’t. Ann N Y Acad Sci. 1999;896:3–15.

    Article  CAS  PubMed  Google Scholar 

  48. Agency for Toxic Substances and Disease Registry. CDC/ATSDR social vulnerability index (CDC/ATSDR SVI). 2024. https://www.atsdr.cdc.gov/placeandhealth/svi/index.html. Accessed 20 Sep 2024.

  49. Bakkensen LA, Fox-Lent C, Read LK, Linkov I. Validating resilience and vulnerability indices in the context of natural disasters. Risk Anal. 2017;37:982–1004.

    Article  PubMed  Google Scholar 

  50. Khazanchi R, Beiter ER, Gondi S, Beckman AL, Bilinski A, Ganguli I. County-level association of social vulnerability with COVID-19 cases and deaths in the USA. J Gen Intern Med. 2020;35:2784–7.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Karaye IM, Horney JA. The impact of social vulnerability on COVID-19 in the U.S.: an analysis of spatially varying relationships. American Journal of Preventive Medicine. 2020;59:317–25.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Kwan M-P. The uncertain geographic context problem. Ann Assoc Am Geogr. 2012;102:958–68.

    Article  Google Scholar 

  53. Turner C, Aggarwal A, Walls H, Herforth A, Drewnowski A, Coates J, et al. Concepts and critical perspectives for food environment research: a global framework with implications for action in low- and middle-income countries. Glob Food Sec. 2018;18:93–101.

    Article  Google Scholar 

  54. Brug J, Kremers SP, van Lenthe F, Ball K, Crawford D. Environmental determinants of healthy eating: in need of theory and evidence. Proc Nutr Soc. 2008;67:307–16.

    Article  PubMed  Google Scholar 

  55. Glanz K, Sallis JF, Saelens BE, Frank LD. Nutrition Environment Measures Survey in stores (NEMS-S): development and evaluation. Am J Prev Med. 2007;32:282–9.

    Article  PubMed  Google Scholar 

  56. Glanz K, Sallis JF, Saelens BE, Frank LD. Healthy nutrition environments: concepts and measures. Am J Health Promot. 2005;19:330–3, ii.

    Article  PubMed  Google Scholar 

  57. Mah CL, Luongo G, Hasdell R, Taylor NGA, Lo BK. A systematic review of the effect of retail food environment interventions on diet and health with a focus on the enabling role of public policies. Curr Nutr Rep. 2019;8:411–28.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Ruiz JM, Steffen P, Smith TB. Hispanic mortality paradox: a systematic review and meta-analysis of the longitudinal literature. Am J Public Health. 2013;103:e52-60.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Lommel LL, Thompson L, Chen J-L, Waters C, Carrico A. Acculturation, inflammation, and self-rated health in Mexican American immigrants. J Immigr Minor Health. 2019;21:1052–60.

    Article  PubMed  Google Scholar 

  60. Teruya SA, Bazargan-Hejazi S. The immigrant and Hispanic paradoxes: a systematic review of their predictions and effects. Hisp J Behav Sci. 2013;35:486–509.

    Article  PubMed  PubMed Central  Google Scholar 

  61. Brewer JVV, Miyasato GS, Gates MA, Curto TM, Hall SA, McKinlay JB. Contributors to self-reported health in a racially and ethnically diverse population: focus on Hispanics. Ann Epidemiol. 2013;23:19–24.

    Article  PubMed  Google Scholar 

  62. Turrell G, Blakely T, Patterson C, Oldenburg B. A multilevel analysis of socioeconomic (small area) differences in household food purchasing behaviour. J Epidemiol Community Health. 2004;58:208–15.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. McCullough ML, Chantaprasopsuk S, Islami F, Rees-Punia E, Um CY, Wang Y, et al. Association of socioeconomic and geographic factors with diet quality in US adults. JAMA Netw Open. 2022;5: e2216406.

    Article  PubMed  PubMed Central  Google Scholar 

  64. Li Z, Ning H, Jing F, Lessani MN. Understanding the bias of mobile location data across spatial scales and over time: a comprehensive analysis of SafeGraph data in the United States. PLoS One. 2024;19: e0294430.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The Infogroup US Historical Business Data used in this study were accessed by author Xukun Xiang during his time at the University of Chicago, and we gratefully acknowledge their role in providing access to this dataset. We also thank Dr. Xiao Huang and Dr. Zhenlong Li for providing access to the SafeGraph data. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding agency or the data provider.

Funding

Open access funding provided by University of Gothenburg. This work was supported by the following grants: Alan R. Bennett public health policy research funding from the College of Liberal Arts and Sciences (CLAS), University of Connecticut; an internal funding for pilot studies addressing US health disparities from the Institute for Collaboration on Health, Intervention, and Policy (InCHIP), University of Connecticut; an internal start-up funding from School of Public Health and Community Medicine, Institute of Medicine, University of Gothenburg, Sweden.

Author information

Authors and Affiliations

Authors

Contributions

QL conceptualized the work, analyzed and interpreted the data, and drafted the manuscript. XC conceptualized the work and substantively revised the manuscript. XX acquired, cleaned, and interpreted the data. WL, CM, and GZ acquired, analyzed, and visualized the data. RX conceptualized the work, interpreted the data, drafted the manuscript, and substantively revised the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Qinyun Lin.

Ethics declarations

Ethics approval and consent to participate

All data used in this study were de-identified and publicly available.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

12916_2025_3967_MOESM1_ESM.docx

Additional file 1: Table S1. ICD-10 codes for identification of obesity-related cancers. Table S2. Covariates (county-level) definitions and data sources. Table S3. Multivariable linear regression models predicting obesity-related cancer mortality rates using standardized RFE measures (5-year average and annual mortality rates from 2015 to 2020). Table S4. Multivariable binary logistic regression models predicting obesity-related cancer mortality rates using standardized RFE measures (5-year average and annual mortality rates from 2015 to 2020). Table S5. Multivariable linear regression models predicting obesity-related cancer mortality rates using standardized RFE measures (5-year average and annual mortality rates from 2015 to 2020), with SVI as additional covariates. Table S6. Multivariable binary logistic regression models predicting obesity-related cancer mortality rates using standardized RFE measures (5-year average and annual mortality rates from 2015 to 2020), with SVI as additional covariates. Table S7. Spatial regression models (spatial lag and spatial error models) predicting obesity-related cancer mortality rates using standardized RFE measures (5-year average and annual mortality rates from 2015 to 2020). Table S8. Moderating effects of community context on the association between obesity-related cancer mortality rates (5-year average rates from 2015 to 2020) and standardized RFE measures using multivariable linear regression models. Table S9. Stratified results between obesity-related cancer mortality rates (5-year average rates from 2015 to 2020) and standardized food environment measures using multivariable linear regression models. Fig. S1. Generalized additive models predicting 5-year average obesity-related cancer mortality rates using each RFE measure.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lin, Q., Chen, X., Xiang, X. et al. Association of activity-based food environment index with obesity-related cancer mortality in the US. BMC Med 23, 167 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12916-025-03967-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12916-025-03967-6

Keywords