* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Validation of a short food frequency questionnaire to assess
Survey
Document related concepts
Transcript
European Journal of Clinical Nutrition (1998) 52, 557±564 ß 1998 Stockton Press. All rights reserved 0954±3007/98 $12.00 http://www.stockton-press.co.uk/ejcn Validation of a short food frequency questionnaire to assess consumption of cereal foods, fruit and vegetables in Chinese Singaporeans AMC Ling, C Horwath and W Parnell Department of Human Nutrition, University of Otago, PO Box 56, Dunedin, New Zealand Objective: To assess the ability of a 16-item food frequency questionnaire (FFQ) to measure consumption of cereal foods, fruit and vegetables in Chinese Singaporeans. Design: Subjects completed the questionnaire twice, at the beginning and end of a six-week period during which they also provided three 24 h diet recalls. Estimates of intake from the questionnaire were compared with those from diet recalls. Subjects: Subjects were recruited from a range of occupational groups through random sampling across divisions in the headquarters of the Singapore Ministry of Health. Of the 81 subjects initially recruited, three failed to complete the diet recalls, one was excluded due to changes in diet, and seven did not return the second questionnaire. Results: Mean difference in food group consumption estimated by the two methods did not differ signi®cantly from zero for fruit (0.00 serving, s.d. 0.54, 95% CI 70.13, 0.12, P 0.95) or vegetables (70.05, s.d. 0.29, 95% CI 70.12, 0.02, P 0.13). For cereal foods, the mean difference was small, but signi®cantly different from zero only in women (70.32 servings, s.d. 0.92, 95% CI 70.59, 70.06, P 0.02). At an individual level, cereal food intake as measured by the FFQ may be 37% below or 59% above the diet recall values; and values for total fruit and vegetables may be half or double the recall values. Among subjects whose intake was classi®ed into the lowest quartiles by diet recalls, 78% and 94% respectively, fell into the lowest two questionnaire quartiles for cereal foods, and total fruit and vegetables. The ability of the questionnaire to predict those having inadequate intake based on recall data was more than 90% for the three food groups. Conclusion: The short questionnaire cannot replace the three-day recalls in intake assessment for individuals, but it could be used to screen for low consumers in intervention programmes, to assess mean food group intake in population groups, and to rank individuals into broad categories of food group intake. Sponsorship: International Life Sciences Institute, South East Asia. Descriptors: cereals; fruit; food frequency questionnaire Introduction The dietary guidelines of most countries emphasize the importance of increasing the consumption of fruit and vegetables and cereal foods. In Singapore, ®ve to six servings (for women) and six to seven servings (for men) of cereal foods, and two servings each of fruit and vegetables are encouraged (Ministry of Health, 1989). It is increasingly recognised that these foods provide not only nutrients, but also a combination of other dietary components that appear to be protective against chronic disease. This has led to recent interest in monitoring population intakes of foods and food groups, not just nutrients (Cleveland et al, 1997; Krebs-Smith et al, 1997). However, short instruments to assess adequacy of food group intake are sparse. Most food frequency questionnaires (FFQs) have been developed and validated as measures of nutrient intake (Jain et al, 1980; Willet et al, 1985; Block et al, 1986; Willett et al, 1987; Pietinen et al, 1988; Hankin et al, 1991; Block et al, 1992; Eck et al, 1996). Correspondence: Ms AMC Ling; Department of Nutrition, Institute of Health, 3 Second Hospital Avenue, Singapore 168937 (after August 1998). Received 30 June 1997; revised 12 April 1998; accepted 17 April 1998 Several studies (Jain et al, 1980; Mullen et al, 1984; Pietinen et al, 1988; Salvini et al, 1989; Hankin et al, 1991; Serdula et al, 1993) have evaluated FFQs at the foodbased level, but with the exception of one (Serdula et al, 1993), the FFQs were relatively long (55±278 items). Serdula et al (1993) evaluated a six-item fruit and vegetable questionnaire for telephone administration in the US Behavioural Risk Factor Surveillance System, and generally found the mean number of daily fruit and vegetable servings to be similar to those estimated by more detailed methods. The authors concluded that the short FFQ is likely to be useful for surveillance purposes. However, Serdula et al (1993) and others investigating validity at the food group level (Jain et al, 1980; Mullen et al, 1984; Pietinen et al, 1988; Hankin et al, 1991) did not evaluate the FFQs in terms of their ability to identify low and high consumers, nor did these studies adequately assess agreement between methods at an individual level. Correlation coef®cients in the range of 0.23±0.91 have been interpreted as indicating moderate agreement. However, Bland & Altman (1986, 1995a) have stressed that correlation coef®cients do not measure the extent to which methods produce the same results, but merely quantify the extent to which they are linearly related. Validation of a short food frequency questionnaire AM Chuan Ling et al 558 In this present study, we developed a short FFQ capable of self-administration as part of a larger study investigating consumption of cereal foods and total fruit and vegetables in Chinese Singaporeans. Both food groups and servings were de®ned as in the Daily Food Guide (Ministry of Health, 1995) to enable assessment of compliance with dietary recommendations. Therefore, foods such as cakes and pastries, french fries and pickled vegetables, or fried fruit fritters were excluded from the respective food groups. However, for consistency with the aims of our larger study, the FFQ assessed only fresh fruit and excluded dried fruits and juices. The suitability of the FFQ was evaluated not only for monitoring purposes, but also for screening for low intakes and ranking individuals into broad categories of intake. We compared estimated food group intake with that measured by three 24 h diet recalls. Between-method agreement was assessed by examination of group mean intakes, mean difference and standard deviation of the differences, classi®cation into quartiles of intake, ability to identify individuals not meeting recommended intakes, and Willett's actual values for surrogate categories (Willett, 1990). Methods Subjects Eighty-one Chinese subjects were recruited from the headquarters of the Ministry of Health through random sampling within divisional status. As there were more female than male employees, the latter were over-sampled. Subjects came from a wide range of occupational groups: from secretaries, clerks, typists, telephone operators, of®ce attendants and drivers to statisticians, ®nance and accounts executives, personnel and public affairs executives, and a few nurses and doctors. Study design Subjects ®lled in the FFQ twice, at the beginning and end of a six-week period. During this time, trained dietitians administered 24 h diet recalls on three occasions. The recalls included two week-days and one week-end day, and were spaced over two to three weeks. The ®rst FFQ was mailed to subjects together with a schedule for the three interviews. Subjects were asked to complete the FFQ and to bring it to the ®rst interview, during which the dietitians reviewed and clari®ed any ambiguity in responses. Interviews were conducted during of®ce hours. The second FFQ was mailed two weeks after completion of all diet recalls and consequently was not reviewed by the dietitians. The food frequency questionnaire (FFQ) Questions and response formats. The 16-item FFQ (available from AMC Ling) was designed to assess usual intake of cereal foods, fruit and vegetables over the previous three months. The FFQ was administered during the ®st two months of 1997. As a variety of fruit and vegetables is readily available throughout the year in Singapore, seasonal variation in intake is not likely to be signi®cant. The aim was to provide estimates in mean daily numbers of servings. The FFQ took an average of ®ve minutes to complete. Subjects were asked to indicate the number of times per day, per week or per month the following types of food were eaten: porridge; plain rice; rice-based dishes (for example chicken rice, fried rice); noodle dishes; breakfast cereals; stir-fried vegetables; meat or seafood dishes with added vegetables; soups with vegetables; vegetable salads; and local vegetable salads (for example rojak and gadogado). For fresh fruit, bread, bread-based items (for example roti prata and chapati) and plain salted biscuits, portion size estimates were requested as pieces or slices. We included black and white photographs of various portion sizes for plain rice and stir-fried vegetables. Five portion sizes were shown as fractions or multiples of amounts which make up one reference serving of rice or vegetables. Photographs were used for plain rice and stirfried vegetables because these are probably the single greatest contributors to the respective food groups, and amounts consumed tend to vary widely. The FFQ was pre-tested among a total of twenty Singapore adults for clarity, ease of completion and interpretation of questions and preferred combinations of photographed portion sizes. None of these subjects were included in the validation study. Special attention was paid to aggregation of foods into single questions. For example, the original question on rice and rice-based items was separated into three questions with porridge preceding more general items like rice and rice-based dishes. Feedback suggested that patterns of consumption differed for these three types of food. Coding. Servings were calculated using as a guide the reference portions in the Daily Food Guide (Ministry of Health, 1995). These were derived with consideration both for nutrient content and quantities of food reported to be used by respondents in a recent food consumption survey (Ministry of Health, 1994). One serving of cereal foods is de®ned as the amount likely to provide the carbohydrateequivalent of half a bowl (100 g) of rice. A serving of fruit refers to units commonly eaten (for example a small orange, a 130 g wedge papaya); and a serving of vegetables is one cup (150 g) of cooked vegetables. Legumes, rather than being included in the vegetable group, are classi®ed as alternatives to meat. For questions on frequency, reported use was converted to daily frequency and multiplied by a serving value. Rice, noodles and vegetables are often eaten as mixed dishes, but there is little information on the contribution of individual components to composite dishes. Therefore, serving values used for several questions on mixed dishes were derived based on our experience, or assessment of typical servings in popular eating outlets for foods commonly eaten away from home. For example, consumption of a meat dish with added vegetables was credited as providing a quarter of the reference vegetable serving; and a rice-based dish was counted as contributing two cereal food servings (namely two half bowls of rice). Amounts reported to be eaten were often fractions of a serving (for example three pieces of fruit per week is equivalent to three-sevenths of a serving per day). A daily average was computed by summing serving estimates across all questions. The reference method 24 h diet recalls. Although diet recording is generally considered the preferred method for validating a FFQ designed to estimate food or nutrient intakes (Willett, 1990), it is probably suboptimal within the context of the local dietary pattern. Families usually share communal dishes, and the practice of portioning out and weighing food would be likely to interfere with usual intake. Moreover, foods are usually eaten as mixed dishes. The need to separately weigh individual ingredients of interest over a period of several days would demand highly-motivated Validation of a short food frequency questionnaire AM Chuan Ling et al subjects. Therefore, non-consecutive 24 h recalls were chosen as the reference method. The FFQ was intended to determined consumption of broad food groups rather than individual foods or nutrients. However, data documenting the number of days required to assess food group intake is lacking. A three-day survey was chosen in order to avoid overburdening the subjects, and because three to seven days are normally adequate to assess macronutrients such as energy and total carbohydrate (Willett, 1990), and thus presumably food group intakes. All interviews were conducted by the ®rst investigator with the help of two dietitians who attended two half-day training sessions in order to ensure consistency of interview approach and coding method. As foods are often eaten as composite dishes, we asked subjects to estimate amounts of individual components eaten by pouring dried beans into standard plates, bowls or cups. Following the interview, the dietitians coded the number of cereal, fruit and vegetable servings on the interview forms, and these were checked by the ®rst investigator to ensure uniform coding decisions. Daily intake was estimated by averaging number of servings over the three days. Coding. Gram weights were converted into servings using reference portions in the Daily Food Guide (Ministry of Health, 1995). These were supplemented by serving values determined for foods not listed in the Guide. In estimating total food group intake, several investigators (Kant et al, 1991; Patterson et al, 1990) excluded foods consumed in amounts below a minimum cut-off (for example 30 g for fruit and vegetables, and 15 g for cereal foods) to avoid giving credit for consumption when amounts eaten were small. Rather than assigning fractions of a serving, Kant et al (1991) credited one serving to individuals who consumed amounts between the minimum cut-off and median serving size; and Patterson et al (1990) counted 30 g±75 g of vegetables and 30 g±120 g of fruit as contributing one serving. As the authors acknowledged, this latter procedure may lead to overestimation. Discounting small amounts on the other hand, would bias towards under-estimation especially considering the way vegetables and some cereals are consumed in the Singaporean diet. Spoonfuls of cereals are often added to milk drinks, and multiple small amounts of vegetables eaten as part of noodle or meat dishes can add up to signi®cant intakes. Therefore, servings were assigned to all recorded foods belonging to the three food groups, using fractions of servings to represent small amounts. Diet recalls thus, re¯ect comparable levels of measurements to FFQ. Statistical analysis The ability of the FFQ to estimate group means was investigated using paired t-tests. As recommended by Bland & Altman (1986), mean and standard deviation of the differences were calculated to assess agreement between methods at the individual level. For the analysis to be meaningful, it must be assumed that the mean difference and variability of the differences remain reasonably constant throughout the range of measurements (Bland & Altman, 1995b). This was checked graphically by plotting for each food group the difference between methods against their mean. As some plots showed a tendency for differences to increase with measurements, we calculated the 95% limits of agreement (de®ned as mean difference 1.96 s.d. of differences) for both log-transformed and actual values. To assess the ability of the FFQ to rank individuals into broad categories of intake, we divided serving estimates from both FFQs and diet recalls into quartiles to examine their cross-classi®cation. Cut-off points for quartiles were determined separately for the two methods. The percentage correctly classi®ed within one quartile and into extreme quartiles was calculated. To quantify intake de®ned by the FFQ quartiles, we calculated mean servings from the three diet recalls for subjects in each quartile. This gives the actual values for surrogate categories as described by Willett (1990). Differences between quartiles are a function of both the true variation in serving intake and the measurement error associated with the FFQ. For comparison, mean servings were also computed for each quartile of the diet recall distribution. Comparing the `actual' means for quartiles as de®ned by diet recalls with means from surrogate categories as de®ned by FFQ permits a simultaneous evaluation of misclassi®cation and error in estimation of intake. As an instrument may perform with different accuracy in those meeting or not meeting the serving recommendations, we evaluated the FFQ for its sensitivity, speci®city and predictive values. Speci®city was de®ned as the proportion of subjects having less than the recommended number of servings by diet recalls who also fell short on the FFQ. Sensitivity was the proportion of those having adequate servings by diet recalls, and who also reported similar intake on the FFQ. Negative predictive value was the proportion of those falling short of recommended servings on the FFQ who were `truly negatives' on the diet recalls. Positive predictive value was the proportion of those meeting serving targets on the FFQ and whose diet recalls showed likewise. Actual values for surrogate categories and sensitivity, speci®city and predictive values were presented for the second rather than ®rst FFQ based on two considerations. As the FFQ asked about food intake during the previous three months, the repeat FFQ referred to a period encompassing the diet recalls. Furthermore, the repeat FFQs were returned by mail as is planned in the main study, unlike the ®rst FFQ which was personally collected from subjects and brie¯y reviewed with them. Since several distributions were skewed towards higher serving values, we looked at both Spearman rank and Pearson (after log transformation of values) correlations. For fruit, some subjects reported no use, so 1.0 was added to all values before transformation. As variation in intake among men and women of varying age arti®cially increases observed correlations (Willett et al, 1987), serving estimates were adjusted for sex and age. FFQ repeatability was assessed by determining the mean differences and standard deviation of the differences between repeat administrations. All analyses were performed using the SPSS statistical programme. Results Seventy-seven subjects completed the ®rst FFQ and three diet recalls, and 70 (21 men and 49 women) returned the second FFQ. One subject failed to complete the diet recalls because of extended sick leave, and two because of an inability to participate. Recalls from one subject were excluded due to changes in diet. Those who completed the study were approximately equally divided between 24± 45 and 46±59 y age groups (mean age 42.6, s.d. 10.2), and came from diverse socio-economic backgrounds: 38% professional, 26% administrative support, 27% clerical 559 Validation of a short food frequency questionnaire AM Chuan Ling et al 560 and 9% blue collar employees. There were no signi®cant differences in demographic characteristics between those who completed the study (n 70) and those who failed to do so (n 11). Repeatability At the group level, no signi®cant mean difference was found between repeat FFQ measures for cereal foods, fruit or vegetables. When fruit and vegetables were combined, we found the difference to be statistically signi®cant from zero (P 0.02), but small (mean difference 70.14 serving). Between repeat administrations, individual FFQ measurements for total fruit and vegetables may be lower by 1.12 servings or higher by 0.84 servings. For cereal foods, the differences between repeated measurements may vary from being 1.34 servings lower to 1.18 servings higher. Validity Mean and median servings from the diet recalls and two FFQ administrations are presented in Table 1. No signi®cant differences were found in mean intakes between the methods for fruit and vegetables, separately and combined. For cereal foods, differences were signi®cant only in women (P 0.01 for FFQ1; P 0.02 or FFQ2). Differences were however, small with the FFQ overestimating cereal intake by an average of 0.28 servings. For the vegetable servings from diet recalls and FFQ, Figure 1 shows that the scatter of the differences between measurements increases with serving valuesÐan indication that the methods did not agree equally across the entire range of measurements. We would expect the limits of agreement to be wider apart than necessary for small serving values and narrower than they should be for larger values. The pattern of variability is improved when the difference between log transformed values was plotted against their mean. Figure 2 shows reduced tendency for systematic variation. Similar agreement was observed between diet recalls and the ®rst or second FFQ (Table 2). Limits obtained using log-transformed values allow assessment of agreement between methods in terms of a ratio; and limits computed from the actual data give an idea of the magnitude of differences between methods. As a reference point, central points have also been included for all intervals. Thus, for cereal foods, FFQ measurements may be about 37% below to 59% above recall values, and 95% of total fruit and vegetable servings estimated by the FFQ would be expected to lie between half or double the diet recall values. Looking at the untransformed values, the FFQ may over-estimate or under-estimate cereal intake by about 1.5 servings, and total fruit and vegetable intake by about 1.25 servings, when compared with estimates from diet recalls. Intervals were similarly wide for fruit and vegetables analysed separately. These levels of between-method differences would mean that the FFQ is unacceptable for intake assessment of individuals. The proportion of subjects in diet recall quartiles correctly classi®ed by the FFQ into the same quartile and into Table 1 Daily number of food group servings based on the average of three 24h diet recalls, and on the food frequency questionnaires administered at the beginning (FFQ1) and end of study (FFQ2) 24 h diet recalls (n 77) FFQ1 (n 77) FFQ2 (n 70) Food group Mean Median Standard deviation Mean Median Standard deviation Mean Median Standard deviation Cereal foods Men Women Fruit Vegetables Fruit and vegetables 4.61 3.34 1.52 0.91 2.43 4.61 3.48 1.33 0.85 2.03 1.37 0.95 1.15 0.45 1.34 4.63 3.58 1.45 0.92 2.37 4.44 3.63 1.00 0.85 1.92 1.08 0.99 1.22 0.47 1.35 4.74 3.62 1.51 0.93 2.43 4.49 3.46 1.00 0.88 2.09 1.45 1.15 1.22 0.45 1.36 Figure 1 Plot of difference between vegetable servings estimated by three 24h diet recalls and the second-administered FFQ against mean servings by the two methods, showing 95% limits of agreement and central point. Validation of a short food frequency questionnaire AM Chuan Ling et al 561 Figure 2 Plot of difference between log-transformed data on vegetable servings from three 24 h diet recalls and the second-administered FFQ against the mean of the log-transformed data by the two methods, showing 95% limits of agreement and central point. Table 2 95% limits of agreement (& central pointa) between the number of food group servings as assessed by three 24 h diet recalls and the ®rst (FFQ1) and second (FFQ2) administrations of food frequency questionnaire Log-transformed valuesb Food group Cereal foods Fruit Vegetables Fruit and vegetables 24h diet recalls vs FFQ1 0.63±1.45 0.64±1.66 0.41±2.38 0.51±1.99 a (0.95 ) (1.03a) (0.99a) (1.01a) Actual valuesc 24h diet recalls vs FFQ2 0.57±1.55 0.62±1.61 0.40±2.25 0.48±1.93 a (0.94 ) (1.00a) (0.95a) (0.96a) 24h diet recalls vs FFQ1 71.59±1.24 71.12±0.98 70.67±0.66 71.16±1.27 a (70.18 ) (0.07a) (70.01a) (0.06a) 24h diet recalls vs FFQ2 72.09±1.60 71.08±1.07 70.84±0.70 71.33±1.19 (70.24a) (0a) (70.07a) (70.07a) a For the log-transformed values, the central point should give a ratio close to 1. For the non-transformed data, the central point should give a mean difference close to 0. b 95% limits of agreement were based on subtraction of the log-transformed FFQ values from those of diet recalls. 95% of diet recall to FFQ ratios would lie between the limits (mean 1.96). c 95% of actual differences between methods would lie between these limits (mean 1.96). Table 3 Cross-classi®cation by quartiles of food group serving distributions derived from three 24h diet recalls and the ®rst (FFQ1) and second (FFQ2) administrations of the food frequency questionnaire Lowest quartiles on diet recalls Food group Cereal foods Fruit Vegetables Fruit and vegetables a Quartiles on diet recalls Lowest two quartiles on FFQ1 (%) Lowest two quartiles on FFQ2 (%) Same quartiles on FFQ1 (%) Within-one quartiles on FFQ1 (%) Extremea quartiles on FFQ1 (%) Same quartiles on FFQ2 (%) Within-one quartiles on FFQ2 (%) Extreme quartiles on FFQ2 (%) 78.9 100 84.2 100 76.5 100 88.9 88.9 46.8 62.3 36.4 54.5 90.9 98.7 83.1 96.1 2.6 1.3 0 0 47.1 64.3 35.7 62.8 85.7 97.1 81.4 94.3 1.4 0 1.4 0 Quartile 1 on diet recalls versus quartile 4 on FFQ or vice versa. within-one quartiles ranged respectively, from an average of 36% and 82% for vegetables to 63% and 98% for fruit (Table 3). Due to chance alone, 25% or 62.5% of subjects would be expected to be correctly classi®ed into the same or within-one quartiles. Of those subjects belonging to the lowest diet recall quartile, an average of 78% fell into the lowest two quartiles for cereal foods, and an average of 94% were in the lowest two quartiles for total fruit and vegetables, when classi®ed by the FFQ. Gross misclassi®- cation was rare (0±2.6%). Better results were observed at the high end of the distributions (data not shown). For all food groups, mean servings as measured by diet recalls increased across the FFQ quartiles (Table 4). Actual values for surrogate categories were presented only for the second FFQ. Stepwise increase across quartiles was slightly better, especially for vegetables with the ®rst FFQ. As a result of misclassi®cation and regression towards the mean (Willett et al, 1985), the ratio of mean servings for highest Validation of a short food frequency questionnaire AM Chuan Ling et al 562 Table 4 Mean daily number of food group servings derived from three 24h diet recalls for quartiles de®ned by diet recalls, and for quartiles derived by the second food frequency questionnaire administration (n 70) Mean 24h diet recall values for FFQ2 and diet recall quartiles Quartiles 1 FFQ2 24 h diet 2 FFQ2 24 h diet 3 FFQ2 24 h diet 4 FFQ2 24 h diet recall recall recall recall Cereal foods Fruit Vegetables Fruit and vegetables 2.65 0.77 2.31 0.46 3.45 0.69 3.30 0.28 3.92 0.86 3.97 0.18 4.86 1.44 5.31 1.02 0.64 0.63 0.39 0.27 1.10 0.53 1.10 0.18 1.69 0.43 1.73 0.20 3.98 1.17 3.18 1.15 0.63 0.30 0.45 0.14 0.81 0.31 0.72 0.08 0.96 0.34 1.01 0.10 1.07 0.32 1.51 0.50 1.29 0.61 1.10 0.37 1.78 0.43 1.81 0.13 2.34 0.58 2.53 0.31 4.06 1.39 4.31 1.13 Table 5 Sensitivity, speci®city and predictive values of the second-administered food frequency questionnaire for assessing adequacy of intake of food group servings Food group No. of subjects Cereal foods 21 men 49 women 70 70 70 Fruit Vegetables Fruit and vegetables Recommended no. of servings % meeting recommendation by diet recalls % meeting recommendation by FFQ Sensitivity (%) Speci®city (%) Negative predictive value (%)a 6 5 2 2 4 19.0 2.0 30.0 1.4 10.0 14.3 14.3 38.6 5.7 14.3 75 100 86 100 100 100 88 82 96 95 94 100 93 100 100 a For cereal foods, negative predictive value for men 17/18 or 94%; negative predictive value for women 42/42 or 100%. For fruit and vegetables, negative predictive value 60/60 or 100%; positive predictive value 7/10 or 70%. Too few individuals met the recommended number of servings to allow computation of other positive predictive values. to lowest quartiles was lower for FFQ. Comparison between FFQ and diet recall across quartiles suggests that the FFQ may have dif®culty in distinguishing between middle levels of intake for cereal foods, and for vegetables. Sensitivity, speci®city and negative predictive values are presented for the second FFQ (Table 5). Similar values were observed for the ®rst FFQ. Too few subjects met the recommended intakes of cereal foods to allow meaningful computation of the positive predictive value. Of ten subjects predicted by the FFQ as having adequate servings of total fruit and vegetables, three were falsely positive (data not shown). The FFQ was suf®ciently sensitive for example, to identify all seven subjects who had four or more servings of fruit and vegetables by diet recalls. Speci®city and negative predictive values were high. For example, 42 out of 48 women who had less than ®ve servings of cereal foods by diet recalls also reported this level of intake in the FFQ; and all women who had inadequate intakes on the FFQ were `truly negatives' by diet recalls. Very similar values were observed for Spearman rank and Pearson correlations. The unadjusted Pearson correlation coef®cients ranged from a mean of 0.63 for vegetables to 0.79 for cereal foods and 0.84 for fruit with only slight reduction following adjustment (data not shown). When fruit and vegetables were combined, correlation coef®cient averaged 0.80. Correlations were generally higher between intakes measured by diet recalls and the ®rst FFQ. Discussion Although a high response rate was achieved from random sampling across the divisions in the headquarters of the Ministry of Health, with more than a third of the sample comprising of clerical and blue collar workers, it must be acknowledged that the sampling frame for this small validation study did not encompass the entire Singapore population. As such, it is likely that the study population is of somewhat higher educational level and socio-economic status than the general population. Nevertheless, it must be noted that none of the subjects were nutritionists or dietitians and none had any prior experience of dietary interviews and were unaware that they would be asked to recall speci®c day's intake. Acceptable repeatability was established. For all food groups, there were no signi®cant differences between repeated measurements except for a signi®cant but small difference between measurements when fruit and vegetables were combined. Most previous studies have assessed repeatability for reported frequencies of individual foods at intervals of between three months and two years, mainly using correlation coef®cients (Nomura et al, 1976; Colditz et al, 1987; Byers et al, 1987; Salvini et al, 1989). For reasons explained earlier, correlation coef®cients for repeated FFQ administrations are presented here only for purposes of comparisonÐPearson's correlations were 0.85 for cereal foods and 0.90 for total fruit and vegetables. These were substantially higher than those observed in other studies for individual cereal (0.38±0.71) and fruit and vegetable items (0.19±0.73). To our knowledge, this is the only published report examining the repeatability of questions on aggregated food groups. The shorter time interval in our study could have overestimated repeatability as subjects may remember earlier responses. However, long intervals mean that true dietary changes are more likely, thus reducing apparent repeatability. Despite the relatively high correlation coef®cients and little variation in mean differences between repeated measures, it is of interest to note individual FFQ measurements from one administration to another may vary by about one serving both for cereal foods ( 1.18, 7 1.34) and for total fruit and vegetables ( 0.84, 7 1.12). Mean servings measured by the FFQ and the diet recalls were remarkably similar. More detailed FFQs Validation of a short food frequency questionnaire AM Chuan Ling et al have generally been found to over-estimate mean intakes for both nutrients and foods (Jain et al, 1980; Mullen et al, 1984; Pietinen et al, 1988; Salvini et al, 1989; Hankin et al, 1991). Krebs-Smith et al (1995) found that estimates of total fruit and vegetables were positively associated with the number of food items included in food frequency questionnaires. They suggested that summing across individual foods may not be a valid way to assess intake of total fruit and vegetables, and that summary food group questions and/or adjustment factors may increase precision of estimates. Therefore, this short FFQ may have an advantage over longer questionnaires for example, when used for dietary surveillance. It is of interest to note that, consistent with other reports (Cleveland et al, 1997), mean intakes for all food groups in this study were below recommendations. Clearly, the FFQ cannot be used interchangeably with three 24 h diet recalls if estimates of individual intake are required. For the FFQ to do this, 95% of differences would have to lie within limits that would not be of practical importance. Tables 3 and 4 however, show that exact agreement between measurements by the two methods is not critical to the ability of the FFQ to rank individuals, an observation similar to that by Wilson & Horwath (1996) using a calcium FFQ. Jain et al (1980) also found that the higher mean intakes given by their diet history questionnaire had little effect on its ability to classify individuals due to the constancy of overestimation. The observation that a FFQ can produce good group means but have unsatisfactory validity at the individual level (or vice versa) points to the usefulness of examining validity at varying levels of precision at both group and individual levels using different approaches. Mean intake of cereal foods from the FFQ was signi®cantly higher than that from the diet recalls for women, but not men. This may suggest that men performed better in estimating portion sizes, which is contrary to what has been observed by other investigators (Yuhas et al, 1989). However, the direction of bias is not clear. Whereas detailed FFQs have produced higher intake estimates, there is some evidence of underreporting with 24 h recalls (Gibson, 1990). To further explore this question, we compared in ®ve men and seven women, weighed portions of rice with estimates using dried beans. Ten subjects (six women) underestimated portions by the bean method. Underestimation ranged from 710% to 750% and was on average, greater in women. Women tended to estimate amounts by spreading the beans thinly. Therefore, the use of beans to aid in serving size estimation may have contributed to underestimation of cereal foods by the 24 h recall method in women. Young & Marion (1995) in their recent review, noted that dif®culties with portion estimations may be related to perceptual problems that have `barely been explored'. As Block (1982) pointed out, in the absence of a `gold standard', we have to accept the approach of relative validity. There was little difference in all measures of agreement for the two FFQs, except perhaps for the slightly better correlations observed with the ®rst FFQ. Other investigators (Willett et al, 1985; Pietinen et al, 1988; Salvini et al, 1989; Eck et al, 1996) have found better agreement with the second rather than ®rst FFQ administration. In the present study, differences between correlations for the ®rst and second FFQ were small, suggesting any bias was probably not great. Precision in terms of absolute intake at the individual level is often not necessary for epidemiologic studies. It may be suf®cient for an instrument to discriminate among groups of individuals (Block, 1982). The mean intakes of individuals in the highest FFQ quartiles for cereal foods and fruit and vegetables are respectively, 83% and 215% higher than in the lowest quartiles. The corresponding percentages are 130% and 292% for quartiles de®ned by diet recalls. Although the between-quartile contrasts are more modest for the FFQ, the variation in intake from the lowest to highest quartiles suggests that the FFQ could readily distinguish between the high and low consumers. Such categorisation of food group intake is valuable in population studies investigating dietdisease relationships. At the public health level, the high negative predictive values of the FFQ would mean that almost all individuals identi®ed in screening programmes as requiring an increased intake of cereal foods and fruit and vegetables would be given appropriate advice. As the proportion of individuals not meeting the guidelines for fruit and vegetables is relatively high, the accuracy of the FFQ in identifying `negatives' is desirable. The ability to identify those consuming less than the recommended amount of fruit and vegetables is consistent with the work of Cox et al (1997), in which responses to a self-monitoring `portion' measurement of fruit and vegetable consumption were compared with weighed intakes. If the FFQ is used to guide intervention efforts, the low error in speci®city (0±12%) would mean that relatively small proportion of individuals would lose the possibility of receiving a required intervention. The sensitivity error is comparably low (less than 5% for all food groups). However, receiving an unneeded intervention is probably of secondary importance to not getting a required intervention. Being relatively short, the FFQ can be easily included as part of overall health surveys. In Australia and the United States, similar short questions have been used as indicators to monitor fruit and vegetable intake (Coles-Rutishauser, 1996; Serdula et al, 1995). Rather than intended as a measure of agreement, Pearson correlation coef®cients were included here simply for comparison with previous studies. Such comparisons are however, limited by differences in the composition of the study populations and reference methods. Generally, correlations fell in the range of 0.08±0.70 for individual fruit and vegetable items (Salvini et al, 1989). At the food group level, correlations ranged from 0.60±0.77 for cereals, and 0.29±0.71 for fruit and vegetables (Jain et al, 1980; Pietinen et al, 1988; Hankin et al, 1991; Serdula et al, 1993). Comparable or higher correlations were observed in the present study. Despite the much shorter FFQ, the validity of food group intake observed in the present study was comparable to or better than that reported for longer FFQs listing individual foods (Jain et al, 1980; Mullen et al, 1984; Pietinen et al, 1988; Hankin et al, 1991). Perhaps, subjects are better able to describe frequency of use for more generalised categories than for speci®c foods (Horwath, 1990). It is possible that the validity of the FFQ in the present study may be over-estimated as a result of reporting errors shared with diet recalls. Under-estimated validity is however, just as likely. Salvini et al (1989) found the week-to-week within- to between-person variability in mean intake to be greater than one for most foods. Although we would expect less within-person variation at the food-group level, it is plausible that three 24 h recalls may have been insuf®cient to characterize the usual diet. 563 Validation of a short food frequency questionnaire AM Chuan Ling et al 564 Conclusions This short FFQ produced repeatable and valid estimates of mean food group intake in a population of Chinese Singaporeans. The FFQ was able to classify individuals into broad categories of intake, and had high predictive value in identifying those not consuming the recommended number of food group servings. Clearly, the FFQ cannot replace the three-day diet recalls in estimating intake for individuals. The relevance of this FFQ to other populations would need to be established by speci®c validation studies. AcknowledgementsÐGrateful thanks to the Ministry of Health, Singapore for their support and assistance in co-ordination of the study; and Pauline Wilson, Sheila Williams and Caryn Thompson for their assistance in data analysis. References Bland JM & Altman DG (1986): Statistical methods for assessing the agreement between two methods of clinical measurement. Lancet 1, 307±310. Bland JM & Altman DG (1995): Comparing two methods of clinical measurement: A personal history. Int. J. Epidemiol. 24, Suppl. 1, S7±S14. Bland JM & Altman DG (1995): Comparing methods of measurement: why plotting difference against standard method is misleading. Lancet 346, 1085±1087. Block G (1982): A review of validations of dietary assessment methods. Am. J. Epidemiol. 115, 492±505. Block G, Hartman AM, Dresser CM, Carroll MD, Gannon J & Gardner L (1986): A data-based approach to diet questionnaire design and testing. Am. J. Epidemiol. 124, 453±469. Block G, Thompson FE, Hartman AM, Larkin FA & Guire KE (1992): Comparison of two dietary questionnaires validated against multiple dietary records collected during a 1-year period. J. Am. Diet. Assoc. 92, 686±693. British Standards Institution (1979): Precision of Test Methods I. Guide for the Determination and Reproducibility for a Standard Test Method. (BS 5497, part 1). London: BSI. Byers T, Marshall J, Anthony E, Fiedler R & Zielezny M (1987): The reliability of dietary history from the distant past. Am. J. Epidemiol. 125, 999±1011. Cleveland LE, Cook DA, Krebs-Smith SM & Friday J (1997): Method for assessing food intakes in terms of servings based on food guidance. Am. J. Clin. Nutr. 65, Suppl, 1254S±1263S. Colditz GA, Willett WC, Stampfer MJ, Sampson L, Rosner B, Hennekens CH & Speizer FE (1987): The in¯uence of age, relative weight, smoking and alcohol intake on the reproducibility of a dietary questionnaire. Int. J. Epidemiol. 16, 392±398. Coles-Rutishauser IHE (1996): A Guide to Instruments for Monitoring Food Intake, Food Habits and Dietary Change. Canberra: Australian Institute of Health and Welfare (Food and Nutrition Monitoring Unit Working paper No. 96.2). Cox DN, Anderson AS, Reynolds J, McKellar S, Mela DJ & Lean MEJ (1997): Measuring fruit and vegetable intake: is ®ve-a-day enough? Eur. J. Clin. Nutr. 51, 177±180. Eck L, Klesges LM & Klesges RC (1996): Precision and estimated accuracy of two short-term food frequency questionnaires compared with recalls and records. J. Clin. Epidemiol 49, 1195±1200. Gibson R (1990): Validity in dietary assessment: a review. J. Can. Diet. Assoc. 51, 275±280. Hankin JH, Wilkens LR, Kolonel LN & Yoshizawa CN (1991): Validation of a quantitative diet history method in Hawaii. Am. J. Epidemiol. 133, 616±628. Horwath CC (1990): Food frequency questionnaires: a review. Aust. J. Diet. Assoc. 47, 71±76. Jain M, Howe GR, Johnson KC & Miller AB (1980): Evaluation of a diet history questionnaire for epidemiologic studies. Am. J. Epidemiol. 111, 212±219. Kant AK, Block G, Schatzkin A, Ziegler RG & Nestle M (1991): Dietary diversity in the US population, NHANES II, 1976±1980. J. Am. Diet. Assoc. 91, 1526±1531. Krebs-Smith SM, Heimendinger J, Subar AF, Patterson BH & Pivonka E (1995): Using food frequency questionnaires to estimate fruit and vegetable intake: association between the number of questions and total intakes. J. Nutr. Edu. 27, 80±85. Krebs-Smith SM, Cleveland LE, Ballard-Barbash R, Cook DA & Kahle LL (1997): Characterising food intake patterns of American adults. Am. J. Clin. Nutr. 65, Suppl, 1264S±1268S. Ministry of Health, Singapore, Training and Health Education Department (1989): Guidelines for a Healthy DietÐRecommendations of the National Advisory Committee on Food and Nutrition. Ministry of Health, Singapore, Food and Nutrition Department (1994): Food Consumption Study 1993. Ministry of Health, Singapore, Food and Nutrition Department (1995): A Balanced DietÐThe Pyramid Way. Mullen BJ, Krantzler NJ, Grivetti LE, Schutz HG & Meiselman HL (1984): Validity of a food frequency questionnaire for the determination of individual food intake. Am. J. Clin. Nutr. 39, 136±143. Nomura A, Hankin JH & Rhoads GG (1976): The reproducibility of dietary intake data in a prospective study of gastrointestinal cancer. Am. J. Clin. Nutr. 29, 1432±1436. Patterson BH, Block G, Rosenberger WF, Pee D & Kahle LL (1990): Fruit and vegetables in the American diet: data from the NHANES II survey. Am. J. Public Health. 80, 1443±1449. Pietinen P, Hartman AM, Haapa E, Rasanen L, Haapakoski J, Palmgren J, Albanes D, Virtamo J & Huttunen JK (1988): Reproducibility and validity of dietary assessment instruments: a self-administered food-use questionnaire with a portion size picture booklet. Am. J. Epidemiol. 128, 655±666. Salvini S, Hunter DJ, Sampson L, Stampfer MJ, Colditz GA, Rosner B & Willett WC (1989): Food-based validation of a dietary questionnaire: the effects of week-to-week variation in food consumption. Int. J. Epidemiol. 18, 858±867. Serdula M, Coates R, Byers T, Mokdad A, Jewell S, Chavez N, MaresPerlman J, Newcomb P, Ritenbaugh C, Treiber F & Block G (1993): Evaluation of a brief telephone questionnaire to estimate fruit and vegetable consumption in diverse study populations. Epidemiol. 4, 455±463. Serdula M, Coates R, Byers T, Simoes E, Mokdad A & Subar A (1995): Fruit and vegetable intake among adults in 16 states: results of a brief telephone survey. Am. J. Public Health 85, 236±239. Willett WC, Sampson L, Stampfer MJ, Rosner B, Bain C, Witschi J, Hennekens CH & Speizer FE (1985): Reproducibility and validity of a semiquantitative food frequency questionnaire. Am. J. Epidemiol. 122, 51±65. Willett WC, Reynolds RD, Cottrell-Hoehner S, Sampson L & Browne ML (1987): Validation of a semiquantitative food frequency questionnaire: comparison with a 1 year diet record. J. Am. Diet. Assoc. 87, 43±47. Willett WC (1990): Nutritional Epidemiology. New York: Oxford University Press. Wilson P & Horwath C (1996): Validation of a short food frequency questionnaire for assessment of dietary calcium in women. Eur. J. Clin. Nutr. 50, 220±228. Young LR & Marion M (1995): Portion sizes in dietary assessment: issues and policy implications. Nutr. Rev. 53, 149±158. Yuhas JA, Bolland JE & Bolland TW (1989): The impact of training, food type, gender, and container size on the estimation of food portion sizes. J. Am. Diet. Assoc. 89, 1473±1477.