Reproducibility and validity testing of appetite ratings and energy intakes are needed in experimental and natural settings. Eighteen healthy young women ate a standardized breakfast for 8 days. Days 1 and 8, they rated their appetite (Hunger, Fullness, Desire to Eat, Prospective Food Consumption (PFC)) over a 3.5 h period using visual analogue scales, consumed an ad libitum lunch, left the research center and recorded food intake for the remainder of the day. Days 2–7, participants rated their at-home Hunger at 0 and 30 min post-breakfast and recorded food intake for the day. Total area under the curve (AUC) over the 180 min period before lunch, and energy intakes were calculated. Reproducibility of satiety measures between days was evaluated using coefficients of repeatability (CR), coefficients of variation (CV) and intra-class coefficients (ri). Correlation analysis was used to examine validity between satiety measures. AUCs for Hunger, Desire to Eat and PFC (ri = 0.73–0.78), ad libitum energy intakes (ri = 0.81) and total day energy intakes (ri = 0.48) were reproducible; fasted ratings were not. Average AUCs for Hunger, Desire to Eat and PFC, Desire to Eat at nadir and PFC at fasting, nadir and 180 min were correlated to total day energy intakes (r = 0.50–0.77, P < 0.05), but no ratings were correlated to lunch consumption. At-home Hunger ratings were weakly reproducible but not correlated to reported total energy intakes. Satiety ratings did not concur with next meal intake but PFC ratings may be useful predictors of intake. Overall, this study adds to the limited satiety research on women and challenges the accepted measures of satiety in an experimental setting.