<aside> 💡

Understand the CPT parameters and risk modeling that define our policy priors

</aside>

Risk-seeking is a behavioral trait in which an individual accepts greater economic uncertainty in pursuit of potentially higher returns, or values the thrill of uncertainty itself more than the safety of guaranteed outcomes.

The findings from Tversky and Kahneman's Cumulative Prospect Theory have profound implications for our research on enhancing human teleoperation of robots under communication latency conditions. The paper's core insight—that humans systematically deviate from rational decision-making in predictable ways when facing uncertainty—directly applies to teleoperation scenarios where operators must make control decisions without immediate feedback due to communication delays. When a human operator is controlling a Mars rover with several minutes of communication lag, they're essentially operating under extreme uncertainty about the current state of the robot and environment. CPT's mathematical framework reveals that operators will exhibit loss aversion (fearing potential damage to expensive equipment more than they value successful task completion), probability weighting biases (overestimating the likelihood of rare catastrophic events while underestimating more probable minor issues), and reference point dependency (making decisions based on perceived changes from expected states rather than absolute outcomes). This means our AI prediction system cannot assume operators will make mathematically optimal decisions—instead, it must account for these systematic cognitive biases to accurately predict human behavior and provide appropriate assistance.

The fourfold pattern of risk attitudes discovered in the paper is particularly relevant to robotics teleoperation scenarios. According to CPT, humans exhibit risk aversion for high-probability gains but risk-seeking behavior for low-probability gains, while showing risk-seeking behavior for high-probability losses and risk aversion for low-probability losses. In the context of rover operations, this means an operator facing a high-probability successful navigation task will be conservative (risk averse), potentially choosing suboptimal but "safe" paths. However, when facing a low-probability opportunity for significant scientific discovery, they may take excessive risks (risk seeking). Conversely, when confronted with likely equipment damage, operators may engage in desperate risk-taking behavior to avoid certain losses, while being overly cautious about small-probability failure modes. Our AI system needs to recognize these patterns and potentially provide different types of predictive support depending on how the operator frames the situation—whether they perceive it as an opportunity for gains or a threat of losses. The AI could help reframe situations to promote more rational decision-making or adjust its predictions based on the expected bias direction.
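The fourfold pattern can be reproduced directly from CPT's functional forms. The sketch below uses the median parameter estimates Tversky and Kahneman report (α = 0.88, λ = 2.25, and curvature γ = 0.61 for gains, 0.69 for losses); the comparison of each gamble against a sure outcome with the same expected value is illustrative, not part of our operator model:

```python
def w(p, gamma):
    # Inverse-S probability weighting (Tversky & Kahneman, 1992):
    # gamma = 0.61 for gains, 0.69 for losses
    return p**gamma / (p**gamma + (1 - p)**gamma) ** (1 / gamma)

def v(x, alpha=0.88, lam=2.25):
    # Power value function over gains/losses, with loss aversion lam
    return x**alpha if x >= 0 else -lam * (-x)**alpha

def cpt_value(p, x):
    # CPT value of a simple "p chance of x, else nothing" prospect
    gamma = 0.61 if x >= 0 else 0.69
    return w(p, gamma) * v(x)

# Compare each gamble to a sure thing with the same expected value:
for p, x in [(0.95, 100), (0.05, 100), (0.95, -100), (0.05, -100)]:
    attitude = "seeking" if cpt_value(p, x) > v(p * x) else "averse"
    print(f"{p:.0%} chance of {x:+}: risk {attitude}")
```

Running this recovers all four quadrants: risk aversion for the 95% gain and the 5% loss, risk seeking for the 5% gain and the 95% loss, matching the pattern described above.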

The technical innovation of cumulative probability weighting in CPT offers a sophisticated framework for modeling how human operators will actually assess and respond to uncertain situations during teleoperation tasks. Unlike traditional expected utility models that our AI might initially use to predict human behavior, CPT's weighting functions reveal that operators will systematically overweight small probabilities (leading to excessive concern about rare failure modes) and underweight moderate probabilities (potentially causing complacency about more likely issues). This has direct implications for designing our predictive algorithms—rather than simply calculating expected values of different control actions, our system should incorporate CPT's probability weighting functions to better predict which options the human operator is likely to choose. For instance, if our AI calculates that there's a 15% chance of rover wheel damage from a particular maneuver, the human operator is likely to perceive this risk as much higher due to probability overweighting, and our prediction model should account for this bias. Furthermore, the separate weighting functions for gains and losses (w⁺ and w⁻) suggest that our AI should differentiate between situations where operators are focused on achieving positive outcomes versus avoiding negative ones, as their decision patterns will be fundamentally different in these two contexts.
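The cumulative weighting scheme can be sketched as follows: outcomes are ranked, and each outcome's decision weight is the difference between the weighted cumulative probability of doing at least that well (for gains) or at least that badly (for losses). This is a minimal illustration using Tversky and Kahneman's reported parameters, not our production prediction model; the rover-style prospects at the bottom are hypothetical:

```python
def w(p, gamma):
    # Inverse-S weighting function (overweights small p, underweights moderate p)
    return p**gamma / (p**gamma + (1 - p)**gamma) ** (1 / gamma)

def v(x, alpha=0.88, lam=2.25):
    return x**alpha if x >= 0 else -lam * (-x)**alpha

def cpt(prospect):
    """CPT value of a list of (outcome, probability) pairs.
    Decision weights are cumulative differences, built separately
    for gains (from best outcome down, using w+) and losses
    (from worst outcome up, using w-)."""
    gains = sorted([(x, p) for x, p in prospect if x >= 0], reverse=True)
    losses = sorted([(x, p) for x, p in prospect if x < 0])
    total, cum = 0.0, 0.0
    for x, p in gains:                      # best gain first, gamma+ = 0.61
        total += (w(cum + p, 0.61) - w(cum, 0.61)) * v(x)
        cum += p
    cum = 0.0
    for x, p in losses:                     # worst loss first, gamma- = 0.69
        total += (w(cum + p, 0.69) - w(cum, 0.69)) * v(x)
        cum += p
    return total

# A 15% chance of wheel damage gets decision weight w(0.15) ~ 0.23,
# so the operator behaves as if the risk were half again as large:
print(w(0.15, 0.61))
```

Note how the 15% risk from the paragraph above is effectively inflated to roughly 23%, while a 50% probability is deflated to about 42%, which is exactly the overweighting/underweighting asymmetry the prediction model needs to correct for.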

The mathematical precision of CPT's value function and its demonstration of diminishing sensitivity provide actionable insights for calibrating our human-robot interaction systems. The finding that humans exhibit diminishing sensitivity both for outcomes (the difference between $0 and $100 feels larger than between $1000 and $1100) and probabilities (the difference between 0% and 10% feels larger than between 40% and 50%) suggests that our AI should weight recent changes more heavily than absolute states when predicting operator responses. In teleoperation contexts, this means operators will be more sensitive to sudden changes in robot status or environmental conditions than to the absolute values of these parameters. Our predictive system could leverage this by focusing on delta values and rate-of-change information rather than just current states. Additionally, the loss aversion coefficient (λ = 2.25 in the experimental data) provides a quantitative basis for understanding how much more operators will care about preventing damage compared to achieving mission success. This ratio could be directly incorporated into our AI's utility calculations when predicting operator preferences and decisions, allowing for more accurate modeling of human behavior under the high-stakes, high-uncertainty conditions typical of robotic teleoperation with communication delays.
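Both properties fall out of the value function itself. A minimal sketch, using the paper's power-function form with its median exponent estimate (α = 0.88) and the λ = 2.25 loss-aversion coefficient cited above; the dollar amounts simply reuse the paragraph's examples:

```python
def v(x, alpha=0.88, lam=2.25):
    # Tversky-Kahneman value function over gains and losses measured
    # from a reference point; lam = 2.25 is the median loss-aversion
    # estimate from their experimental data
    return x**alpha if x >= 0 else -lam * (-x)**alpha

# Diminishing sensitivity: the first $100 feels larger than the
# $100 step from $1000 to $1100
print(v(100) - v(0), v(1100) - v(1000))

# Loss aversion: a loss looms 2.25x larger than an equal-sized gain
print(-v(-100) / v(100))
```

The first comparison shows the subjective value of a $100 change shrinking as the base amount grows, which is the rationale for feeding our predictor delta and rate-of-change features; the second is the λ ratio that could scale the damage-avoidance term in the AI's utility calculations.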