Y.B. Eisma
Please Note
34 records found
1
Now or never
Eye tracking and response times reveal the dynamics of highway merging decisions
Merging onto a highway is a safety-critical task resulting in a large number of traffic accidents; fundamental research into merging behavior of human drivers can help reduce this toll. Two cognitive processes critical to merging, attention allocation and decision making, have been extensively studied in real-world and simulated driving scenarios. However, how these processes interact during highway merging remains poorly understood. While the relationship between attention and decision making has been widely examined in cognitive science, this work has largely relied on simple decision-making paradigms involving choices between static items on a computer screen, which limits the understanding of more dynamic and naturalistic decisions such as in driving. To address this gap, we investigated the relationship between attention and decision making in a simplified highway merging task. In a video-based experiment, participants (N=24) repeatedly made merging gap acceptance decisions based on the dynamic information about the distance and time-to-arrival to the end of the merging lane and the gap to the target-lane vehicle (available in the front view and the side mirror, respectively). Participants’ decisions, response times, and eye movements were recorded. We found that decisions to accept a gap were considerably faster than decisions to reject a gap. Decision outcomes and timing depended on the distance to and time-to-arrival of the target-lane vehicle, but also on the time pressure due to approaching the end of the merging lane. Most importantly, under high time pressure, a greater proportion of time spent looking at the side mirror was associated with a lower probability of accepting the gap. This finding indicates that differences in visual information sampling can be closely linked to decision outcomes when time budgets are constrained. Our results provide initial empirical insights relevant for future cognitive modeling of the interplay between decision making and attention during highway merging. This work can inform early-stage exploration of driver monitoring and support systems for partially automated driving.
ChatGPT and academic work
New psychological phenomena
This study describes the impact of ChatGPT use on the nature of work from the perspective of academics and educators. We elucidate six phenomena: (1) the cognitive workload associated with conducting Turing tests to determine if ChatGPT has been involved in work productions; (2) the ethical void and alienation that result from recondite ChatGPT use; (3) insights into the motives of individuals who fail to disclose their ChatGPT use, while, at the same time, the recipient does not reveal their awareness of that use; (4) the sense of ennui as the meanings of texts dissipate and no longer reveal the sender’s state of understanding; (5) a redefinition of utility, wherein certain texts show redundancy with patterns already embedded in the base model, while physical measurements and personal observations are considered as unique and novel; (6) a power dynamic between sender and recipient, inadvertently leaving non-participants as disadvantaged third parties. This paper makes clear that the introduction of AI tools into society has far-reaching effects, initially most prominent in text-related fields, such as academia. Whether these implementations represent beneficial innovations for human prosperity, or a rather different line of social evolution, represents the pith of our present discussion.
Loneliness, personality, and attention to AI-generated images depicting social threat
An eye-tracking study
Attention bias towards social threat has been linked to loneliness and anxiety, though findings are mixed and concerns about measurement reliability persist. This study examined whether state and trait loneliness, along with personality, self-esteem, social anxiety, and life satisfaction, are associated with attention bias towards social threat images (indicating rejection or exclusion) in young adults (N = 241). AI-generated images were used to enhance control over stimulus content and category distinctions. Participants completed an eye-tracking free-viewing task comprising 40 image matrices (four images per matrix, displayed for 6000 ms). We then computed attention bias (dwell time percentage, total fixation duration percentage, and fixation count percentage) and initial orientation of attention (first fixation percentage). The attention bias measures showed adequate-to-good internal consistency (α = 0.61–0.86). No significant associations emerged between loneliness and attention to socially threatening stimuli, suggesting that heightened vigilance to social threat may not be a feature of loneliness in non-clinical young adults. However, it was found that females exhibited greater attention to social positive images, and baseline pupil diameter was associated with social anxiety. Future research should assess whether loneliness-specific attention bias is a replicable phenomenon, ideally by using an extreme-sampling approach with very lonely individuals.
Despite the significant advancements in computer vision models, their ability to generalize to novel object-attribute compositions remains limited. Existing methods for Compositional Zero-Shot Learning (CZSL) mainly focus on image classification. This paper aims to enhance CZSL in object detection without forgetting prior learned knowledge. We use Grounding DINO and incorporate Compositional Soft Prompting (CSP) into it and extend it with Compositional Anticipation. We achieve a 70.5% improvement over CSP on the harmonic mean (HM) between seen and unseen compositions on the CLEVR dataset. Furthermore, we introduce Contrastive Prompt Tuning to incrementally address model confusion between similar compositions. We demonstrate the effectiveness of this method and achieve an increase of 14.5% in HM across the pretrain, increment, and unseen sets. Collectively, these methods provide a framework for learning various compositions with limited data, as well as improving the performance of underperforming compositions when additional data becomes available.
Detecting Midjourney-Generated Images
An Eye-Tracking Study
This study investigated human performance in identifying AI-generated images. In a speeded forced-choice task, 255 participants viewed paired images (one real, one AI-generated by Midjourney) of standard or futuristic cars and buildings and had to identify the AI-generated one, while eye movements were recorded using an eye-tracker. Results revealed a powerful “futurism-as-artificiality” heuristic. Specifically, participants performed poorly (55% correct) when an AI-generated standard image was paired with a real futuristic image. Conversely, accuracy was high (91% correct) when the AI-generated futuristic image was paired with a real standard image. Participants’ gaze landed first on the AI-generated image more often when it depicted a futuristic design than when it depicted a standard one. The demonstrated heuristic presents a double-edged sword for information veracity: it may lead to the uncritical acceptance of AI-generated misinformation that appears conventional, while simultaneously causing real forward-thinking designs to be dismissed as fake.
Turing Tests in Chess
An Experiment Revealing the Role of Human Subjectivity
System 2 Thinking in OpenAI’s o1-Preview Model
Near-Perfect Performance on a Mathematics Exam
Ergonomics & Human factors
Fade of a discipline
In this commentary, we argue that the field of Ergonomics and Human Factors (EHF) has the tendency to present itself as a thriving and impactful science, while in reality, it is losing credibility. We assert that EHF science (1) has introduced terminology that is internally inconsistent and hardly predictive-valid, (2) has virtually no impact on industrial practice, which operates within frameworks of regulatory compliance and profit generation, (3) repeatedly employs the same approach of conducting lab experiments within unrealistic paradigms in order to complete deliverables, (4) suggests it is a cumulative science, but is neither a leader nor even an adopter of open-science initiatives that are characteristic of scientific progress and (5) is being assimilated by other disciplines as well as Big Tech. Recommendations are provided to reverse this trend, although we also express a certain resignation as our scientific discipline loses significance. Practitioner Summary: This paper offers criticism of the field of Ergonomics. There are issues such as unclear terminology, unrealistic experiments, insufficient impact and lack of open data. We provide recommendations to reverse the trend. This article concerns a critique of EHF as a science, and is not a critique of EHF practitioners.
Responses to Raven matrices
Governed by visual complexity and centrality
Raven matrices are widely considered a pure test of cognitive abilities. Previous research has examined the extent to which cognitive strategies are predictive of the number of correct responses to Raven items. This study examined whether response times can be explained directly from the centrality and visual complexity of the matrix cells (edge density and perceived complexity). A total of 159 participants completed a 12-item version of the Raven Advanced Progressive Matrices. In addition to item number (an index of item difficulty), the findings demonstrated a positive correlation between the visual complexity of Raven items and both the mean response time and the number of fixations on the matrix (a strong correlate of response time). Moreover, more centrally placed cells as well as more complex cells received more fixations. It is concluded that response times on Raven matrices are impacted by low-level stimulus attributes, namely, visual complexity and eccentricity.
Should an external human-machine interface flash or just show text?
A study with a gaze-contingent setup
Automated vehicles need to prioritize pedestrian safety. One way to achieve this is through external human–machine interfaces (eHMIs) that send visual signals to pedestrians. eHMIs can be either text-based or light-based. However, there has been limited research on the effects of these types of eHMI on human information processing and attention allocation. This study aimed to fill this gap by using a gaze-contingent approach, which blurs the view outside a circular aperture, to test the hypothesis that text-based eHMIs, which require focused or foveal attention, result in longer response times compared to light-based eHMIs, which can be understood using peripheral vision. In this study, 23 participants watched animated video clips of traffic situations involving automated vehicles with either no eHMI, a flashing-light eHMI, or a text-based eHMI. Their eye movements were tracked, and they were asked to press the spacebar when they felt it was safe to cross the road. The results showed faster response times when an eHMI was present, with no significant difference between the two types of eHMIs. Further analysis suggested that the flashing-light eHMI captured attention briefly, while the text-based eHMI held attention for a longer period. When no eHMI was present, participants focused on the approaching vehicle for the longest time. The gaze-contingent window resulted in fewer eye movements and slower response times. In conclusion, the study showed that the gaze-contingent window negatively affected response times and eye movements, emphasizing the importance of considering peripheral vision when designing eHMIs for pedestrian safety.
The interaction between biological tissue and electromagnetic fields (EMF) is a topic of increasing interest due to the rising prevalence of background EMF in the past decades. Previous studies have attempted to measure the effects of EMF on brainwaves using EEG recordings, but are typically hampered by experimental and environmental factors. In this study, we present a framework for measuring the impact of EMF on EEG while controlling for these factors. A Bayesian statistical approach is employed to provide robust statistical evidence of the observed EMF effects. This study included 32 healthy participants in a double-blinded crossover counterbalanced design. EEG recordings were taken from 63 electrodes across 6 brain regions. Participants underwent a measurement protocol comprising two 18-min sessions with alternating blocks of eyes open (EO) and eyes closed (EC) conditions. Group 1 (n = 16) had EMF during the first session and sham during the second session; group 2 (n = 16) had the opposite. Power spectral density plots were generated for all sessions and brain regions. The Bayesian analysis provided statistical evidence for the presence of an EMF effect in the alpha band power density in the EO condition. This measurement protocol holds potential for future research on the impact of novel transmission protocols.
Augmented reality-based telepresence in a robotic manipulation task
An experimental evaluation
A spectrum of control methods in human–robot interaction was investigated, ranging from direct control to telepresence with a virtual representation of the robot arm. A total of 24 participants used a setup that included a Franka Emika Panda robot arm, Varjo XR-3 head-mounted display, and Leap Motion Controller. Participants performed a box-and-block task using the bare hand (A), and under five gesture-controlled robotic operation methods: direct sight (B), sight via video-feedthrough (C), in a 3D telepresence environment with (D) and without (E) virtual representation of the robot arm, and using a 2D video feed (F). The number of grabbing attempts did not differ significantly between conditions, but local operation (B & C) yielded more transferred blocks than teleoperation (D–F). Teleoperation using a 3D presentation was advantageous compared to teleoperation using a 2D video feed, as demonstrated by lower peak forces and smaller range in gripper heights in conditions D and E compared to condition F, a finding supported by analyses of the head movement activity. Finally, the bare hand yielded the best performance and subjective ratings. In summary, teleoperation using a 3D presentation provided a smoother interaction than teleoperation with a 2D video feed. However, direct human interaction remains a benchmark yet to surpass.
In the 1950s and 1960s, John Senders carried out a number of influential experiments on the monitoring of multidegree-of-freedom systems. In these experiments, participants were tasked with detecting events (threshold crossings) for multiple dials, each presenting a signal with different bandwidth. Senders’ analyses showed a nearly linear relationship between signal bandwidth and the amount of attention paid to the dial, and he argued that humans sample according to bandwidth, in line with the Nyquist–Shannon sampling theorem.
Objective
The current study tested whether humans indeed sample the dials based on bandwidth alone or whether they also use salient peripheral cues.
Methods
A dial-monitoring task was performed by 33 participants. In half of the trials, a gaze-contingent window was used that blocked peripheral vision.
Results
The results showed that, without peripheral vision, humans do not effectively distribute their attention across the dials. The findings also suggest that, when given full view, humans can detect the speed of the dial using their peripheral vision.
Conclusion
It is concluded that salience and bandwidth are both drivers of distributed visual attention in a dial-monitoring task.
Application
The present findings indicate that salience plays a major role in guiding human attention. A subsequent recommendation for future human–machine interface design is that task-critical elements should be made salient. ...
In the 1950s and 1960s, John Senders carried out a number of influential experiments on the monitoring of multidegree-of-freedom systems. In these experiments, participants were tasked with detecting events (threshold crossings) for multiple dials, each presenting a signal with different bandwidth. Senders’ analyses showed a nearly linear relationship between signal bandwidth and the amount of attention paid to the dial, and he argued that humans sample according to bandwidth, in line with the Nyquist–Shannon sampling theorem.
Objective
The current study tested whether humans indeed sample the dials based on bandwidth alone or whether they also use salient peripheral cues.
Methods
A dial-monitoring task was performed by 33 participants. In half of the trials, a gaze-contingent window was used that blocked peripheral vision.
Results
The results showed that, without peripheral vision, humans do not effectively distribute their attention across the dials. The findings also suggest that, when given full view, humans can detect the speed of the dial using their peripheral vision.
Conclusion
It is concluded that salience and bandwidth are both drivers of distributed visual attention in a dial-monitoring task.
Application
The present findings indicate that salience plays a major role in guiding human attention. A subsequent recommendation for future human–machine interface design is that task-critical elements should be made salient.
Blinded windows and empty driver seats
The effects of automated vehicle characteristics on cyclists’ decision-making
Automated vehicles (AVs) may feature blinded (i.e. blacked-out) windows and external human–machine interfaces (eHMIs), and the driver may be inattentive or absent, but how these features affect cyclists is unknown. In a crowdsourcing study, participants viewed images of approaching vehicles from a cyclist's perspective and decided whether to brake. The images depicted different combinations of traditional vehicles versus AVs, eHMI presence, vehicle approach direction, driver visibility/window-blinding, visual complexity of the surroundings, and distance to the cyclist (urgency). The results showed that the eHMI and urgency level had a strong impact on crossing decisions, whereas visual complexity had no significant influence. Blinded windows caused participants to brake for the traditional vehicle. A second crowdsourcing experiment aimed to clarify the findings of Experiment 1 by also requiring participants to detect the vehicle features. It was found that the eHMI ‘GO’ and blinded windows yielded high detection rates and that driver eye contact caused participants to continue pedalling. To conclude, blinded windows increase the probability that cyclists brake, and driver eye contact stimulates cyclists to continue cycling. Our findings, which were obtained with large international samples, may help elucidate how AVs (in which the driver may not be visible) affect cyclists’ behaviour.