Adaptive Companionship for Group-Following Robots: Handling Dynamically Changing Group Formations

Cong-Thanh Vu; Yen-Chen Liu

arxiv: 2607.01287 · v1 · pith:55P52E7Unew · submitted 2026-07-01 · 💻 cs.RO · cs.AI· cs.SY· eess.SY

Adaptive Companionship for Group-Following Robots: Handling Dynamically Changing Group Formations

Cong-Thanh Vu , Yen-Chen Liu This is my paper

Pith reviewed 2026-07-03 20:35 UTC · model grok-4.3

classification 💻 cs.RO cs.AIcs.SYeess.SY

keywords group followingsocial robotsvision-language modelsdynamic formationsadaptive companionshipmodel predictive control

0 comments

The pith

Robots use vision-language models to adapt positions while following groups whose formations change over time.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a method that lets a robot accompany a group of people even when the group members shift their relative positions. A perceptual module first creates visual representations of the group's space, which a vision-language model then uses to reason about suitable companion locations, social distances, and overall dynamics. These inferences feed into a Model Predictive Path Integral controller that generates safe, stable robot motion. Tests across five scenarios report higher success rates and fewer collisions than earlier methods, while a user study finds the resulting behaviors appear natural. The work targets the practical problem that fixed-formation techniques break down once real human groups begin to move fluidly.

Core claim

The central claim is that combining visual representations of group interaction space with a vision-language model's semantic reasoning, then feeding the output to a Model Predictive Path Integral controller, produces stable and socially appropriate accompaniment even as group formations change dynamically.

What carries the argument

Vision-language model inference of companion positions and group dynamics from perceptual visual representations of the interaction space, integrated with an MPPI controller for motion generation.

If this is right

The approach yields a 15 percent higher success rate than baseline methods across the tested scenarios.
Collision rates decrease by 25 percent relative to the same baselines.
User evaluations rate the produced companionship behaviors as natural and socially appropriate.
The combination of perceptual module, VLM, and MPPI controller maintains stability and safety during motion.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same visual-reasoning loop could be applied to other social navigation tasks that require understanding changing spatial relations among people.
Real-world deployment would require checking how well the model handles partial occlusions or rapid group splits that were not emphasized in the five scenarios.
If the VLM component generalizes, it may reduce the need for hand-crafted rules about social distance in future robot navigation systems.

Load-bearing premise

The vision-language model can reliably interpret visual group representations to select appropriate positions and distances.

What would settle it

A controlled test in which the robot is placed in a previously unseen group formation and the measured success rate falls below the reported baseline.

Figures

Figures reproduced from arXiv: 2607.01287 by Cong-Thanh Vu, Yen-Chen Liu.

**Figure 2.** Figure 2: The overall structure of the system architecture. VLM uses a visual representation of detected humans and the robot [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗

**Figure 3.** Figure 3: VLM identifies members of the companion group [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗

**Figure 4.** Figure 4: CoT prompting with a one-shot example enables [PITH_FULL_IMAGE:figures/full_fig_p004_4.png] view at source ↗

**Figure 6.** Figure 6: Qualitative results illustrating the motion of the proposed method across the five scenarios. [PITH_FULL_IMAGE:figures/full_fig_p005_6.png] view at source ↗

**Figure 7.** Figure 7: User Study Average Scores. TABLE III: Quantitative Results of the Ablation Study Method Scenario 1 Scenario 2 Scenario 3 Scenario 4 Scenario 5 SR CR↓ CD↓ SR↑ CR↓ CD↓ SR↑ CR↓ CD↓ SR↑ CR↓ CD↓ SR↑ CR↓ CD↓ Proposed 100 0 1.1 ± 0.04 80 10 1.05 ± 0.06 90 0 1.07 ± 0.09 85 0 1.03 ± 0.07 95 5 0.87 ± 0.11 w/o VLM group detection 100 0 1.07 ± 0.07 80 0 1.10 ± 0.09 30 20 2.10 ± 0.14 85 10 1.11 ± 0.09 90 10 1.06 ± 0.06… view at source ↗

**Figure 8.** Figure 8: Performance of the proposed method for different [PITH_FULL_IMAGE:figures/full_fig_p007_8.png] view at source ↗

read the original abstract

Accompanying a group of humans is an essential aspect of developing human-like social cognition in robots. However, human groups typically do not follow fixed formations, which poses significant challenges for robots in maintaining natural companionship behaviors. In this paper, we propose an adaptive group-accompaniment method for social robots based on Vision-Language Models (VLMs), leveraging their semantic reasoning capabilities to infer companion positions, maintain social distances, and understand group dynamics. The members of the group are first detected, and a perceptual module generates visual representations of the interaction group space as input to the VLM, which is then combined with a Model Predictive Path Integral (MPPI) controller to ensure stability and safety. Experimental evaluations across five scenarios show that the proposed method enables robots to accompany the group effectively, demonstrating a 15\% improvement in success rate and a 25\% reduction in collision rate compared to baseline approaches. Additionally, a user study indicates that the generated companionship behaviors are perceived as natural and socially appropriate.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

VLM position inference is the claimed novelty but the experiments report only aggregate success and collision numbers with no separate check on VLM output quality.

read the letter

The paper integrates a perceptual module that feeds visual group representations into a VLM to pick companion positions and social distances, then hands those targets to an MPPI controller for safe motion. That specific stack for non-fixed formations is the concrete step it takes.

It handles a real practical problem: groups change shape, and fixed-formation methods break. The pipeline description is straightforward and the choice of MPPI for stability is reasonable given the safety needs.

The reported 15 % success gain and 25 % collision drop come from five scenarios plus a user study on perceived naturalness. Those numbers are presented as evidence that the method works.

The main weakness is that nothing isolates the VLM step. There is no metric for how accurately the VLM predicts positions or distances, no failure cases for its outputs, and no ablation that turns the VLM off. The headline improvements could come from the controller, the baselines chosen, or scene-specific tuning instead. The abstract also gives no error bars, no statistical tests, and no explicit comparison to earlier VLM-robot papers.

This is for people already working on group navigation or VLM applications in robotics. They might pick up the perceptual-module idea or the MPPI pairing, but they will not get a clear answer on whether the VLM reasoning is reliable.

Send it to review. The integration is worth a closer look and referees can request the missing VLM-specific measurements, but the current evidence does not yet pin the gains on the stated core mechanism.

Referee Report

2 major / 1 minor

Summary. The paper proposes an adaptive group-accompaniment method for social robots that detects group members, uses a perceptual module to generate visual representations of the interaction space, feeds these to a Vision-Language Model (VLM) to infer companion positions, social distances, and group dynamics, and integrates the VLM output with a Model Predictive Path Integral (MPPI) controller for stability and safety. It claims that experiments across five scenarios demonstrate a 15% improvement in success rate and 25% reduction in collision rate versus baselines, with a user study indicating that the behaviors are perceived as natural and socially appropriate.

Significance. If the central claim holds after the VLM contribution is isolated and the experimental protocol is fully documented, the work would offer a concrete demonstration of combining semantic VLM reasoning with receding-horizon control for dynamic social navigation; this could inform future designs that move beyond fixed-formation assumptions in group-following tasks.

major comments (2)

[Abstract] Abstract, results paragraph: the stated 15% success-rate improvement and 25% collision-rate reduction are presented without any description of the five scenarios, the baseline methods, the precise VLM prompting or fine-tuning procedure, error bars, or statistical tests; consequently the numerical claims cannot be verified and the data cannot be shown to support the headline result.
[Method] Method description (abstract): the central mechanism asserts that the VLM, given perceptual-module visual representations, produces reliable companion positions, social distances, and dynamic targets that the MPPI controller can track; yet no isolated metric (position error, prompt-consistency score, or VLM failure-case analysis) is supplied, so aggregate success/collision figures cannot attribute gains to the VLM inference step rather than to the MPPI safety layer or baseline weaknesses.

minor comments (1)

[Abstract] The abstract would be clearer if it briefly indicated the output format of the perceptual module (e.g., image patches, bounding-box overlays, or scene graphs) before describing VLM input.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback highlighting the need for greater clarity in the abstract and stronger isolation of the VLM contribution. We address each major comment below.

read point-by-point responses

Referee: [Abstract] Abstract, results paragraph: the stated 15% success-rate improvement and 25% collision-rate reduction are presented without any description of the five scenarios, the baseline methods, the precise VLM prompting or fine-tuning procedure, error bars, or statistical tests; consequently the numerical claims cannot be verified and the data cannot be shown to support the headline result.

Authors: The abstract is intentionally concise, but the five scenarios, baseline methods, VLM prompting procedure, error bars, and statistical tests are fully documented in the Experimental Evaluation section. We will revise the abstract to briefly name the scenarios and baselines while directing readers to the full details, error bars, and significance tests in the body. This improves verifiability from the abstract without violating length limits. revision: partial
Referee: [Method] Method description (abstract): the central mechanism asserts that the VLM, given perceptual-module visual representations, produces reliable companion positions, social distances, and dynamic targets that the MPPI controller can track; yet no isolated metric (position error, prompt-consistency score, or VLM failure-case analysis) is supplied, so aggregate success/collision figures cannot attribute gains to the VLM inference step rather than to the MPPI safety layer or baseline weaknesses.

Authors: The reported gains are measured against baselines that omit the VLM perceptual module, so the performance delta is attributable to the addition of VLM reasoning within the integrated pipeline. To strengthen attribution, we will add a dedicated VLM evaluation subsection reporting position error and prompt-consistency metrics on the perceptual outputs. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical method with no derivation chain or fitted predictions

full rationale

The paper describes an empirical robotics method combining a perceptual module, VLM for position inference, and MPPI controller, evaluated via success/collision rates and user study across scenarios. No equations, parameter fitting, predictions, or uniqueness theorems are referenced in the provided text. Claims rest on experimental outcomes rather than any self-referential reduction of outputs to inputs by construction. Self-citation patterns are absent from the abstract and description. This matches the default non-circular case for applied empirical work.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the unexamined assumption that current VLMs possess reliable semantic reasoning about social group geometry; no free parameters, new entities, or additional axioms are stated in the abstract.

axioms (1)

domain assumption Vision-language models possess semantic reasoning capabilities sufficient to infer companion positions and group dynamics from visual group-space representations
Invoked as the core mechanism that turns perception into target positions

pith-pipeline@v0.9.1-grok · 5711 in / 1300 out tokens · 23928 ms · 2026-07-03T20:35:51.039653+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

31 extracted references · 31 canonical work pages

[1]

A survey on socially aware robot navigation: Taxonomy and future challenges,

P. T. Singamaneni, P. Bachiller-Burgos, L. J. Manso, A. Garrell, A. Sanfeliu, A. Spalanzani, and R. Alami, “A survey on socially aware robot navigation: Taxonomy and future challenges,”The International Journal of Robotics Research, vol. 43, no. 10, pp. 1533–1572, 2024

work page 2024
[2]

Person-following by autonomous robots: A categorical overview,

M. J. Islam, J. Hong, and J. Sattar, “Person-following by autonomous robots: A categorical overview,”The International Journal of Robotics Research, vol. 38, no. 14, pp. 1581–1618, 2019

work page 2019
[3]

Human–robot compan- ionship: Current trends and future agenda,

E. Ahmed, O. O. Buruk, and J. Hamari, “Human–robot compan- ionship: Current trends and future agenda,”International Journal of Social Robotics, vol. 16, no. 8, pp. 1809–1860, 2024

work page 2024
[4]

The human- following strategy for mobile robots in mixed environments,

N. V . Toan, M. Do Hoang, P. B. Khoi, and S.-Y . Yi, “The human- following strategy for mobile robots in mixed environments,”Robotics and Autonomous Systems, vol. 160, p. 104317, 2023

work page 2023
[5]

Collision-free navigation in human-following task using a cognitive robotic system on differential drive vehicles,

C. V . Dang, H. Ahn, J.-W. Kim, and S. C. Lee, “Collision-free navigation in human-following task using a cognitive robotic system on differential drive vehicles,”IEEE Transactions on Cognitive and Developmental Systems, vol. 15, no. 1, pp. 78–87, 2023

work page 2023
[6]

Anticipatory control on human-following robots using online deep-model predictive control,

S. Gui and Y . Luximon, “Anticipatory control on human-following robots using online deep-model predictive control,”IEEE Transactions on Industrial Electronics, vol. 72, no. 2, pp. 1702–1711, 2025

work page 2025
[7]

Walking together: Side-by- side walking model for an interacting robot,

Y . Morales, T. Kanda, and N. Hagita, “Walking together: Side-by- side walking model for an interacting robot,”J. Hum.-Robot Interact., vol. 3, no. 2, p. 50–73, Jul. 2014

work page 2014
[8]

Robust side following robotic wheelchair by using homotopy class of human intention,

K. Y . Tan, N. P. Garg, M. Ramanathan, and W. T. Ang, “Robust side following robotic wheelchair by using homotopy class of human intention,”IEEE Robotics and Automation Letters, vol. 10, no. 5, pp. 5018–5025, 2025

work page 2025
[9]

MPC-based human-accompanying control strategy for improving the motion coor- dination between the target person and the robot,

J. Peng, Z. Liao, H. Yao, Z. Su, Y . Zeng, and H. Dai, “MPC-based human-accompanying control strategy for improving the motion coor- dination between the target person and the robot,” in2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023, pp. 7969–7975

work page 2023
[10]

Autonomous navigation for human-following robots based on optimized position tracking,

C.-T. Vu, H.-H. Huang, and Y .-C. Liu, “Autonomous navigation for human-following robots based on optimized position tracking,” in 2025 10th International Conference on Control and Robotics Engi- neering (ICCRE), 2025, pp. 23–27

work page 2025
[11]

Uncertainty-aware non-linear model predictive control for human-following companion robot,

S. Sekiguchi, A. Yorozu, K. Kuno, M. Okada, Y . Watanabe, and M. Takahashi, “Uncertainty-aware non-linear model predictive control for human-following companion robot,” in2021 IEEE International Conference on Robotics and Automation (ICRA), 2021, pp. 8316– 8322

work page 2021
[12]

Adapting to frequent human direction changes in autonomous frontal following robots,

S. Leisiazar, S. R. R. Rohani, E. J. Park, A. Lim, and M. Chen, “Adapting to frequent human direction changes in autonomous frontal following robots,”IEEE Robotics and Automation Letters, vol. 10, no. 3, pp. 2934–2941, 2025

work page 2025
[13]

A dual closed-loop control strategy for human-following robots respecting social space,

J. Peng, Z. Liao, Z. Su, H. Yao, Y . Zeng, and H. Dai, “A dual closed-loop control strategy for human-following robots respecting social space,” in2024 IEEE International Conference on Robotics and Automation (ICRA), 2024, pp. 11 252–11 258

work page 2024
[14]

Adaptive social planner to accompany people in real-life dynamic environments,

E. Repiso, A. Garrell, and A. Sanfeliu, “Adaptive social planner to accompany people in real-life dynamic environments,”International Journal of Social Robotics, vol. 16, no. 6, pp. 1189–1221, 2024

work page 2024
[15]

Autonomous adjustment of tracking position in dynamic environments for human-following robots using deep reinforcement learning,

C.-T. Vu and Y .-C. Liu, “Autonomous adjustment of tracking position in dynamic environments for human-following robots using deep reinforcement learning,” in2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025, pp. 16 863–16 869

work page 2025
[16]

Obstacle-avoidant leader following with a quadruped robot,

C. Scheidemann, L. Werner, V . Reijgwart, A. Cramariuc, J. Chomarat, J.-R. Chiu, R. Siegwart, and M. Hutter, “Obstacle-avoidant leader following with a quadruped robot,” in2025 IEEE International Con- ference on Robotics and Automation (ICRA), 2025, pp. 1407–1413

work page 2025
[17]

A systematic analysis of subgroup research in pedestrian and evacuation dynamics,

W. Wu and X. Zheng, “A systematic analysis of subgroup research in pedestrian and evacuation dynamics,”IEEE Transactions on Intelligent Transportation Systems, vol. 25, no. 2, pp. 1225–1246, 2024

work page 2024
[18]

Potential for the dynamics of pedestrians in a socially interacting group,

F. Zanlungo, T. Ikeda, and T. Kanda, “Potential for the dynamics of pedestrians in a socially interacting group,”Phys. Rev. E, vol. 89, p. 012811, Jan 2014

work page 2014
[19]

Intrinsic group behaviour ii: On the dependence of triad spatial dynamics on social and personal features; and on the effect of social interaction on small group dynamics,

F. Zanlungo, Z. Y ¨ucel, and T. Kanda, “Intrinsic group behaviour ii: On the dependence of triad spatial dynamics on social and personal features; and on the effect of social interaction on small group dynamics,”PloS one, vol. 14, no. 12, p. e0225704, 2019

work page 2019
[20]

Group- aware robot navigation in crowds using spatio-temporal graph atten- tion network with deep reinforcement learning,

X. Lu, A. Faragasso, Y . Wang, A. Yamashita, and H. Asama, “Group- aware robot navigation in crowds using spatio-temporal graph atten- tion network with deep reinforcement learning,”IEEE Robotics and Automation Letters, vol. 10, no. 4, pp. 4140–4147, 2025

work page 2025
[21]

Movement coordination in hu- man–robot teams: A dynamical systems approach,

T. Iqbal, S. Rack, and L. D. Riek, “Movement coordination in hu- man–robot teams: A dynamical systems approach,”IEEE Transactions on Robotics, vol. 32, no. 4, pp. 909–919, 2016

work page 2016
[22]

People’s v-formation and side-by-side model adapted to accompany groups of people by social robots,

E. Repiso, F. Zanlungo, T. Kanda, A. Garrell, and A. Sanfeliu, “People’s v-formation and side-by-side model adapted to accompany groups of people by social robots,” in2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2019, pp. 2082– 2088

work page 2019
[23]

People’s adaptive side-by-side model evolved to accompany groups of people by social robots,

E. Repiso, A. Garrell, and A. Sanfeliu, “People’s adaptive side-by-side model evolved to accompany groups of people by social robots,”IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 2387–2394, 2020

work page 2020
[24]

Adaptive social planner to accompany people in real-life dynamic environments,

——, “Adaptive social planner to accompany people in real-life dynamic environments,”International Journal of Social Robotics, vol. 16, no. 6, pp. 1189–1221, 2024

work page 2024
[25]

Following is all you need: Robot crowd navigation using people as planners,

Y . Liao, X. Xu, R. Bai, Y . Yang, M. Cao, S. Yuan, and L. Xie, “Following is all you need: Robot crowd navigation using people as planners,”IEEE Robotics and Automation Letters, vol. 10, no. 10, pp. 9814–9821, 2025

work page 2025
[26]

VLM-Social-Nav: Socially aware robot navigation through scoring using vision-language models,

D. Song, J. Liang, A. Payandeh, A. H. Raj, X. Xiao, and D. Manocha, “VLM-Social-Nav: Socially aware robot navigation through scoring using vision-language models,”IEEE Robotics and Automation Let- ters, vol. 10, no. 1, pp. 508–515, 2025

work page 2025
[27]

OLiVia-Nav: an online lifelong vision language approach for mobile robot social navigation,

S. Narasimhan, A. H. Tan, D. Choi, and G. Nejat, “OLiVia-Nav: an online lifelong vision language approach for mobile robot social navigation,” in2025 IEEE International Conference on Robotics and Automation (ICRA), 2025, pp. 9130–9137

work page 2025
[28]

GSON: a group-based social navigation framework with large multimodal model,

S. Luo, P. Sun, J. Zhu, Y . Deng, C. Yu, A. Xiao, and X. Wang, “GSON: a group-based social navigation framework with large multimodal model,”IEEE Robotics and Automation Letters, vol. 10, no. 10, pp. 9646–9653, 2025

work page 2025
[29]

Context-aware adaptive pesticide spraying for agricultural robots under changing weather and terrain using vision–language models,

C.-T. Vu and Y .-C. Liu, “Context-aware adaptive pesticide spraying for agricultural robots under changing weather and terrain using vision–language models,”Computers and Electronics in Agriculture, vol. 252, p. 112092, 2026

work page 2026
[30]

Pointpillars: Fast encoders for object detection from point clouds,

A. H. Lang, S. V ora, H. Caesar, L. Zhou, J. Yang, and O. Beijbom, “Pointpillars: Fast encoders for object detection from point clouds,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 12 697–12 705

work page 2019
[31]

Shield model predictive path integral: A computationally efficient robust mpc method using control barrier functions,

J. Yin, C. Dawson, C. Fan, and P. Tsiotras, “Shield model predictive path integral: A computationally efficient robust mpc method using control barrier functions,”IEEE Robotics and Automation Letters, vol. 8, no. 11, pp. 7106–7113, 2023

work page 2023

[1] [1]

A survey on socially aware robot navigation: Taxonomy and future challenges,

P. T. Singamaneni, P. Bachiller-Burgos, L. J. Manso, A. Garrell, A. Sanfeliu, A. Spalanzani, and R. Alami, “A survey on socially aware robot navigation: Taxonomy and future challenges,”The International Journal of Robotics Research, vol. 43, no. 10, pp. 1533–1572, 2024

work page 2024

[2] [2]

Person-following by autonomous robots: A categorical overview,

M. J. Islam, J. Hong, and J. Sattar, “Person-following by autonomous robots: A categorical overview,”The International Journal of Robotics Research, vol. 38, no. 14, pp. 1581–1618, 2019

work page 2019

[3] [3]

Human–robot compan- ionship: Current trends and future agenda,

E. Ahmed, O. O. Buruk, and J. Hamari, “Human–robot compan- ionship: Current trends and future agenda,”International Journal of Social Robotics, vol. 16, no. 8, pp. 1809–1860, 2024

work page 2024

[4] [4]

The human- following strategy for mobile robots in mixed environments,

N. V . Toan, M. Do Hoang, P. B. Khoi, and S.-Y . Yi, “The human- following strategy for mobile robots in mixed environments,”Robotics and Autonomous Systems, vol. 160, p. 104317, 2023

work page 2023

[5] [5]

Collision-free navigation in human-following task using a cognitive robotic system on differential drive vehicles,

C. V . Dang, H. Ahn, J.-W. Kim, and S. C. Lee, “Collision-free navigation in human-following task using a cognitive robotic system on differential drive vehicles,”IEEE Transactions on Cognitive and Developmental Systems, vol. 15, no. 1, pp. 78–87, 2023

work page 2023

[6] [6]

Anticipatory control on human-following robots using online deep-model predictive control,

S. Gui and Y . Luximon, “Anticipatory control on human-following robots using online deep-model predictive control,”IEEE Transactions on Industrial Electronics, vol. 72, no. 2, pp. 1702–1711, 2025

work page 2025

[7] [7]

Walking together: Side-by- side walking model for an interacting robot,

Y . Morales, T. Kanda, and N. Hagita, “Walking together: Side-by- side walking model for an interacting robot,”J. Hum.-Robot Interact., vol. 3, no. 2, p. 50–73, Jul. 2014

work page 2014

[8] [8]

Robust side following robotic wheelchair by using homotopy class of human intention,

K. Y . Tan, N. P. Garg, M. Ramanathan, and W. T. Ang, “Robust side following robotic wheelchair by using homotopy class of human intention,”IEEE Robotics and Automation Letters, vol. 10, no. 5, pp. 5018–5025, 2025

work page 2025

[9] [9]

MPC-based human-accompanying control strategy for improving the motion coor- dination between the target person and the robot,

J. Peng, Z. Liao, H. Yao, Z. Su, Y . Zeng, and H. Dai, “MPC-based human-accompanying control strategy for improving the motion coor- dination between the target person and the robot,” in2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023, pp. 7969–7975

work page 2023

[10] [10]

Autonomous navigation for human-following robots based on optimized position tracking,

C.-T. Vu, H.-H. Huang, and Y .-C. Liu, “Autonomous navigation for human-following robots based on optimized position tracking,” in 2025 10th International Conference on Control and Robotics Engi- neering (ICCRE), 2025, pp. 23–27

work page 2025

[11] [11]

Uncertainty-aware non-linear model predictive control for human-following companion robot,

S. Sekiguchi, A. Yorozu, K. Kuno, M. Okada, Y . Watanabe, and M. Takahashi, “Uncertainty-aware non-linear model predictive control for human-following companion robot,” in2021 IEEE International Conference on Robotics and Automation (ICRA), 2021, pp. 8316– 8322

work page 2021

[12] [12]

Adapting to frequent human direction changes in autonomous frontal following robots,

S. Leisiazar, S. R. R. Rohani, E. J. Park, A. Lim, and M. Chen, “Adapting to frequent human direction changes in autonomous frontal following robots,”IEEE Robotics and Automation Letters, vol. 10, no. 3, pp. 2934–2941, 2025

work page 2025

[13] [13]

A dual closed-loop control strategy for human-following robots respecting social space,

J. Peng, Z. Liao, Z. Su, H. Yao, Y . Zeng, and H. Dai, “A dual closed-loop control strategy for human-following robots respecting social space,” in2024 IEEE International Conference on Robotics and Automation (ICRA), 2024, pp. 11 252–11 258

work page 2024

[14] [14]

Adaptive social planner to accompany people in real-life dynamic environments,

E. Repiso, A. Garrell, and A. Sanfeliu, “Adaptive social planner to accompany people in real-life dynamic environments,”International Journal of Social Robotics, vol. 16, no. 6, pp. 1189–1221, 2024

work page 2024

[15] [15]

Autonomous adjustment of tracking position in dynamic environments for human-following robots using deep reinforcement learning,

C.-T. Vu and Y .-C. Liu, “Autonomous adjustment of tracking position in dynamic environments for human-following robots using deep reinforcement learning,” in2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025, pp. 16 863–16 869

work page 2025

[16] [16]

Obstacle-avoidant leader following with a quadruped robot,

C. Scheidemann, L. Werner, V . Reijgwart, A. Cramariuc, J. Chomarat, J.-R. Chiu, R. Siegwart, and M. Hutter, “Obstacle-avoidant leader following with a quadruped robot,” in2025 IEEE International Con- ference on Robotics and Automation (ICRA), 2025, pp. 1407–1413

work page 2025

[17] [17]

A systematic analysis of subgroup research in pedestrian and evacuation dynamics,

W. Wu and X. Zheng, “A systematic analysis of subgroup research in pedestrian and evacuation dynamics,”IEEE Transactions on Intelligent Transportation Systems, vol. 25, no. 2, pp. 1225–1246, 2024

work page 2024

[18] [18]

Potential for the dynamics of pedestrians in a socially interacting group,

F. Zanlungo, T. Ikeda, and T. Kanda, “Potential for the dynamics of pedestrians in a socially interacting group,”Phys. Rev. E, vol. 89, p. 012811, Jan 2014

work page 2014

[19] [19]

Intrinsic group behaviour ii: On the dependence of triad spatial dynamics on social and personal features; and on the effect of social interaction on small group dynamics,

F. Zanlungo, Z. Y ¨ucel, and T. Kanda, “Intrinsic group behaviour ii: On the dependence of triad spatial dynamics on social and personal features; and on the effect of social interaction on small group dynamics,”PloS one, vol. 14, no. 12, p. e0225704, 2019

work page 2019

[20] [20]

Group- aware robot navigation in crowds using spatio-temporal graph atten- tion network with deep reinforcement learning,

X. Lu, A. Faragasso, Y . Wang, A. Yamashita, and H. Asama, “Group- aware robot navigation in crowds using spatio-temporal graph atten- tion network with deep reinforcement learning,”IEEE Robotics and Automation Letters, vol. 10, no. 4, pp. 4140–4147, 2025

work page 2025

[21] [21]

Movement coordination in hu- man–robot teams: A dynamical systems approach,

T. Iqbal, S. Rack, and L. D. Riek, “Movement coordination in hu- man–robot teams: A dynamical systems approach,”IEEE Transactions on Robotics, vol. 32, no. 4, pp. 909–919, 2016

work page 2016

[22] [22]

People’s v-formation and side-by-side model adapted to accompany groups of people by social robots,

E. Repiso, F. Zanlungo, T. Kanda, A. Garrell, and A. Sanfeliu, “People’s v-formation and side-by-side model adapted to accompany groups of people by social robots,” in2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2019, pp. 2082– 2088

work page 2019

[23] [23]

People’s adaptive side-by-side model evolved to accompany groups of people by social robots,

E. Repiso, A. Garrell, and A. Sanfeliu, “People’s adaptive side-by-side model evolved to accompany groups of people by social robots,”IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 2387–2394, 2020

work page 2020

[24] [24]

Adaptive social planner to accompany people in real-life dynamic environments,

——, “Adaptive social planner to accompany people in real-life dynamic environments,”International Journal of Social Robotics, vol. 16, no. 6, pp. 1189–1221, 2024

work page 2024

[25] [25]

Following is all you need: Robot crowd navigation using people as planners,

Y . Liao, X. Xu, R. Bai, Y . Yang, M. Cao, S. Yuan, and L. Xie, “Following is all you need: Robot crowd navigation using people as planners,”IEEE Robotics and Automation Letters, vol. 10, no. 10, pp. 9814–9821, 2025

work page 2025

[26] [26]

VLM-Social-Nav: Socially aware robot navigation through scoring using vision-language models,

D. Song, J. Liang, A. Payandeh, A. H. Raj, X. Xiao, and D. Manocha, “VLM-Social-Nav: Socially aware robot navigation through scoring using vision-language models,”IEEE Robotics and Automation Let- ters, vol. 10, no. 1, pp. 508–515, 2025

work page 2025

[27] [27]

OLiVia-Nav: an online lifelong vision language approach for mobile robot social navigation,

S. Narasimhan, A. H. Tan, D. Choi, and G. Nejat, “OLiVia-Nav: an online lifelong vision language approach for mobile robot social navigation,” in2025 IEEE International Conference on Robotics and Automation (ICRA), 2025, pp. 9130–9137

work page 2025

[28] [28]

GSON: a group-based social navigation framework with large multimodal model,

S. Luo, P. Sun, J. Zhu, Y . Deng, C. Yu, A. Xiao, and X. Wang, “GSON: a group-based social navigation framework with large multimodal model,”IEEE Robotics and Automation Letters, vol. 10, no. 10, pp. 9646–9653, 2025

work page 2025

[29] [29]

Context-aware adaptive pesticide spraying for agricultural robots under changing weather and terrain using vision–language models,

C.-T. Vu and Y .-C. Liu, “Context-aware adaptive pesticide spraying for agricultural robots under changing weather and terrain using vision–language models,”Computers and Electronics in Agriculture, vol. 252, p. 112092, 2026

work page 2026

[30] [30]

Pointpillars: Fast encoders for object detection from point clouds,

A. H. Lang, S. V ora, H. Caesar, L. Zhou, J. Yang, and O. Beijbom, “Pointpillars: Fast encoders for object detection from point clouds,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 12 697–12 705

work page 2019

[31] [31]

Shield model predictive path integral: A computationally efficient robust mpc method using control barrier functions,

J. Yin, C. Dawson, C. Fan, and P. Tsiotras, “Shield model predictive path integral: A computationally efficient robust mpc method using control barrier functions,”IEEE Robotics and Automation Letters, vol. 8, no. 11, pp. 7106–7113, 2023

work page 2023