Awesome-Interaction-Aware-Trajectory-Prediction


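Awesome-Interaction-Aware-Trajectory-Prediction is a curated resource list for interaction-aware trajectory prediction, maintained by researchers at Stanford University and the University of California, Berkeley. In autonomous driving, robot navigation, and crowd simulation, accurately predicting the future motion of agents such as vehicles and pedestrians is critical, and the hard part is modeling how individuals influence one another. The list addresses this by systematically collecting recent research: datasets (such as Waymo and Argoverse), survey papers, algorithm code, and evaluation benchmarks.

Its distinguishing focus is that it goes beyond the motion of a single target and emphasizes methods that model and reason about dynamic interactions among multiple agents, which independent-trajectory predictors ignore. Academic researchers looking for recent literature and industry engineers who need to reproduce state-of-the-art (SOTA) models or obtain training data can both use it as a reference. It is an open-source project that is still being updated, and it aims to narrow the information gap between academia and industry.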

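Use case

An algorithm team at an autonomous-driving startup is building the prediction module for complex urban intersections and needs to improve how well the vehicle anticipates the interactive behavior of pedestrians and other vehicles.

Without Awesome-Interaction-Aware-Trajectory-Prediction

  • Time-consuming data selection: engineers spend weeks combing the literature by hand for high-quality datasets that suit mixed pedestrian-vehicle scenes (such as nuScenes or Argoverse).
  • Blind model selection: without a systematic overview of recent interaction-aware algorithms (such as dynamic relational reasoning), the team falls back on outdated independent-trajectory prediction models, which perform poorly when agents negotiate right of way at intersections and so raise the accident risk.
  • Missing evaluation standards: with no unified benchmarks or reference metrics, it is hard to quantify how a new model actually performs under complex interaction, so the direction of iteration stays unclear.
  • Hard-to-reproduce code: official open-source code and related technical blogs are hard to find, and implementing a paper's algorithm from scratch takes a long time and often underperforms because details are missing.

With Awesome-Interaction-Aware-Trajectory-Prediction

  • One-stop resource access: using the clearly categorized "Vehicles and Traffic" dataset section, the team quickly settles on the Waymo Open Dataset, which includes both LiDAR and camera data.
  • Pinpointing SOTA methods: through the "Intelligent Vehicles and Pedestrians" section, the team quickly finds recent papers on dynamic multi-agent relations (such as EvolveGraph) and uses them to improve prediction accuracy at intersections.
  • A principled evaluation setup: drawing on the standard metrics in the "Benchmark and Evaluation Metrics" section, the team builds an objective framework for comparing models and identifies what to optimize.
  • Faster engineering delivery: with the public code links and technical surveys attached to the list, the engineers reproduce and fine-tune a baseline model within days, which shortens the development cycle considerably.

By gathering interaction-aware trajectory prediction resources in one place, Awesome-Interaction-Aware-Trajectory-Prediction frees the team from tedious information gathering so that it can concentrate on developing and deploying the core algorithms.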
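
The scenario above refers to the standard metrics in the list's "Benchmark and Evaluation Metrics" section without naming them. As an illustration only, the sketch below computes two displacement errors that are widely reported in trajectory prediction work: average displacement error (ADE) and final displacement error (FDE). The choice of metrics, the function name `displacement_errors`, and the toy trajectories are assumptions made for this example and are not taken from the repository.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]  # (x, y) position at one time step


def displacement_errors(pred: List[Point], truth: List[Point]) -> Tuple[float, float]:
    """Return (ADE, FDE) for one predicted trajectory against the ground truth.

    ADE is the mean Euclidean distance over all time steps;
    FDE is the Euclidean distance at the final time step.
    """
    if not pred or len(pred) != len(truth):
        raise ValueError("trajectories must be non-empty and equally long")
    dists = [math.dist(p, t) for p, t in zip(pred, truth)]
    ade = sum(dists) / len(dists)
    fde = dists[-1]
    return ade, fde


# Toy data (hypothetical): the agent drives straight, the model predicts a drift.
truth = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
pred = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
ade, fde = displacement_errors(pred, truth)
print(f"ADE={ade:.2f} FDE={fde:.2f}")  # ADE=1.00 FDE=2.00
```

Computing the same two numbers for every candidate model on the same held-out scenes gives a simple common yardstick; multi-modal predictors are often scored with best-of-K variants of these errors, which this sketch does not cover.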

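Runtime requirements

GPU: not specified

Memory: not specified

Python: not specified

Dependencies: not specified

Notes: This repository is a resource list (an "awesome list") that collects links to datasets, papers, and open-source code for trajectory prediction. It is not a standalone runnable tool, so its README gives no operating system, hardware, or dependency installation requirements. Consult the documentation of each referenced project (for example EvolveGraph) for its own environment requirements.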

Quick start

Awesome Interaction-aware Behavior and Trajectory Prediction


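This is a checklist of state-of-the-art research materials (datasets, blogs, papers, and public code) related to trajectory prediction. We hope it is helpful to both academia and industry. (Still being updated.)

Maintainers: Jiachen Li (Stanford University); Hengbo Ma, Jinning Li (University of California, Berkeley)

Emails: jiachen_li@stanford.edu; {hengbo_ma, jinning_li}@berkeley.edu

Please feel free to submit a pull request to add new resources, or send us an email with questions, discussion, or collaboration ideas.

Note: A separate collection of materials on reinforcement learning, decision making, and motion planning is also available.

If you find this repository useful, please consider citing our work: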

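@inproceedings{li2020evolvegraph,
  title={EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning},
  author={Li, Jiachen and Yang, Fan and Tomizuka, Masayoshi and Choi, Chiho},
  booktitle={2020 Advances in Neural Information Processing Systems (NeurIPS)},
  year={2020}
}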

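@inproceedings{li2019conditional,
  title={Conditional Generative Neural System for Probabilistic Trajectory Prediction},
  author={Li, Jiachen and Ma, Hengbo and Tomizuka, Masayoshi},
  booktitle={2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
  pages={6150--6156},
  year={2019},
  organization={IEEE}
}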


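Datasets

Vehicles and Traffic

| Dataset | Agents | Scenarios | Sensors |
| --- | --- | --- | --- |
| Waymo Open Dataset | Vehicles / Cyclists / Pedestrians | Urban / Highway | LiDAR / Camera / Radar |
| Argoverse | Vehicles / Cyclists / Pedestrians | Urban / Highway | LiDAR / Camera / Radar |
| nuScenes | Vehicles | Urban | Camera / LiDAR / Radar |
| highD | Vehicles | Highway | Camera |
| inD | Vehicles | Highway | Camera |
| roundD | Vehicles | Highway | Camera |
| BDD100k | Vehicles / Cyclists / Pedestrians | Highway / Urban | Camera |
| KITTI | Vehicles / Cyclists / Pedestrians | Highway / Rural areas | Camera / LiDAR |
| NGSIM | Vehicles | Highway | Camera |
| INTERACTION | Vehicles / Cyclists / Pedestrians | Roundabout / Intersection | Camera |
| Cyclists dataset | Cyclists | Urban | Camera |
| Apolloscapes | Vehicles / Cyclists / Pedestrians | Urban | Camera |
| Udacity | Vehicles | Urban | Camera |
| Cityscapes | Vehicles / Pedestrians | Urban | Camera |
| Stanford Drone Dataset | Vehicles / Cyclists / Pedestrians | Urban | Camera |
| Argoverse | Vehicles / Pedestrians | Urban | Camera / LiDAR |
| TRAF | Vehicles / Buses / Cyclists / Motorcyclists / Pedestrians / Animals | Urban | Camera |
| Aschaffenburg Pose Dataset | Cyclists / Pedestrians | Urban | Camera |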

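Pedestrians

| Dataset | Agents | Scenarios | Sensors |
| --- | --- | --- | --- |
| UCY | | Zara / Students | Camera |
| ETH (ICCV09) | | Urban | Camera |
| VIRAT | People / Vehicles | Urban | Camera |
| KITTI | Vehicles / Cyclists / People | Highway / Rural areas | Camera / LiDAR |
| ATC | | Shopping center | Range sensor |
| Daimler | | From a moving vehicle | Camera |
| Central Station | | Inside station | Camera |
| Town Center | | Urban street | Camera |
| Edinburgh | | Urban | Camera |
| Cityscapes | Vehicles / People | Urban | Camera |
| Argoverse | Vehicles / People | Urban | Camera / LiDAR |
| Stanford Drone | Vehicles / Cyclists / People | Urban | Camera |
| TrajNet | | Urban | Camera |
| PIE | | Urban | Camera |
| ForkingPaths | | Urban / Simulation | Camera |
| TrajNet++ | | Urban | Camera |
| Aschaffenburg Pose Dataset | Cyclists / People | Urban | Camera |
| Cyclist Top-View Dataset (CTV) | Cyclists / People | Urban | Camera |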

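Sport Players

| Dataset | Agents | Scenarios | Sensors |
| --- | --- | --- | --- |
| Football | | Football field | Camera |
| NBA SportVU | | Basketball hall | Camera |
| NFL | | American football | Camera |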

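Literature and Codes

Survey Papers

  • Machine Learning for Autonomous Vehicle's Trajectory Prediction: A Comprehensive Survey, Challenges, and Future Research Directions, arXiv preprint arXiv:2307.07527, 2023. [paper]
  • Incorporating Driving Knowledge in Deep Learning Based Vehicle Trajectory Prediction: A Survey, IEEE T-IV, 2023. [paper]
  • Pedestrian Trajectory Prediction in Pedestrian-Vehicle Mixed Environments: A Systematic Review, IEEE T-ITS, 2023. [paper]
  • A Survey on Trajectory-Prediction Methods for Autonomous Driving, IEEE T-IV 2022. [paper]
  • A Survey of Vehicle Trajectory Prediction Based on Deep Learning Models, International Conference on Sustainable Expert Systems, ICSES 2022. [paper]
  • Scenario Understanding and Motion Prediction for Autonomous Vehicles - Review and Comparison, IEEE T-ITS, 2022. [paper]
  • Multi-modal Fusion Technology Based on Vehicle Information: A Survey, arXiv preprint arXiv:2211.06080, 2022. [paper]
  • Deep Reinforcement Learning for Autonomous Driving: A Survey, IEEE T-ITS, 2022. [paper]
  • Social Interactions for Autonomous Driving: A Review and Perspectives, arXiv preprint arXiv:2208.07541, 2022. [paper]
  • Generative Adversarial Networks for Spatio-temporal Data: A Survey, ACM T-IST, 2022. [paper]
  • Behavioral Intention Prediction in Driving Scenes: A Survey, arXiv preprint arXiv:2211.00385, 2022. [paper]
  • A Survey on Motion Prediction of Pedestrians and Vehicles for Autonomous Driving, IEEE Access, 2021. [paper]
  • Review of Pedestrian Trajectory Prediction Methods: Comparing Deep Learning and Knowledge-based Approaches, arXiv preprint arXiv:2111.06740, 2021. [paper]
  • A Survey on Trajectory Data Management, Analytics, and Learning, CSUR 2021. [paper]
  • Pedestrian Behavior Prediction for Automated Driving: Requirements, Metrics, and Relevant Features, IEEE T-ITS, 2021. [paper]
  • A Review of Deep Learning-Based Methods for Pedestrian Trajectory Prediction, Sensors, 2021. [paper]
  • A Survey on Deep-Learning Approaches for Vehicle Trajectory Prediction in Autonomous Driving, ROBIO 2021. [paper] [code]
  • A Survey of Deep Learning Techniques for Autonomous Driving, Journal of Field Robotics, 2020. [paper]
  • Human Motion Trajectory Prediction: A Survey, International Journal of Robotics Research, 2020. [paper]
  • Autonomous Driving with Deep Learning: A Survey of State-of-Art Technologies, arXiv preprint arXiv:2006.06091, 2020. [paper]
  • A Survey on Visual Traffic Simulation: Models, Evaluations, and Applications in Autonomous Driving, Computer Graphics Forum, 2020. [paper]
  • Deep Learning-based Vehicle Behaviour Prediction for Autonomous Driving Applications: A Review, IEEE T-ITS 2020. [paper]
  • Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles, IEEE T-ITS 2020. [paper]
  • Vehicle Trajectory Similarity: Models, Methods, and Applications, ACM Computing Surveys (CSUR 2020). [paper]
  • Modeling and Prediction of Human Driver Behavior: A Survey, 2020. [paper]
  • A Literature Review on the Prediction of Pedestrian Behavior in Urban Scenarios, ITSC 2018. [paper]
  • Survey on Vision-based Path Prediction. [paper]
  • Autonomous Vehicles that Interact with Pedestrians: A Survey of Theory and Practice. [paper]
  • Trajectory Data Mining: An Overview. [paper]
  • A Survey on Motion Prediction and Risk Assessment for Intelligent Vehicles. [paper]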

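Physics Systems with Interaction

  • Learning Physical Dynamics with Subequivariant Graph Neural Networks, NeurIPS 2022. [paper] [code]
  • EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning, NeurIPS 2020. [paper]
  • Interaction Templates for Multi-Robot Systems, IROS 2019. [paper]
  • Factorised Neural Relational Inference for Multi-Interaction Systems, ICML workshop 2019. [paper] [code]
  • Physics-as-Inverse-Graphics: Joint Unsupervised Learning of Objects and Physics from Video, 2019. [paper]
  • Neural Relational Inference for Interacting Systems, ICML 2018. [paper] [code]
  • Unsupervised Learning of Latent Physical Properties Using Perception-Prediction Networks, UAI 2018. [paper]
  • Relational Inductive Biases, Deep Learning, and Graph Networks, 2018. [paper]
  • Relational Neural Expectation Maximization: Unsupervised Discovery of Objects and their Interactions, ICLR 2018. [paper]
  • Graph Networks as Learnable Physics Engines for Inference and Control, ICML 2018. [paper]
  • Flexible Neural Representation for Physics Prediction, 2018. [paper]
  • A Simple Neural Network Module for Relational Reasoning, 2017. [paper]
  • VAIN: Attentional Multi-agent Predictive Modeling, NeurIPS 2017. [paper]
  • Visual Interaction Networks, 2017. [paper]
  • A Compositional Object-Based Approach to Learning Physical Dynamics, ICLR 2017. [paper]
  • Interaction Networks for Learning about Objects, Relations and Physics, 2016. [paper] [code]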

Intelligent Vehicles, Traffic and Pedestrians

  • Diffusion-Based Environment-Aware Trajectory Prediction, arXiv preprint arXiv:2403.11643, 2024. [paper]
  • MTP-GO: Graph-Based Probabilistic Multi-Agent Trajectory Prediction with Neural ODEs, IEEE T-IV 2023. [paper] [code]
  • MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion, CVPR 2023. [paper]
  • Uncovering the Missing Pattern: Unified Framework Towards Trajectory Imputation and Prediction, CVPR 2023. [paper]
  • Unsupervised Sampling Promoting for Stochastic Human Trajectory Prediction, CVPR 2023. [paper] [code]
  • Planning-oriented Autonomous Driving, CVPR 2023. [paper] [code]
  • IPCC-TP: Utilizing Incremental Pearson Correlation Coefficient for Joint Multi-Agent Trajectory Prediction, CVPR 2023. [paper]
  • Stimulus Verification is a Universal and Effective Sampler in Multi-modal Human Trajectory Prediction, CVPR 2023. [paper]
  • Query-Centric Trajectory Prediction, CVPR 2023. [paper] [code] [QCNeXt]
  • FEND: A Future Enhanced Distribution-Aware Contrastive Learning Framework for Long-tail Trajectory Prediction, CVPR 2023. [paper]
  • Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion, CVPR 2023. [paper] [website]
  • FJMP: Factorized Joint Multi-Agent Motion Prediction over Learned Directed Acyclic Interaction Graphs, CVPR 2023. [paper] [website]
  • Leapfrog Diffusion Model for Stochastic Trajectory Prediction, CVPR 2023. [paper] [code]
  • ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries, CVPR 2023. [paper] [website]
  • EqMotion: Equivariant Multi-Agent Motion Prediction with Invariant Interaction Reasoning, CVPR 2023. [paper] [code]
  • V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting, CVPR 2023. [paper] [code]
  • Weakly Supervised Class-agnostic Motion Prediction for Autonomous Driving, CVPR 2023. [paper]
  • Decompose More and Aggregate Better: Two Closer Looks at Frequency Representation Learning for Human Motion Prediction, CVPR 2023. [paper]
  • HumanMAC: Masked Motion Completion for Human Motion Prediction, ICCV 2023. [paper] [code]
  • BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction, ICCV 2023. [paper] [code]
  • EigenTrajectory: Low-Rank Descriptors for Multi-Modal Trajectory Forecasting, ICCV 2023. [paper] [code]
  • ADAPT: Efficient Multi-Agent Trajectory Prediction with Adaptation, ICCV 2023. [paper] [code]
  • PowerBEV: A Powerful Yet Lightweight Framework for Instance Prediction in Bird’s-Eye View, IJCAI 2023. [paper] [code]
  • Human Joint Kinematics Diffusion-Refinement for Stochastic Motion Prediction, AAAI 2023. [paper]
  • Multi-stream Representation Learning for Pedestrian Trajectory Prediction, AAAI 2023. [paper]
  • Continuous Trajectory Generation Based on Two-Stage GAN, AAAI 2023. [paper] [code]
  • A Set of Control Points Conditioned Pedestrian Trajectory Prediction, AAAI 2023. [paper] [code]
  • Leveraging Future Relationship Reasoning for Vehicle Trajectory Prediction, ICLR 2023. [paper]
  • TrafficGen: Learning to Generate Diverse and Realistic Traffic Scenarios, ICRA 2023. [paper] [code]
  • GANet: Goal Area Network for Motion Forecasting, ICRA 2023. [paper] [code]
  • TOFG: A Unified and Fine-Grained Environment Representation in Autonomous Driving, ICRA 2023. [paper]
  • SSL-Lanes: Self-Supervised Learning for Motion Forecasting in Autonomous Driving, CoRL 2023. [paper] [code]
  • LimSim: A Long-term Interactive Multi-scenario Traffic Simulator, ITSC 2023. [paper] [code]
  • MVHGN: Multi-View Adaptive Hierarchical Spatial Graph Convolution Network Based Trajectory Prediction for Heterogeneous Traffic-Agents, TITS. [paper]
  • Adaptive and Simultaneous Trajectory Prediction for Heterogeneous Agents via Transferable Hierarchical Transformer Network, TITS. [paper]
  • SSAGCN: Social Soft Attention Graph Convolution Network for Pedestrian Trajectory Prediction, TNNLS. [paper] [code]
  • Disentangling Crowd Interactions for Pedestrians Trajectory Prediction, RAL. [paper]
  • VNAGT: Variational Non-Autoregressive Graph Transformer Network for Multi-Agent Trajectory Prediction, IEEE Transactions on Vehicular Technology. [paper]
  • Spatial-Temporal-Spectral LSTM: A Transferable Model for Pedestrian Trajectory Prediction, TIV. [paper]
  • Holistic Transformer: A Joint Neural Network for Trajectory Prediction and Decision-Making of Autonomous Vehicles, PR. [paper]
  • Tri-HGNN: Learning triple policies fused hierarchical graph neural networks for pedestrian trajectory prediction, PR. [paper]
  • Multimodal Vehicular Trajectory Prediction With Inverse Reinforcement Learning and Risk Aversion at Urban Unsignalized Intersections, TITS. [paper]
  • Trajectory prediction for autonomous driving based on multiscale spatial‐temporal graph, IET Intelligent Transport Systems. [paper]
  • Social Self-Attention Generative Adversarial Networks for Human Trajectory Prediction, IEEE Transactions on Artificial Intelligence. [paper]
  • CSIR: Cascaded Sliding CVAEs With Iterative Socially-Aware Rethinking for Trajectory Prediction, TITS. [paper]
  • Multimodal Manoeuvre and Trajectory Prediction for Automated Driving on Highways Using Transformer Networks, RAL. [paper]
  • A physics-informed Transformer model for vehicle trajectory prediction on highways, Transportation Research Part C: Emerging Technologies. [paper] [code]
  • MacFormer: Map-Agent Coupled Transformer for Real-time and Robust Trajectory Prediction, RAL. [paper]
  • MRGTraj: A Novel Non-Autoregressive Approach for Human Trajectory Prediction, TCSVT. [paper] [code]
  • Planning-inspired Hierarchical Trajectory Prediction via Lateral-Longitudinal Decomposition for Autonomous Driving, TIV. [paper]
  • Traj-MAE: Masked Autoencoders for Trajectory Prediction, arXiv preprint arXiv:2303.06697, 2023. [paper]
  • Uncertainty-Aware Pedestrian Trajectory Prediction via Distributional Diffusion, arXiv preprint arXiv:2303.08367, 2023. [paper]
  • Diffusion Model for GPS Trajectory Generation, arXiv preprint arXiv:2304.11582, 2023. [paper]
  • Multiverse Transformer: 1st Place Solution for Waymo Open Sim Agents Challenge 2023, CVPR 2023 Workshop on Autonomous Driving. [paper] [website]
  • Joint-Multipath++ for Simulation Agents: 2nd Place Solution for Waymo Open Sim Agents Challenge 2023, CVPR 2023 Workshop on Autonomous Driving. [paper] [code]
  • MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying, 1st Place Solution for Waymo Open Motion Prediction Challenge 2023, CVPR 2023 Workshop on Autonomous Driving. [paper] [code]
  • GameFormer: Game-theoretic Modeling and Learning of Transformer-based Interactive Prediction and Planning for Autonomous Driving, arXiv preprint arXiv:2303.05760, 2023. [paper] [code] [website]
  • GameFormer Planner: A Learning-enabled Interactive Prediction and Planning Framework for Autonomous Vehicles, the nuPlan Planning Challenge at the CVPR 2023 End-to-End Autonomous Driving Workshop. [paper] [code]
  • trajdata: A Unified Interface to Multiple Human Trajectory Datasets, arXiv preprint arXiv:2307.13924, 2023. [paper] [code]
  • Remember Intentions: Retrospective-Memory-based Trajectory Prediction, CVPR 2022. [paper] [code]
  • STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes, CVPR 2022. [paper] [code]
  • Vehicle trajectory prediction works, but not everywhere, CVPR 2022. [paper] [code]
  • Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion, CVPR 2022. [paper] [code]
  • Non-Probability Sampling Network for Stochastic Human Trajectory Prediction, CVPR 2022. [paper] [code]
  • On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles, CVPR 2022. [paper] [code]
  • Adaptive Trajectory Prediction via Transferable GNN, CVPR 2022. [paper]
  • Towards Robust and Adaptive Motion Forecasting: A Causal Representation Perspective, CVPR 2022. [paper] [code, code]
  • How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting, CVPR 2022. [paper]
  • Learning from All Vehicles, CVPR 2022. [paper] [code]
  • Forecasting from LiDAR via Future Object Detection, CVPR 2022. [paper] [code]
  • End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps, CVPR 2022. [paper] [code]
  • M2I: From Factored Marginal Trajectory Prediction to Interactive Prediction, CVPR 2022. [paper] [code]
  • GroupNet: Multiscale Hypergraph Neural Networks for Trajectory Prediction with Relational Reasoning, CVPR 2022. [paper] [code]
  • Whose Track Is It Anyway? Improving Robustness to Tracking Errors with Affinity-Based Prediction, CVPR 2022. [paper]
  • ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning, CVPR 2022. [paper] [code]
  • Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory Prediction, CVPR 2022. [paper] [code]
  • MUSE-VAE: Multi-Scale VAE for Environment-Aware Long Term Trajectory Prediction, CVPR 2022. [paper]
  • LTP: Lane-based Trajectory Prediction for Autonomous Driving, CVPR 2022. [paper]
  • ATPFL: Automatic Trajectory Prediction Model Design under Federated Learning Framework, CVPR 2022. [paper]
  • Human Trajectory Prediction with Momentary Observation, CVPR 2022. [paper]
  • HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction, CVPR 2022. [paper] [code]
  • Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction, ECCV 2022. [paper] [code]
  • Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation, ECCV 2022. [paper] [code] [website]
  • Hierarchical Latent Structure for Multi-Modal Vehicle Trajectory Forecasting, ECCV 2022. [paper] [code]
  • SocialVAE: Human Trajectory Prediction using Timewise Latents, ECCV 2022. [paper] [code]
  • View Vertically: A Hierarchical Network for Trajectory Prediction via Fourier Spectrums, ECCV 2022. [paper] [code]
  • Entry-Flipped Transformer for Inference and Prediction of Participant Behavior, ECCV 2022. [paper]
  • D2-TPred: Discontinuous Dependency for Trajectory Prediction under Traffic Lights, ECCV 2022. [paper] [code]
  • Human Trajectory Prediction via Neural Social Physics, ECCV 2022. [paper] [code]
  • Social-SSL: Self-Supervised Cross-Sequence Representation Learning Based on Transformers for Multi-Agent Trajectory Prediction, ECCV 2022. [paper] [code]
  • Aware of the History: Trajectory Forecasting with the Local Behavior Data, ECCV 2022. [paper] [code]
  • Action-based Contrastive Learning for Trajectory Prediction, ECCV 2022. [paper]
  • AdvDO: Realistic Adversarial Attacks for Trajectory Prediction, ECCV 2022. [paper]
  • ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning, ECCV 2022. [paper] [code]
  • Social ODE: Multi-Agent Trajectory Forecasting with Neural Ordinary Differential Equations, ECCV 2022. [paper]
  • Forecasting Human Trajectory from Scene History, NeurIPS 2022. [paper] [code]
  • Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline, NeurIPS 2022. [paper] [code]
  • Motion Transformer with Global Intention Localization and Local Movement Refinement, NeurIPS 2022. [paper] [website]
  • Interaction Modeling with Multiplex Attention, NeurIPS 2022. [paper] [code]
  • Deep Interactive Motion Prediction and Planning: Playing Games with Motion Prediction Models, Conference on Learning for Dynamics and Control (L4DC). [paper] [website]
  • Social Interpretable Tree for Pedestrian Trajectory Prediction, AAAI 2022. [paper] [code]
  • Complementary Attention Gated Network for Pedestrian Trajectory Prediction, AAAI 2022. [paper] [code]
  • Scene Transformer: A unified architecture for predicting future trajectories of multiple agents, ICLR 2022. [paper]
  • You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction, ICLR 2022. [paper]
  • Latent Variable Sequential Set Transformers For Joint Multi-Agent Motion Prediction, ICLR 2022. [paper] [code]
  • THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling, ICLR 2022. [paper]
  • Path-Aware Graph Attention for HD Maps in Motion Prediction, ICRA 2022. [paper]
  • Trajectory Prediction with Linguistic Representations, ICRA 2022. [paper]
  • Leveraging Smooth Attention Prior for Multi-Agent Trajectory Prediction, ICRA 2022. [paper] [website]
  • KEMP: Keyframe-Based Hierarchical End-to-End Deep Model for Long-Term Trajectory Prediction, ICRA 2022. [paper]
  • Domain Generalization for Vision-based Driving Trajectory Generation, ICRA 2022. [paper] [website]
  • A Deep Concept Graph Network for Interaction-Aware Trajectory Prediction, ICRA 2022. [paper]
  • Conditioned Human Trajectory Prediction using Iterative Attention Blocks, ICRA 2022. [paper]
  • StopNet: Scalable Trajectory and Occupancy Prediction for Urban Autonomous Driving, ICRA 2022. [paper]
  • Meta-path Analysis on Spatio-Temporal Graphs for Pedestrian Trajectory Prediction, ICRA 2022. [paper] [website]
  • Propagating State Uncertainty Through Trajectory Forecasting, ICRA 2022. [paper] [code]
  • HYPER: Learned Hybrid Trajectory Prediction via Factored Inference and Adaptive Sampling, ICRA 2022. [paper]
  • Grouptron: Dynamic Multi-Scale Graph Convolutional Networks for Group-Aware Dense Crowd Trajectory Forecasting, ICRA 2022. [paper]
  • Crossmodal Transformer Based Generative Framework for Pedestrian Trajectory Prediction, ICRA 2022. [paper]
  • Trajectory Prediction for Autonomous Driving with Topometric Map, ICRA 2022. [paper] [code]
  • CRAT-Pred: Vehicle Trajectory Prediction with Crystal Graph Convolutional Neural Networks and Multi-Head Self-Attention, ICRA 2022. [paper] [code]
  • MultiPath++: Efficient Information Fusion and Trajectory Aggregation for Behavior Prediction, ICRA 2022. [paper]
  • Multi-modal Motion Prediction with Transformer-based Neural Network for Autonomous Driving, ICRA 2022. [paper]
  • GOHOME: Graph-Oriented Heatmap Output for future Motion Estimation, ICRA 2022. [paper]
  • TridentNetV2: Lightweight Graphical Global Plan Representations for Dynamic Trajectory Generation, ICRA 2022. [paper]
  • Heterogeneous-Agent Trajectory Forecasting Incorporating Class Uncertainty, IROS 2022. [paper] [code] [trajdata]
  • Trajectory Prediction with Graph-based Dual-scale Context Fusion, IROS 2022. [paper] [code]
  • Robust Trajectory Prediction against Adversarial Attacks, CoRL 2022. [paper] [code]
  • Planning with Diffusion for Flexible Behavior Synthesis, ICML 2022. [paper] [website]
  • Synchronous Bi-Directional Pedestrian Trajectory Prediction with Error Compensation, ACCV 2022. [paper]
  • AI-TP: Attention-based Interaction-aware Trajectory Prediction for Autonomous Driving, IEEE T-IV, 2022. [paper] [code]
  • MDST-DGCN: A Multilevel Dynamic Spatiotemporal Directed Graph Convolutional Network for Pedestrian Trajectory Prediction, Computational Intelligence and Neuroscience. [paper]
  • Graph-Based Spatial-Temporal Convolutional Network for Vehicle Trajectory Prediction in Autonomous Driving, IEEE T-ITS, 2022. [paper]
  • Multi-Agent Trajectory Prediction with Heterogeneous Edge-Enhanced Graph Attention Network, IEEE T-ITS, 2022. [paper]
  • Fully Convolutional Encoder-Decoder With an Attention Mechanism for Practical Pedestrian Trajectory Prediction, IEEE T-ITS, 2022. [paper]
  • STGM: Vehicle Trajectory Prediction Based on Generative Model for Spatial-Temporal Features, IEEE T-ITS, 2022. [paper]
  • Trajectory Prediction for Autonomous Driving Using Spatial-Temporal Graph Attention Transformer, IEEE T-ITS, 2022. [paper]
  • Intention-Aware Vehicle Trajectory Prediction Based on Spatial-Temporal Dynamic Attention Network for Internet of Vehicles, IEEE T-ITS, 2022. [paper] [code]
  • Trajectory Forecasting Based on Prior-Aware Directed Graph Convolutional Neural Network, IEEE T-ITS, 2022. [paper]
  • DeepTrack: Lightweight Deep Learning for Vehicle Trajectory Prediction in Highways, IEEE T-ITS, 2022. [paper]
  • Interactive Trajectory Prediction Using a Driving Risk Map-Integrated Deep Learning Method for Surrounding Vehicles on Highways, IEEE T-ITS, 2022. [paper]
  • Vehicle Trajectory Prediction in Connected Environments via Heterogeneous Context-Aware Graph Convolutional Networks, IEEE T-ITS, 2022. [paper]
  • Trajectory Prediction Neural Network and Model Interpretation Based on Temporal Pattern Attention, IEEE T-ITS, 2022. [paper]
  • Learning Sparse Interaction Graphs of Partially Detected Pedestrians for Trajectory Prediction, IEEE RA-L, 2022. [paper] [code]
  • GAMMA: A General Agent Motion Prediction Model for Autonomous Driving, RAL. [paper] [code]
  • Stepwise Goal-Driven Networks for Trajectory Prediction, RAL. [paper] [code]
  • GA-STT: Human Trajectory Prediction with Group Aware Spatial-Temporal Transformer, RAL. [paper]
  • Long-term 4D trajectory prediction using generative adversarial networks, Transportation Research Part C: Emerging Technologies. [paper]
  • A context-aware pedestrian trajectory prediction framework for automated vehicles, Transportation Research Part C: Emerging Technologies. [paper]
  • Explainable multimodal trajectory prediction using attention models, Transportation Research Part C: Emerging Technologies. [paper]
  • CSCNet: Contextual semantic consistency network for trajectory prediction in crowded spaces, PR. [paper]
  • CSR: Cascade Conditional Variational AutoEncoder with Social-aware Regression for Pedestrian Trajectory Prediction, PR. [paper]
  • Step Attention: Sequential Pedestrian Trajectory Prediction, IEEE Sensors Journal. [paper]
  • Vehicle Trajectory Prediction Method Coupled With Ego Vehicle Motion Trend Under Dual Attention Mechanism, IEEE Transactions on Instrumentation and Measurement. [paper]
  • Spatio-temporal Interaction Aware and Trajectory Distribution Aware Graph Convolution Network for Pedestrian Multimodal Trajectory Prediction, IEEE Transactions on Instrumentation and Measurement. [paper]
  • Deep encoder–decoder-NN: A deep learning-based autonomous vehicle trajectory prediction and correction model, Physica A: Statistical Mechanics and its Applications. [paper]
  • PTPGC: Pedestrian trajectory prediction by graph attention network with ConvLSTM, Robotics and Autonomous Systems. [paper]
  • GCHGAT: pedestrian trajectory prediction using group constrained hierarchical graph attention networks, Applied Intelligence. [paper]
  • Vehicles Trajectory Prediction Using Recurrent VAE Network, IEEE Access. [paper] [code]
  • SEEM: A Sequence Entropy Energy-Based Model for Pedestrian Trajectory All-Then-One Prediction, TPAMI. [paper]
  • PTP-STGCN: Pedestrian Trajectory Prediction Based on a Spatio-temporal Graph Convolutional Neural Network, Applied Intelligence. [paper]
  • Trajectory distributions: A new description of movement for trajectory prediction, Computational Visual Media. [paper]
  • Continual learning-based trajectory prediction with memory augmented networks, Knowledge-Based Systems. [paper]
  • Atten-GAN: Pedestrian Trajectory Prediction with GAN Based on Attention Mechanism, Cognitive Computation. [paper]
  • EvoSTGAT: Evolving spatiotemporal graph attention networks for pedestrian trajectory prediction, Neurocomputing. [paper]
  • Raising context awareness in motion forecasting, CVPR Workshops 2022. [paper] [code]
  • Goal-driven Self-Attentive Recurrent Networks for Trajectory Prediction, CVPR Workshops 2022. [paper] [code]
  • Importance Is in Your Attention: Agent Importance Prediction for Autonomous Driving, CVPR Workshops 2022. [paper]
  • MPA: MultiPath++ Based Architecture for Motion Prediction, CVPR Workshops 2022. [paper] [code]
  • TPAD: Identifying Effective Trajectory Predictions Under the Guidance of Trajectory Anomaly Detection Model, arXiv:2201.02941, 2022. [paper]
  • Wayformer: Motion Forecasting via Simple & Efficient Attention Networks, arXiv preprint arXiv:2207.05844, 2022. [paper]
  • PreTR: Spatio-Temporal Non-Autoregressive Trajectory Prediction Transformer, arXiv preprint arXiv:2203.09293, 2022. [paper]
  • LatentFormer: Multi-Agent Transformer-Based Interaction Modeling and Trajectory Prediction, arXiv preprint arXiv:2203.01880, 2022. [paper]
  • Diverse Multiple Trajectory Prediction Using a Two-stage Prediction Network Trained with Lane Loss, arXiv preprint arXiv:2206.08641, 2022. [paper]
  • Semi-supervised Semantics-guided Adversarial Training for Trajectory Prediction, arXiv preprint arXiv:2205.14230, 2022. [paper]
  • Heterogeneous Trajectory Forecasting via Risk and Scene Graph Learning, arXiv preprint arXiv:2211.00848, 2022. [paper]
  • GATraj: A Graph- and Attention-based Multi-Agent Trajectory Prediction Model, arXiv preprint arXiv:2209.07857, 2022. [paper] [code]
  • Dynamic-Group-Aware Networks for Multi-Agent Trajectory Prediction with Relational Reasoning, arXiv preprint arXiv:2206.13114, 2022. [paper]
  • Collaborative Uncertainty Benefits Multi-Agent Multi-Modal Trajectory Forecasting, arXiv preprint arXiv:2207.05195, 2022. [paper] [code]
  • Guided Conditional Diffusion for Controllable Traffic Simulation, arXiv preprint arXiv:2210.17366, 2022. [paper] [website]
  • PhysDiff: Physics-Guided Human Motion Diffusion Model, arXiv preprint arXiv:2212.02500, 2022. [paper]
  • Collaborative Uncertainty in Multi-Agent Trajectory Forecasting, NeurIPS 2021. [paper]
  • GRIN: Generative Relation and Intention Network for Multi-agent Trajectory Prediction, NeurIPS 2021. [paper] [code]
  • LibCity: An Open Library for Traffic Prediction, SIGSPATIAL 2021. [paper] [code]
  • Predicting Vehicles Trajectories in Urban Scenarios with Transformer Networks and Augmented Information, IEEE Intelligent Vehicles Symposium (IV 2021). [paper]
  • Social-STAGE: Spatio-Temporal Multi-Modal Future Trajectory Forecast, ICRA 2021. [paper]
  • AVGCN: Trajectory Prediction using Graph Convolutional Networks Guided by Human Attention, ICRA 2021. [paper]
  • Exploring Dynamic Context for Multi-path Trajectory Prediction, ICRA 2021. [paper] [code]
  • Pedestrian Trajectory Prediction using Context-Augmented Transformer Networks, ICRA 2021. [paper] [code]
  • Spectral Temporal Graph Neural Network for Trajectory Prediction, ICRA 2021. [paper]
  • Congestion-aware Multi-agent Trajectory Prediction for Collision Avoidance, ICRA 2021. [paper] [code]
  • Anticipatory Navigation in Crowds by Probabilistic Prediction of Pedestrian Future Movements, ICRA 2021. [paper]
  • AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting, ICCV 2021. [paper] [code] [website]
  • Likelihood-Based Diverse Sampling for Trajectory Forecasting, ICCV 2021. [paper] [code]
  • MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction, ICCV 2021. [paper] [code]
  • Spatial-Temporal Consistency Network for Low-Latency Trajectory Forecasting, ICCV 2021. [paper]
  • Three Steps to Multimodal Trajectory Prediction: Modality Clustering, Classification and Synthesis, ICCV 2021. [paper]
  • From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting, ICCV 2021. [paper] [code]
  • Where are you heading? Dynamic Trajectory Prediction with Expert Goal Examples, ICCV 2021. [paper] [code]
  • DenseTNT: End-to-end Trajectory Prediction from Dense Goal Sets, ICCV 2021. [paper]
  • Safety-aware Motion Prediction with Unseen Vehicles for Autonomous Driving, ICCV 2021. [paper] [code]
  • LOKI: Long Term and Key Intentions for Trajectory Prediction, ICCV 2021. [paper] [dataset]
  • Human Trajectory Prediction via Counterfactual Analysis, ICCV 2021. [paper] [code]
  • Personalized Trajectory Prediction via Distribution Discrimination, ICCV 2021. [paper] [code]
  • Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction, ICCV 2021. [paper] [code]
  • Social NCE: Contrastive Learning of Socially-aware Motion Representations, ICCV 2021. [paper] [code]
  • RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting, ICCV 2021. [paper]
  • Temporal Pyramid Network for Pedestrian Trajectory Prediction with Multi-Supervision, AAAI 2021. [paper]
  • SCAN: A Spatial Context Attentive Network for Joint Multi-Agent Intent Prediction, AAAI 2021. [paper]
  • Disentangled Multi-Relational Graph Convolutional Network for Pedestrian Trajectory Prediction, AAAI 2021. [paper] [code]
  • MotionRNN: A Flexible Model for Video Prediction with Spacetime-Varying Motions, CVPR 2021. [paper]
  • Multimodal Motion Prediction with Stacked Transformers, CVPR 2021. [paper] [code] [website]
  • SGCN: Sparse Graph Convolution Network for Pedestrian Trajectory Prediction, CVPR 2021. [paper] [code]
  • LaPred: Lane-Aware Prediction of Multi-Modal Future Trajectories of Dynamic Agents, CVPR 2021. [paper]
  • Divide-and-Conquer for Lane-Aware Diverse Trajectory Prediction, CVPR 2021. [paper]
  • Euro-PVI: Pedestrian Vehicle Interactions in Dense Urban Centers, CVPR 2021. [paper] [dataset]
  • Trajectory Prediction with Latent Belief Energy-Based Model, CVPR 2021. [paper] [code]
  • Shared Cross-Modal Trajectory Prediction for Autonomous Driving, CVPR 2021. [paper]
  • Pedestrian and Ego-vehicle Trajectory Prediction from Monocular camera, CVPR 2021. [paper] [code]
  • Interpretable Social Anchors for Human Trajectory Forecasting in Crowds, CVPR 2021. [paper]
  • Introvert: Human Trajectory Prediction via Conditional 3D Attention, CVPR 2021. [paper]
  • MP3: A Unified Model to Map, Perceive, Predict and Plan, CVPR 2021. [paper]
  • TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors, CVPR 2021. [paper]
  • Multimodal Transformer Network for Pedestrian Trajectory Prediction, IJCAI 2021. [paper] [code]
  • Decoder Fusion RNN: Context and Interaction Aware Decoders for Trajectory Prediction, IROS 2021. [paper]
  • Joint Intention and Trajectory Prediction Based on Transformer, IROS 2021. [paper]
  • Maneuver-based Trajectory Prediction for Self-driving Cars Using Spatio-temporal Convolutional Networks, IROS 2021. [paper]
  • Multiple Contextual Cues Integrated Trajectory Prediction for Autonomous Driving, IROS 2021. [paper]
  • MultiXNet: Multiclass Multistage Multimodal Motion Prediction, IEEE Intelligent Vehicles Symposium (IV 2021). [paper]
  • Trajectory Prediction for Autonomous Driving based on Multi-Head Attention with Joint Agent-Map Representation, IEEE Intelligent Vehicles Symposium (IV 2021). [paper]
  • Social-IWSTCNN: A Social Interaction-Weighted Spatio-Temporal Convolutional Neural Network for Pedestrian Trajectory Prediction in Urban Traffic Scenarios, IV 2021. [paper]
  • Generating Scenarios with Diverse Pedestrian Behaviors for Autonomous Vehicle Testing, Conference on Robot Learning (CoRL 2021). [paper] [code]
  • Multimodal Trajectory Prediction Conditioned on Lane-Graph Traversals, CoRL 2021. [paper] [code]
  • Learning to Predict Vehicle Trajectories with Model-based Planning, CoRL 2021. [paper]
  • Pose Based Trajectory Forecast of Vulnerable Road Users Using Recurrent Neural Networks, International Conference on Pattern Recognition (ICPR 2021). [paper]
  • GraphTCN: Spatio-Temporal Interaction Modeling for Human Trajectory Prediction, WACV 2021. [paper]
  • Goal-driven Long-Term Trajectory Prediction, WACV 2021. [paper]
  • Multimodal Trajectory Predictions for Autonomous Driving without a Detailed Prior Map, WACV 2021. [paper]
  • Self-Growing Spatial Graph Network for Context-Aware Pedestrian Trajectory Prediction, IEEE International Conference on Image Processing (ICIP 2021). [paper] [code]
  • S2TNet: Spatio-Temporal Transformer Networks for Trajectory Prediction in Autonomous Driving, Asian Conference on Machine Learning 2021. [paper] [code]
  • Learning Structured Representations of Spatial and Interactive Dynamics for Trajectory Prediction in Crowded Scenes, IEEE Robotics and Automation Letters 2021. [paper] [code]
  • Trajectory Prediction using Equivariant Continuous Convolution, ICLR 2021. [paper] [code]
  • TridentNet: A Conditional Generative Model for Dynamic Trajectory Generation, International Conference on Intelligent Autonomous Systems 2021. [paper]
  • HOME: Heatmap Output for future Motion Estimation, ITSC 2021. [paper]
  • Graph and Recurrent Neural Network-based Vehicle Trajectory Prediction For Highway Driving, ITSC 2021. [paper]
  • SCSG Attention: A Self-Centered Star Graph with Attention for Pedestrian Trajectory Prediction, International Conference on Database Systems for Advanced Applications (DASFAA 2021). [paper]
  • Leveraging Trajectory Prediction for Pedestrian Video Anomaly Detection, IEEE Symposium Series on Computational Intelligence (SSCI 2021). [paper] [code]
  • Are socially-aware trajectory prediction models really socially-aware?, Transportation Research: Part C. [paper, paper] [code]
  • Injecting knowledge in data-driven vehicle trajectory predictors, Transportation Research: Part C. [paper] [code]
  • Decoding pedestrian and automated vehicle interactions using immersive virtual reality and interpretable deep learning, Transportation Research: Part C. [paper]
  • Human Trajectory Forecasting in Crowds: A Deep Learning Perspective, IEEE Transactions on Intelligent Transportation Systems. [paper] [code]
  • NetTraj: A Network-Based Vehicle Trajectory Prediction Model With Directional Representation and Spatiotemporal Attention Mechanisms, TITS. [paper]
  • Spatio-Temporal Graph Dual-Attention Network for Multi-Agent Prediction and Tracking, TITS. [paper]
  • A Hierarchical Framework for Interactive Behaviour Prediction of Heterogeneous Traffic Participants Based on Graph Neural Network, TITS. [paper]
  • TrajGAIL: Generating urban vehicle trajectories using generative adversarial imitation learning, Transportation Research Part C. [paper] [code]
  • Vehicle Trajectory Prediction Using Generative Adversarial Network With Temporal Logic Syntax Tree Features, IEEE Robotics and Automation Letters. [paper]
  • Vehicle Trajectory Prediction Using LSTMs with Spatial-Temporal Attention Mechanisms, IEEE Intelligent Transportation Systems Magazine. [paper] [code]
  • Long Short-Term Memory-Based Human-Driven Vehicle Longitudinal Trajectory Prediction in a Connected and Autonomous Vehicle Environment, Transportation Research Record. [paper]
  • Temporal Pyramid Network with Spatial-Temporal Attention for Pedestrian Trajectory Prediction, IEEE Transactions on Network Science and Engineering. [paper]
  • An efficient Spatial–Temporal model based on gated linear units for trajectory prediction, Neurocomputing. [paper]
  • SRAI-LSTM: A Social Relation Attention-based Interaction-aware LSTM for human trajectory prediction, Neurocomputing. [paper]
  • AST-GNN: An attention-based spatio-temporal graph neural network for Interaction-aware pedestrian trajectory prediction, Neurocomputing. [paper]
  • Multi-PPTP: Multiple Probabilistic Pedestrian Trajectory Prediction in the Complex Junction Scene, IEEE Transactions on Intelligent Transportation Systems. [paper]
  • A Novel Graph-Based Trajectory Predictor With Pseudo-Oracle, TNNLS. [paper]
  • Large Scale GPS Trajectory Generation Using Map Based on Two Stage GAN, Journal of Data Science. [paper] [code]
  • Pose and Semantic Map Based Probabilistic Forecast of Vulnerable Road Users’ Trajectories, IEEE Transactions on Intelligent Vehicles. [paper]
  • STI-GAN: Multimodal Pedestrian Trajectory Prediction Using Spatiotemporal Interactions and a Generative Adversarial Network, IEEE Access. [paper]
  • Holistic LSTM for Pedestrian Trajectory Prediction, TIP. [paper]
  • Pedestrian trajectory prediction with convolutional neural networks, PR. [paper]
  • LSTM based trajectory prediction model for cyclist utilizing multiple interactions with environment, PR. [paper]
  • Human trajectory prediction and generation using LSTM models and GANs, PR. [paper]
  • Vehicle trajectory prediction and generation using LSTM models and GANs, PLOS ONE. [paper]
  • BiTraP: Bi-Directional Pedestrian Trajectory Prediction With Multi-Modal Goal Estimation, RAL. [paper] [code]
  • A Kinematic Model for Trajectory Prediction in General Highway Scenarios, RAL. [paper] [code]
  • Trajectory Prediction in Autonomous Driving With a Lane Heading Auxiliary Loss, RAL. [paper]
  • Tra2Tra: Trajectory-to-Trajectory Prediction With a Global Social Spatial-Temporal Attentive Neural Network, RAL. [paper]
  • Social graph convolutional LSTM for pedestrian trajectory prediction, IET Intelligent Transport Systems. [paper]
  • HSTA: A Hierarchical Spatio-Temporal Attention Model for Trajectory Prediction, IEEE Transactions on Vehicular Technology (TVT). [paper]
  • Environment-Attention Network for Vehicle Trajectory Prediction, TVT. [paper]
  • Where Are They Going? Predicting Human Behaviors in Crowded Scenes, ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM). [paper]
  • Multi-Agent Trajectory Prediction with Spatio-Temporal Sequence Fusion, IEEE Transactions on Multimedia (TMM). [paper]
  • EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning, NeurIPS 2020. [paper]
  • V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction, ECCV 2020. [paper]
  • SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction, ECCV 2020. [paper]
  • SimAug: Learning Robust Representations from Simulation for Trajectory Prediction, ECCV 2020. [paper]
  • Learning Lane Graph Representations for Motion Forecasting, ECCV 2020. [paper]
  • Implicit Latent Variable Model for Scene-Consistent Motion Forecasting, ECCV 2020. [paper]
  • Diverse and Admissible Trajectory Forecasting through Multimodal Context Understanding, ECCV 2020. [paper]
  • Semantic Synthesis of Pedestrian Locomotion, ACCV 2020. [Paper]
  • Kernel Trajectory Maps for Multi-Modal Probabilistic Motion Prediction, CoRL 2019. [paper] [code]
  • Social-WaGDAT: Interaction-aware Trajectory Prediction via Wasserstein Graph Double-Attention Network, 2020. [paper]
  • Social NCE: Contrastive Learning of Socially-aware Motion Representations. [paper], [code]
  • Pose Based Trajectory Forecast of Vulnerable Road Users Using Recurrent Neural Networks, ICPR International Workshops and Challenges 2020. [paper]
  • Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction, ECCV 2020. [paper]
  • It is not the Journey but the Destination: Endpoint Conditioned Trajectory Prediction, ECCV 2020. [paper]
  • How Can I See My Future? FvTraj: Using First-person View for Pedestrian Trajectory Prediction, ECCV 2020. [paper]
  • Dynamic and Static Context-aware LSTM for Multi-agent Motion Prediction, ECCV 2020. [paper]
  • Human Trajectory Forecasting in Crowds: A Deep Learning Perspective, 2020. [paper], [code]
  • SimAug: Learning Robust Representations from 3D Simulation for Pedestrian Trajectory Prediction in Unseen cameras, ECCV 2020. [paper], [code]
  • DAG-Net: Double Attentive Graph Neural Network for Trajectory Forecasting, ICPR 2020. [paper] [code]
  • Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision, WACV 2020. [paper]
  • Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction, CVPR 2020. [Paper], [Code]
  • The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction, CVPR 2020. [paper], [code/dataset]
  • Pose Based Trajectory Forecast of Vulnerable Road Users, SSCI 2019. [paper]
  • The Trajectron: Probabilistic Multi-Agent Trajectory Modeling With Dynamic Spatiotemporal Graphs, ICCV 2019. [paper] [code]
  • STGAT: Modeling Spatial-Temporal Interactions for Human Trajectory Prediction, ICCV 2019. [paper] [code]
  • Instance-Level Future Motion Estimation in a Single Image Based on Ordinal Regression, ICCV 2019. [paper]
  • Social and Scene-Aware Trajectory Prediction in Crowded Spaces, ICCV workshop 2019. [paper] [code]
  • Stochastic Sampling Simulation for Pedestrian Trajectory Prediction, IROS 2019. [paper]
  • Long-Term Prediction of Motion Trajectories Using Path Homology Clusters, IROS 2019. [paper]
  • StarNet: Pedestrian Trajectory Prediction Using Deep Neural Network in Star Topology, IROS 2019. [paper]
  • Learning Generative Socially-Aware Models of Pedestrian Motion, IROS 2019. [paper]
  • Situation-Aware Pedestrian Trajectory Prediction with Spatio-Temporal Attention Model, CVWW 2019. [paper]
  • Path predictions using object attributes and semantic environment, VISIGRAPP 2019. [paper]
  • Probabilistic Path Planning using Obstacle Trajectory Prediction, CoDS-COMAD 2019. [paper]
  • Human Trajectory Prediction using Adversarial Loss, hEART 2019. [paper], [code]
  • Social Ways: Learning Multi-Modal Distributions of Pedestrian Trajectories with GANs, CVPR 2019. [Precognition Workshop], [paper], [code]
  • Peeking into the Future: Predicting Future Person Activities and Locations in Videos, CVPR 2019. [paper], [code]
  • Learning to Infer Relations for Future Trajectory Forecast, CVPR 2019. [paper]
  • TraPHic: Trajectory Prediction in Dense and Heterogeneous Traffic Using Weighted Interactions, CVPR 2019. [paper]
  • Which Way Are You Going? Imitative Decision Learning for Path Forecasting in Dynamic Scenes, CVPR 2019. [paper]
  • Overcoming Limitations of Mixture Density Networks: A Sampling and Fitting Framework for Multimodal Future Prediction, CVPR 2019. [paper][code]
  • SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints, CVPR 2019. [paper][code]
  • Pedestrian path, pose, and intention prediction through Gaussian process dynamical models and pedestrian activity recognition, 2019. [paper]
  • Multimodal Interaction-aware Motion Prediction for Autonomous Street Crossing, 2019. [paper]
  • The simpler the better: Constant velocity for pedestrian motion prediction, 2019. [paper]
  • SR-LSTM: State Refinement for LSTM towards Pedestrian Trajectory Prediction, 2019. [paper]
  • Location-velocity attention for pedestrian trajectory prediction, WACV 2019. [paper]
  • Pedestrian Trajectory Prediction in Extremely Crowded Scenarios, Sensors, 2019. [paper]
  • Forecasting Trajectory and Behavior of Road-Agents Using Spectral Clustering in Graph-LSTMs, 2019. [paper] [code]
  • Joint Prediction for Kinematic Trajectories in Vehicle-Pedestrian-Mixed Scenes, ICCV 2019. [paper]
  • Analyzing the Variety Loss in the Context of Probabilistic Trajectory Prediction, ICCV 2019. [paper]
  • Looking to Relations for Future Trajectory Forecast, ICCV 2019. [paper]
  • Jointly Learnable Behavior and Trajectory Planning for Self-Driving Vehicles, IROS 2019. [paper]
  • Sharing Is Caring: Socially-Compliant Autonomous Intersection Negotiation, IROS 2019. [paper]
  • INFER: INtermediate Representations for FuturE PRediction, IROS 2019. [paper] [code]
  • Deep Predictive Autonomous Driving Using Multi-Agent Joint Trajectory Prediction and Traffic Rules, IROS 2019. [paper]
  • NeuroTrajectory: A Neuroevolutionary Approach to Local State Trajectory Learning for Autonomous Vehicles, IROS 2019. [paper]
  • Urban Street Trajectory Prediction with Multi-Class LSTM Networks, IROS 2019. [N/A]
  • Spatiotemporal Learning of Directional Uncertainty in Urban Environments with Kernel Recurrent Mixture Density Networks, IROS 2019. [paper]
  • Conditional generative neural system for probabilistic trajectory prediction, IROS 2019. [paper]
  • Interaction-aware multi-agent tracking and probabilistic behavior prediction via adversarial learning, ICRA 2019. [paper]
  • Generic Tracking and Probabilistic Prediction Framework and Its Application in Autonomous Driving, IEEE Trans. Intell. Transport. Systems, 2019. [paper]
  • Coordination and trajectory prediction for vehicle interactions via Bayesian generative modeling, IV 2019. [paper]
  • Wasserstein generative learning with kinematic constraints for probabilistic interactive driving behavior prediction, IV 2019. [paper]
  • GRIP: Graph-based Interaction-aware Trajectory Prediction, ITSC 2019. [paper]
  • AGen: Adaptable Generative Prediction Networks for Autonomous Driving, IV 2019. [paper]
  • TraPHic: Trajectory Prediction in Dense and Heterogeneous Traffic Using Weighted Interactions, CVPR 2019. [paper], [code]
  • Multi-Step Prediction of Occupancy Grid Maps with Recurrent Neural Networks, CVPR 2019. [paper]
  • Argoverse: 3D Tracking and Forecasting With Rich Maps, CVPR 2019. [paper]
  • Robust Aleatoric Modeling for Future Vehicle Localization, CVPR 2019. [paper]
  • Pedestrian occupancy prediction for autonomous vehicles, IRC 2019. [paper]
  • Context-based path prediction for targets with switching dynamics, 2019. [paper]
  • Deep Imitative Models for Flexible Inference, Planning, and Control, 2019. [paper]
  • Multi-agent tensor fusion for contextual trajectory prediction, 2019. [paper]
  • Context-Aware Pedestrian Motion Prediction In Urban Intersections, 2018. [paper]
  • Generic probabilistic interactive situation recognition and prediction: From virtual to real, ITSC 2018. [paper]
  • Generic vehicle tracking framework capable of handling occlusions based on modified mixture particle filter, IV 2018. [paper]
  • Multi-Modal Trajectory Prediction of Surrounding Vehicles with Maneuver based LSTMs, 2018. [paper]
  • Sequence-to-sequence prediction of vehicle trajectory via LSTM encoder-decoder architecture, 2018. [paper]
  • R2P2: A ReparameteRized Pushforward Policy for diverse, precise generative path forecasting, ECCV 2018. [paper]
  • Predicting trajectories of vehicles using large-scale motion priors, IV 2018. [paper]
  • Vehicle trajectory prediction by integrating physics-and maneuver based approaches using interactive multiple models, 2018. [paper]
  • Motion Prediction of Traffic Actors for Autonomous Driving using Deep Convolutional Networks, 2018. [paper]
  • Generative multi-agent behavioral cloning, 2018. [paper]
  • Deep Sequence Learning with Auxiliary Information for Traffic Prediction, KDD 2018. [paper], [code]
  • A data-driven model for interaction-aware pedestrian motion prediction in object cluttered environments, ICRA 2018. [paper]
  • Move, Attend and Predict: An attention-based neural model for people’s movement prediction, Pattern Recognition Letters 2018. [paper]
  • GD-GAN: Generative Adversarial Networks for Trajectory Prediction and Group Detection in Crowds, ACCV 2018. [paper], [demo]
  • SS-LSTM: A hierarchical LSTM model for pedestrian trajectory prediction, WACV 2018. [paper]
  • Social Attention: Modeling Attention in Human Crowds, ICRA 2018. [paper][code]
  • Pedestrian prediction by planning using deep neural networks, ICRA 2018. [paper]
  • Joint long-term prediction of human motion using a planning-based social force approach, ICRA 2018. [paper]
  • Human motion prediction under social grouping constraints, IROS 2018. [paper]
  • Future Person Localization in First-Person Videos, CVPR 2018. [paper]
  • Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks, CVPR 2018. [paper][code]
  • Group LSTM: Group Trajectory Prediction in Crowded Scenarios, ECCV 2018. [paper]
  • MX-LSTM: Mixing tracklets and vislets to jointly forecast trajectories and head poses, CVPR 2018. [paper]
  • Intent prediction of pedestrians via motion trajectories using stacked recurrent neural networks, 2018. [paper]
  • Transferable pedestrian motion prediction models at intersections, 2018. [paper]
  • Probabilistic map-based pedestrian motion prediction taking traffic participants into consideration, 2018. [paper]
  • A Computationally Efficient Model for Pedestrian Motion Prediction, ECC 2018. [paper]
  • Context-aware trajectory prediction, ICPR 2018. [paper]
  • Set-based prediction of pedestrians in urban environments considering formalized traffic rules, ITSC 2018. [paper]
  • Building prior knowledge: A Markov based pedestrian prediction model using urban environmental data, ICARCV 2018. [paper]
  • Depth Information Guided Crowd Counting for Complex Crowd Scenes, 2018. [paper]
  • Tracking by Prediction: A Deep Generative Model for Mutli-Person Localisation and Tracking, WACV 2018. [paper]
  • “Seeing is Believing”: Pedestrian Trajectory Forecasting Using Visual Frustum of Attention, WACV 2018. [paper]
  • Long-Term On-Board Prediction of People in Traffic Scenes under Uncertainty, CVPR 2018. [paper], [code+data]
  • Encoding Crowd Interaction with Deep Neural Network for Pedestrian Trajectory Prediction, CVPR 2018. [paper], [code]
  • Multipolicy decision-making for autonomous driving via changepoint-based behavior prediction, 2017. [paper]
  • Probabilistic long-term prediction for autonomous vehicles, IV 2017. [paper]
  • Probabilistic vehicle trajectory prediction over occupancy grid map via recurrent neural network, ITSC 2017. [paper]
  • DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents, CVPR 2017. [paper][code]
  • Imitating driver behavior with generative adversarial networks, 2017. [paper][code]
  • InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations, 2017. [paper][code]
  • Long-term planning by short-term prediction, 2017. [paper]
  • Long-term path prediction in urban scenarios using circular distributions, 2017. [paper]
  • Deep learning driven visual path prediction from a single image, 2016. [paper]
  • Walking Ahead: The Headed Social Force Model, 2017. [paper]
  • Real-time certified probabilistic pedestrian forecasting, 2017. [paper]
  • A multiple-predictor approach to human motion prediction, ICRA 2017. [paper]
  • Forecasting interactive dynamics of pedestrians with fictitious play, CVPR 2017. [paper]
  • Forecast the plausible paths in crowd scenes, IJCAI 2017. [paper]
  • Bi-prediction: pedestrian trajectory prediction based on bidirectional LSTM classification, DICTA 2017. [paper]
  • Aggressive, Tense or Shy? Identifying Personality Traits from Crowd Videos, IJCAI 2017. [paper]
  • Natural vision based method for predicting pedestrian behaviour in urban environments, ITSC 2017. [paper]
  • Human Trajectory Prediction using Spatially aware Deep Attention Models, 2017. [paper]
  • Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection, 2017. [paper]
  • Social LSTM: Human trajectory prediction in crowded spaces, CVPR 2016. [paper][code]
  • Comparison and evaluation of pedestrian motion models for vehicle safety systems, ITSC 2016. [paper]
  • Age and Group-driven Pedestrian Behaviour: from Observations to Simulations, 2016. [paper]
  • Structural-RNN: Deep learning on spatio-temporal graphs, CVPR 2016. [paper][code]
  • Intent-aware long-term prediction of pedestrian motion, ICRA 2016. [paper]
  • Context-based detection of pedestrian crossing intention for autonomous driving in urban environments, IROS 2016. [paper]
  • Novel planning-based algorithms for human motion prediction, ICRA 2016. [paper]
  • Learning social etiquette: Human trajectory understanding in crowded scenes, ECCV 2016. [paper][code]
  • GLMP-realtime pedestrian path prediction using global and local movement patterns, ICRA 2016. [paper]
  • Knowledge transfer for scene-specific motion prediction, ECCV 2016. [paper]
  • STF-RNN: Space Time Features-based Recurrent Neural Network for predicting People Next Location, SSCI 2016. [code]
  • Goal-directed pedestrian prediction, ICCV 2015. [paper]
  • Trajectory analysis and prediction for improved pedestrian safety: Integrated framework and evaluations, 2015. [paper]
  • Predicting and recognizing human interactions in public spaces, 2015. [paper]
  • Learning collective crowd behaviors with dynamic pedestrian-agents, 2015. [paper]
  • Modeling spatial-temporal dynamics of human movements for predicting future trajectories, AAAI 2015. [paper]
  • Unsupervised robot learning to predict person motion, ICRA 2015. [paper]
  • A controlled interactive multiple model filter for combined pedestrian intention recognition and path prediction, ITSC 2015. [paper]
  • Real-Time Predictive Modeling and Robust Avoidance of Pedestrians with Uncertain, Changing Intentions, 2014. [paper]
  • Behavior estimation for a complete framework for human motion prediction in crowded environments, ICRA 2014. [paper]
  • Pedestrian’s trajectory forecast in public traffic with artificial neural network, ICPR 2014. [paper]
  • Will the pedestrian cross? A study on pedestrian path prediction, 2014. [paper]
  • BRVO: Predicting pedestrian trajectories using velocity-space reasoning, 2014. [paper]
  • Context-based pedestrian path prediction, ECCV 2014. [paper]
  • Pedestrian path prediction using body language traits, 2014. [paper]
  • Online maneuver recognition and multimodal trajectory prediction for intersection assistance using non-parametric regression, 2014. [paper]
  • Learning intentions for improved human motion prediction, 2013. [paper]
  • Understanding interactions between traffic participants based on learned behaviors, 2016. [paper]
  • Visual path prediction in complex scenes with crowded moving objects, CVPR 2016. [paper]
  • A game-theoretic approach to replanning-aware interactive scene prediction and planning, 2016. [paper]
  • Intention-aware online POMDP planning for autonomous driving in a crowd, ICRA 2015. [paper]
  • Patch to the future: Unsupervised visual prediction, CVPR 2014. [paper]
  • Mobile agent trajectory prediction using Bayesian nonparametric reachability trees, 2011. [paper]

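Mobile Robots

  • Anticipatory Navigation in Crowds by Probabilistic Prediction of Pedestrian Future Movements, ICRA 2021. [paper]
  • Social NCE: Contrastive Learning of Socially-aware Motion Representations. [paper], [code]
  • Multimodal probabilistic model-based planning for human-robot interaction, ICRA 2018. [paper] [code]
  • Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning, ICRA 2017. [paper]
  • Augmented dictionary learning for motion prediction, ICRA 2016. [paper]
  • Predicting future agent motions for dynamic environments, ICMLA 2016. [paper]
  • Bayesian intention inference for trajectory prediction with an unknown goal destination, IROS 2015. [paper]
  • Learning to predict trajectories of cooperatively navigating agents, ICRA 2014. [paper]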

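Sport Players

  • EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning, NeurIPS 2020. [paper]
  • Imitative Non-Autoregressive Modeling for Trajectory Forecasting and Imputation, CVPR 2020. [paper]
  • DAG-Net: Double Attentive Graph Neural Network for Trajectory Forecasting, ICPR 2020. [paper] [code]
  • Diverse Generation for Multi-Agent Sports Games, CVPR 2019. [paper]
  • Stochastic Prediction of Multi-Agent Interactions from Partial Observations, ICLR 2019. [paper]
  • Generating Multi-Agent Trajectories using Programmatic Weak Supervision, ICLR 2019. [paper]
  • Generative Multi-Agent Behavioral Cloning, ICML 2018. [paper]
  • Where Will They Go? Predicting Fine-Grained Adversarial Multi-Agent Motion using Conditional Variational Autoencoders, ECCV 2018. [paper]
  • Coordinated Multi-Agent Imitation Learning, ICML 2017. [paper]
  • Generating long-term trajectories using deep hierarchical networks, 2017. [paper]
  • Learning Fine-Grained Spatial Models for Dynamic Sports Play Prediction, ICDM 2014. [paper]
  • Generative Modeling of Multimodal Multi-Human Behavior, 2018. [paper]
  • What will Happen Next? Forecasting Player Moves in Sports Videos, ICCV 2017. [paper]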

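Benchmark and Evaluation Metrics

  • A Preprocessing and Evaluation Toolbox for Trajectory Prediction Research on the Drone Datasets, arXiv preprint arXiv:2405.00604, 2024. [paper] [code]
  • Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation, ECCV 2022. [paper] [code]
  • OpenTraj: Assessing Prediction Complexity in Human Trajectories Datasets, ACCV 2020. [paper] [code]
  • Testing the Safety of Self-driving Vehicles by Simulating Perception and Prediction, ECCV 2020. [paper]
  • PIE: A Large-Scale Dataset and Models for Pedestrian Intention Estimation and Trajectory Prediction, ICCV 2019. [paper]
  • Towards a fatality-aware benchmark of probabilistic reaction prediction in highly interactive driving scenarios, ITSC 2018. [paper]
  • How good is my prediction? Finding a similarity measure for trajectory prediction evaluation, ITSC 2017. [paper]
  • Trajnet: Towards a benchmark for human trajectory prediction. [website]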

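Others

  • Pose Based Start Intention Detection of Cyclists, ITSC 2019. [paper]
  • Cyclist trajectory prediction using bidirectional recurrent neural networks, AI 2018. [paper]
  • Road infrastructure indicators for trajectory prediction, 2018. [paper]
  • Using road topology to improve cyclist path prediction, 2017. [paper]
  • Trajectory prediction of cyclists using a physical model and an artificial neural network, 2016. [paper]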
