Real human data for robotics

Vision · Depth · Tactile · Motion · Wrist POV

Robots need more than pixels. Pixels show what the world looks like — Tridi captures how it moves, twists, grips, and resists.

Modalities

Five senses, all synchronized.

Vision, stereo depth, tactile force, body motion, and wrist POV — all captured together, aligned frame-by-frame. Unlike other providers, we capture more than just video.
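
As a rough illustration of what frame-by-frame alignment can mean in practice, the sketch below matches each video frame to the nearest-in-time sample from another sensor stream. It is a minimal sketch under stated assumptions, not Tridi's pipeline: the function names, the sample rates, and the nearest-timestamp strategy are all hypothetical.

```python
from bisect import bisect_left


def nearest_index(timestamps, t):
    """Return the index of the sample in sorted `timestamps` closest to time `t`."""
    i = bisect_left(timestamps, t)
    if i == 0:
        return 0
    if i == len(timestamps):
        return len(timestamps) - 1
    # Pick whichever neighbour is closer; ties go to the earlier sample.
    return i - 1 if t - timestamps[i - 1] <= timestamps[i] - t else i


def align_to_frames(frame_times, stream_times, stream_values):
    """For each video frame time, pick the nearest sample from another stream."""
    return [stream_values[nearest_index(stream_times, t)] for t in frame_times]


# Hypothetical data: four video frames and four tactile samples (seconds).
frame_times = [0.0, 0.1, 0.2, 0.3]
tactile_times = [0.00, 0.12, 0.19, 0.31]
tactile_values = ["t0", "t1", "t2", "t3"]

aligned = align_to_frames(frame_times, tactile_times, tactile_values)
```

Nearest-timestamp matching is only one option; interpolation or hardware-triggered capture are common alternatives when tighter alignment is needed.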

[Figure: synchronized capture timeline, 0 s to 9 s, with tracks for Vision, Depth, Tactile (left and right), Motion, and Wrist POV]

Environments

Real environments, not studio stages.

Homes, offices, labs, factories, fields — every session is captured where your robot will actually work. Studio data doesn't transfer; our operators show up on location.

Homes · Offices · Restaurants · Retail · Industrial · Logistics · Construction · Agriculture

Diversity

Built to generalize.

Beyond diverse environments, Tridi varies the other axes that drive real-world generalization: objects, tasks, and operators.

Object diversity

Thousands of unique instances across materials (rigid, deformable, clear), sizes, shapes, and weights.

Task variety

Pick and place, stacking, insertion, tool use, navigation, and multi-step sequential tasks.

Operator diversity

Multiple demonstrators per task with varied skill levels, styles, handedness, and demonstration speeds.

Outputs

Structured 3D datasets, beyond raw footage.

We offer training-ready derived signals — not just raw data you still have to process. Depth, body pose, tactile forces, object segmentation, and action labels — all synchronized.

Capture multimodal data

RGB · Depth · IMU · Tactile

Stereo depth maps

Per-pixel depth from synchronized stereo with calibrated intrinsics and extrinsics.
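
For readers unfamiliar with stereo depth, the standard relation for a rectified stereo pair is Z = f · B / d, where f is the focal length in pixels (from the intrinsics), B is the baseline between the two cameras (from the extrinsics), and d is the disparity in pixels. The sketch below applies that formula to a single pixel. The function name and the numeric values are illustrative assumptions, not Tridi's calibration or processing code.

```python
def disparity_to_depth(disparity_px, focal_length_px, baseline_m):
    """Depth in metres for a rectified stereo pair: Z = f * B / d."""
    if disparity_px <= 0:
        # Zero or negative disparity has no valid finite depth.
        return None
    return focal_length_px * baseline_m / disparity_px


# Hypothetical values: 700 px focal length, 5 cm baseline, 35 px disparity.
depth_m = disparity_to_depth(disparity_px=35.0, focal_length_px=700.0, baseline_m=0.05)
```

With these assumed numbers the pixel sits roughly one metre from the cameras. Halving the disparity doubles the depth, which is why depth precision falls off with distance.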

Body pose estimation

3D skeletal joints tracked frame-by-frame across the operator's body.

MANO hand meshes

Articulated MANO meshes for left and right hands frame-by-frame.

Tactile force maps

Per-sensor pressure grids time-synced to grasp and contact events.

Object segmentation

Instance and part masks aligned to geometry, depth, and contact.

Atomic action labels

Verb-object spans labeling fine-grained actions for imitation and VLA training.
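
One plausible way to picture verb-object spans is as time intervals, each tagged with a verb and an object, that can be queried by timestamp. The sketch below is an assumption for illustration only: the `ActionSpan` fields and the `label_at` helper are hypothetical and do not describe Tridi's delivery schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ActionSpan:
    verb: str
    obj: str
    start_s: float  # inclusive start time in seconds
    end_s: float    # exclusive end time in seconds


def label_at(spans, t):
    """Return the span covering time `t`, or None if no span covers it."""
    for span in spans:
        if span.start_s <= t < span.end_s:
            return span
    return None


# Hypothetical annotation of a short mug-lifting clip.
spans = [
    ActionSpan("reach", "mug", 0.0, 0.8),
    ActionSpan("grasp", "mug", 0.8, 1.4),
    ActionSpan("lift", "mug", 1.4, 2.5),
]

current = label_at(spans, 1.0)
```

Half-open intervals keep adjacent spans from overlapping, so every timestamp maps to at most one action.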

Delivery Formats
Why Tridi?

What we capture that others don’t.

Side-by-side against other robotics and physical-AI data providers — the modalities, environments, and structured outputs they skip.

Capability* · Tridi · Others
Egocentric RGB video
Wrist POV video
Tactile force sensing
Stereo depth (IR dot projection)
Inertial Measurement Unit (IMU) motion data

* Comparison based on publicly listed capabilities of robotics and physical AI data providers, as of May 2026.

Security and Privacy

Data you can deploy with confidence

Privacy by design and enterprise-grade security built into every step — from collection to delivery.

Security

Enterprise-grade controls, encrypted pipelines, and strict access governance. Built for scale and reliability.

Privacy

Engagement data stays confidential and separated, with detailed audit logs for traceability.

FAQs

Frequently Asked Questions

Ready to train your embodied AI?

Capture how humans interact with the physical world to model embodied spatial intelligence, at scale.