Thank you for your work! It helps a lot.
But there's a question I'm confused about.
UniVLA/prismatic/vla/datasets/rlds/oxe/transforms.py
Lines 64 to 90 in 2b780a5
```python
def bridge_orig_dataset_transform(trajectory: Dict[str, Any]) -> Dict[str, Any]:
    """
    Applies to original version of Bridge V2 from the official project website.

    Note =>> In original Bridge V2 dataset, the first timestep has an all-zero action, so we remove it!
    """
    for key in trajectory.keys():
        if key == "traj_metadata":
            continue
        elif key == "observation":
            for key2 in trajectory[key]:
                trajectory[key][key2] = trajectory[key][key2][1:]
        else:
            trajectory[key] = trajectory[key][1:]

    trajectory["action"] = tf.concat(
        [
            trajectory["action"][:, :6],
            binarize_gripper_actions(trajectory["action"][:, -1])[:, None],
        ],
        axis=1,
    )
    # print(trajectory.keys(), trajectory['observation'].keys())
    trajectory = relabel_bridge_actions(trajectory)
    trajectory["observation"]["EEF_state"] = trajectory["observation"]["state"][:, :6]
    trajectory["observation"]["gripper_state"] = trajectory["observation"]["state"][:, -1:]

    return trajectory
```
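For intuition, here is a minimal NumPy sketch (toy numbers, not real Bridge data) of the two things this transform does to the action tensor: dropping the all-zero first timestep and binarizing the gripper channel. The simple 0.5 threshold is my assumption for illustration only; the actual `binarize_gripper_actions` in the repo handles intermediate gripper values more carefully.

```python
import numpy as np

# Toy trajectory: 4 timesteps, 7-D actions (6 movement dims + 1 gripper dim).
# The first timestep's action is all zeros, mirroring the original Bridge V2 data.
action = np.array([
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
    [0.1, 0.0, 0.0, 0.0, 0.0, 0.0, 0.9],
    [0.2, 0.0, 0.0, 0.0, 0.0, 0.0, 0.2],
    [0.3, 0.0, 0.0, 0.0, 0.0, 0.0, 0.8],
])

# Step 1: drop the all-zero first timestep.
action = action[1:]

# Step 2: binarize the gripper channel. A plain 0.5 threshold is a stand-in
# here; the real binarize_gripper_actions is more involved.
gripper = (action[:, -1] > 0.5).astype(action.dtype)
action = np.concatenate([action[:, :6], gripper[:, None]], axis=1)

print(action[:, -1])  # -> [1. 0. 1.]
```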
UniVLA/prismatic/vla/datasets/rlds/utils/data_utils.py
Lines 165 to 172 in 2b780a5
```python
# === Bridge-V2 =>> Dataset-Specific Transform ===
def relabel_bridge_actions(traj: Dict[str, Any]) -> Dict[str, Any]:
    """Relabels actions to use reached proprioceptive state; discards last timestep (no-action)."""
    movement_actions = traj["observation"]["state"][1:, :6] - traj["observation"]["state"][:-1, :6]
    traj_truncated = tf.nest.map_structure(lambda x: x[:-1], traj)
    traj_truncated["action"] = tf.concat([movement_actions, traj["action"][:-1, -1:]], axis=1)
    return traj_truncated
```
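To make the relabeling concrete, here is a NumPy re-expression of what `relabel_bridge_actions` computes, on made-up toy numbers (the real code uses TF ops over full nested trajectories). The relabeled movement action at step t is the *achieved* state delta `state[t+1] - state[t]`, which in general differs from the commanded `action[t]`, e.g. when the low-level controller does not track the command perfectly.

```python
import numpy as np

state = np.array([            # toy proprioceptive states, 6-DoF pose per row
    [0.00, 0.0, 0.0, 0.0, 0.0, 0.0],
    [0.01, 0.0, 0.0, 0.0, 0.0, 0.0],
    [0.03, 0.0, 0.0, 0.0, 0.0, 0.0],
])
action = np.array([           # recorded (commanded) actions, last column = gripper
    [0.012, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0],
    [0.018, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0],
    [0.000, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0],
])

# Achieved deltas between consecutive states (what the transform relabels with).
movement = state[1:, :6] - state[:-1, :6]

# Keep the recorded gripper channel; drop the last (no-action) timestep.
relabeled = np.concatenate([movement, action[:-1, -1:]], axis=1)

print(relabeled[0, 0])  # achieved 0.01, vs. commanded 0.012 in action[0, 0]
```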
I'm using data from https://rail.eecs.berkeley.edu/datasets/bridge_release/data/tfds/bridge_dataset/1.0.0/.
I'd like to ask why the difference between consecutive proprioceptive states is used here to represent the action, instead of directly using the recorded action values.
I downloaded the data directly and retrieved states 0, 1, 2 as well as actions 0, 1. When I computed the differences between consecutive states, the values don't exactly match the recorded actions.
Is my understanding incorrect, or is there another consideration at play? I'd be very grateful if you could clarify this.