kaolin.io.render

API

kaolin.io.render.import_synthetic_view(root_dir, idx, rgb=True, depth_linear=False, semantic=False, instance=False, normals=False, bbox_2d_tight=False, bbox_2d_loose=False)

Import views of synthetic data simulating sensors on 3D models, following the format output by the Data Generator extension in the Omniverse Kaolin App.

Parameters
  • root_dir (str) – path to the root directory containing the views.

  • idx (int) – index of the view selected.

  • rgb (bool, optional) – if True, load RGB image. Default: True.

  • depth_linear (bool, optional) – if True, load depth map with linear scaling. Default: False.

  • semantic (bool, optional) – if True, load semantic segmentation map. Default: False.

  • instance (bool, optional) – if True, load instance segmentation map. Default: False.

  • normals (bool, optional) – if True, load normals map. Default: False.

  • bbox_2d_tight (bool, optional) – if True, load tight 2d bounding box. Default: False.

  • bbox_2d_loose (bool, optional) – if True, load loose 2d bounding box. Default: False.
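Because every sensor is controlled by its own boolean flag, a thin wrapper can make call sites easier to read. The following is a minimal sketch, not part of Kaolin: load_view, its sensors argument, and its loader argument are hypothetical names introduced here for illustration. Only the import_synthetic_view signature comes from the documentation above.

```python
# Flag names taken from the import_synthetic_view signature documented above.
SENSOR_FLAGS = (
    "rgb", "depth_linear", "semantic", "instance",
    "normals", "bbox_2d_tight", "bbox_2d_loose",
)


def load_view(root_dir, idx, sensors=("rgb",), loader=None):
    """Load one synthetic view with only the named sensors enabled.

    Hypothetical helper: `sensors` is an iterable of flag names, and
    `loader` lets a caller substitute a stand-in for the Kaolin function.
    """
    unknown = sorted(set(sensors) - set(SENSOR_FLAGS))
    if unknown:
        raise ValueError(f"unknown sensor flag(s): {unknown}")
    if loader is None:
        # Imported lazily so the helper can be exercised without Kaolin installed.
        from kaolin.io.render import import_synthetic_view as loader
    # Pass every flag explicitly, so rgb (default True) is disabled
    # whenever it is not requested.
    flags = {name: name in sensors for name in SENSOR_FLAGS}
    return loader(root_dir, idx, **flags)
```

With Kaolin installed, a call such as load_view(root_dir, 0, sensors=("rgb", "depth_linear")) would forward rgb=True and depth_linear=True and set every other flag to False.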

Returns

A dictionary of all the sensors selected depending on the arguments:

  • rgb (torch.FloatTensor): the RGB image, of shape \((B, H, W, 3)\).

  • depth_linear (torch.FloatTensor): the depth map with linear scaling, of shape \((B, H, W)\).

  • semantic (torch.IntTensor): the semantic segmentation map, of shape \((B, H, W)\).

  • instance (torch.IntTensor): the instance segmentation map, of shape \((B, H, W)\).

  • bbox_2d_tight (dict): the bounding box, as 4 floats (xmin, xmax, ymin, ymax).

  • normals (torch.FloatTensor): the normals map, of shape \((B, H, W, 3)\).

  • And metadata, a dictionary containing:

    • assets_transform (torch.FloatTensor): the transformation matrix of the combined assets transformations.

    • cam_transform (torch.FloatTensor): the transformation matrix, of shape \((4, 3)\).

    • cam_proj (torch.FloatTensor): the projection matrix, of shape \((3, 1)\).

    • clipping_range (list of float): the range within which objects are seen, as a list of (min, max).

Return type

(dict)
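Since the keys of the returned dictionary depend on which flags were set, it can help to inspect a result before using it. The sketch below is a hypothetical helper (describe_view is not a Kaolin function) that reports the shape of every tensor-like entry and recurses into nested dictionaries such as metadata. It makes no assumption about which keys are present.

```python
def describe_view(view):
    """Summarize a dictionary such as the one returned by import_synthetic_view.

    Tensor-like values (anything with a `shape` attribute) are reported as a
    shape tuple, nested dictionaries are summarized recursively, and any
    other value is reported by its type name.
    """
    summary = {}
    for name, value in view.items():
        if isinstance(value, dict):
            summary[name] = describe_view(value)
        elif hasattr(value, "shape"):
            summary[name] = tuple(value.shape)
        else:
            summary[name] = type(value).__name__
    return summary
```

Printing describe_view(view) for a loaded view gives a quick way to compare the actual tensor shapes against those listed above.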