omnihuman.utils.io module

IO Module

This module provides utility functions for reading and writing files.

read_frames: Reads frames from an image or video file as a 4D tensor (n_frames, n_channels, height, width).
fetch_pretrained_weights: Downloads specific files from a Hugging Face Hub repository and loads specific tensors from them.

omnihuman.utils.io.fetch_pretrained_weights(repo_id: str, weight_name_to_file_name: Dict[str, str]) → Dict[str, Tensor]

Downloads specific files from a Hugging Face Hub repository and loads specific tensors from them.

Parameters:

repo_id (str) – Model repository name on Hugging Face Hub (e.g. “organization/their-awesome-model”).
weight_name_to_file_name (Dict[str, str]) – Mapping from layer name to the shard file that contains its weights. The names can be looked up in the model.safetensors.index.json file of the Hugging Face model repository.

Returns:

Mapping from layer name to its weight tensor.

Return type:

Dict[str, torch.Tensor]
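
As a minimal sketch of how the weight_name_to_file_name argument might be prepared: a safetensors index file stores a "weight_map" object that maps tensor names to shard file names, so the mapping can be cut down to the layers of interest. The helper build_weight_name_to_file_name, the layer names, and the shard names below are hypothetical and only illustrate the shape of the data; the call to fetch_pretrained_weights is left as a comment because it needs omnihuman and network access.

```python
import json
from typing import Dict, Iterable


def build_weight_name_to_file_name(index_json: str, wanted: Iterable[str]) -> Dict[str, str]:
    """Hypothetical helper: select entries from the "weight_map" of a safetensors index."""
    weight_map = json.loads(index_json)["weight_map"]
    names = list(wanted)
    missing = [name for name in names if name not in weight_map]
    if missing:
        raise KeyError(f"layers not found in index: {missing}")
    return {name: weight_map[name] for name in names}


# Made-up index contents, shaped like a model.safetensors.index.json file.
EXAMPLE_INDEX = json.dumps(
    {
        "metadata": {"total_size": 123456},
        "weight_map": {
            "encoder.layer0.weight": "model-00001-of-00002.safetensors",
            "encoder.layer0.bias": "model-00001-of-00002.safetensors",
            "decoder.out.weight": "model-00002-of-00002.safetensors",
        },
    }
)

mapping = build_weight_name_to_file_name(
    EXAMPLE_INDEX, ["encoder.layer0.weight", "decoder.out.weight"]
)
print(mapping)

# With omnihuman installed, the mapping could then be passed on (not run here):
# from omnihuman.utils.io import fetch_pretrained_weights
# weights = fetch_pretrained_weights("organization/their-awesome-model", mapping)
```

Per the documentation above, the returned dictionary would be keyed by the same layer names that were passed in.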

omnihuman.utils.io.read_frames(path: str) → Tensor

Reads frames from an image or video file as a 4D tensor (n_frames, n_channels, height, width).

Parameters:

path (str) – Where the image or video file is located.

Raises:

ValueError – If the file type is neither image nor video.

Returns:

Frames as a 4D torch tensor of shape (n_frames, n_channels, height, width).

Return type:

torch.Tensor
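
The documented ValueError contract can be illustrated with a small standard-library sketch. It assumes the file type is decided from the guessed MIME type of the path, which may not be how read_frames actually detects it; classify_media is a hypothetical name and not part of omnihuman.

```python
import mimetypes


def classify_media(path: str) -> str:
    """Hypothetical check: return "image" or "video", else raise ValueError.

    Assumption: the type is inferred from the file extension via mimetypes.
    The real read_frames may use a different detection strategy.
    """
    mime, _ = mimetypes.guess_type(path)
    if mime is not None:
        kind = mime.split("/")[0]
        if kind in ("image", "video"):
            return kind
    raise ValueError(f"{path!r} is neither an image nor a video file")


for candidate in ["photo.png", "clip.mp4", "notes.txt"]:
    try:
        print(candidate, "->", classify_media(candidate))
    except ValueError as err:
        print("rejected:", err)

# With omnihuman installed, a guarded call might look like this (not run here):
# from omnihuman.utils.io import read_frames
# frames = read_frames("clip.mp4")
# n_frames, n_channels, height, width = frames.shape
```

Unpacking frames.shape into four names, as in the commented call, follows the documented (n_frames, n_channels, height, width) layout.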