Skip to content

Nymeria: a massive collection of multimodal egocentric daily motion in the wild

License

Notifications You must be signed in to change notification settings

facebookresearch/nymeria_dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Nymeria Dataset

[Project Page] [Data Explorer] [Paper] [Bibtex]

Nymeria is the world's largest dataset of human motion in the wild, capturing diverse people performing diverse activities across diverse locations. It is first of a kind to record body motion using multiple egocentric multimodal devices, all accurately synchronized and localized in one metric 3D world. Nymeria is also the world's largest motion dataset with natural language descriptions. The dataset is designed to accelerate research in egocentric human motion understanding and presents exciting challenges to advance contextualized computing and future AR/VR technology. This repository hosts the API for downloading and visualizing the dataset.

Nymeria dataset teaser with 100 random samples Nymeria dataset highlight statistics

Getting Started

Installation

Run the following commands to create a conda environment pymeria with this repository installed by pip.

   git clone git@github.com:facebookresearch/nymeria_dataset.git
   cd nymeria_dataset
   conda env create -f environment.yml

Note. Currently we support Linux and MacOS but not Windows. This is due to how we install pymomentum. This dependency is used to load the parametric mesh model for body motion. If your workflow does not require this modality, removing pymomentum should be sufficient to get the code running on Windows.

Download dataset

Review the dataset LICENSE to ensure your use case is covered.

Before you start. Nymeria has more than 1200 sequences, and each sequence contains data/annotations recorded by multiple devices. Altogether the dataset is approximately 70TB. To easy access, the dataset is chunked into sequences and sequences into data groups. A data group is a fixed collection of files, which must be downloaded together via a url. The predefined data groups are specified in definition.py. Each sequence is tagged by a list of attributes, as described in sequence_attributes.py. We have built basic support to filter sequences by their attributes. With this in mind, continue with one of the following paths.

Option 1 - Download sample files. Visit dataset explorer, click any sequences for detailed view. On the right panel, locate a list of links to download particular data groups for that sequence.

Option 2 - Batch download multiple sequences. For batch download, you need to obtain a JSON file with urls. There are two ways to achieve this. First, you can visit the project page, and sign up for Access the Dataset located at the bottom to be directed to the download page. The downloaded file will contain the urls to the full dataset. Alternatively, you can generate a customized JSON file with selected sequences and modalities on dataset explorer. Either way, the urls provided by the JSON file is valid for 14 days. Please obtain a new JSON file upon expiration.

With the JSON file, you can visit the urls to download data. For convinience, we provide download.py as an example script to parses the JSON file and download data into formatted directories. Run the script as follows.

conda activate pymeria
cd nymeria_dataset

python download.py -i <nymeria_download_urls.json> -o <output_path> [-k <partial_matching_key>]

The downloading script will produce a download_summary.json under the <output_path>. To customize the data groups to be downloaded, modify the function get_groups(). The optional argument -k implements a partial key matching to select sequences. Nymeria sequences are named by the following convention <date>_<session_id>_<fake_name>_<act_id>_<uid>. Here are some examples for how to use the sequence filter.

# E.g., get all sequences collected in June
python download.py -i <Nymeria_download_urls.json> -o <output_path> -k 202306

# E.g., get all sequences from participant with fake_name, 'james_johnson'
python download.py -i <Nymeria_download_urls.json> -o <output_path> -k james_johnson

# E.g., get a particular sequence with uid egucf6
python download.py -i <Nymeria_download_urls.json> -o <output_path> -k egucf6

Load Nymeria data with visualization

To load Nymeria sequences, please refer to the implementation of NymeriaDataProvider. This class expects formatted sequence directory as downloaded by previous section. It can be configured to load different modalities (c.f. NymeriaDataProviderConfig). Currently, this class implements the following functions

  • Loading Vrs recordings and their MPS location output.
  • Loading body motion as XSens kinematic skeleton and Momentum parametric mesh model.
  • Synchronize data from multiple sources.
  • Compute alignment to register body motion into the same world coordinates of Aria devices.

To visualize a sequence, run the viewer as follows. Please download all modalities for one sequence, to ensure the code runs as expected.

python viewer.py -i <nymeria_sequence_path> [-s]

The following two figures shows how the visualizer looks like. The 3D view renders body motion, point clouds and device trajectories. The 2D view renders synchronized RGB video from the participants Aria glasses and the observer Aria glasses. The code uses rerun for rendering. You can toggle the viewer to show different modalities, and configure it by NymeriaViewerConfig.

Nymeria sequence viewer teaser1 Nymeria sequence viewer teaser2

License

Nymeria dataset and code is released by Meta under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). Data and code may not be used for commercial purposes. For more information, please refer to the LICENSE file included in this repository.

Attribution

When using the dataset and code, please attribute it as follows:

@inproceedings{ma24eccv,
      title={Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild},
      author={Lingni Ma and Yuting Ye and Fangzhou Hong and Vladimir Guzov and Yifeng Jiang and Rowan Postyeni and Luis Pesqueira and Alexander Gamino and Vijay Baiyya and Hyo Jin Kim and Kevin Bailey and David Soriano Fosas and C. Karen Liu and Ziwei Liu and Jakob Engel and Renzo De Nardi and Richard Newcombe},
      booktitle={the 18th European Conference on Computer Vision (ECCV)},
      year={2024},
      url={https://arxiv.org/abs/2406.09905},
}

Contribute

We welcome contributions! Go to CONTRIBUTING and our CODE OF CONDUCT for how to contribute.

About

Nymeria: a massive collection of multimodal egocentric daily motion in the wild

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages