speech2gesture_PoseGAN

A model which predict gestures from speech
- This repository is based on text2gesture
- original paper

Procedure

1. Download raw data

See "Download raw data" in "Speech_driven_gesture_generation_with_autoencoder" repository

2. Split dataset

See "Split dataset" in "Speech_driven_gesture_generation_with_autoencoder"

3. Convert the dataset into vectors

python create_vector.py DATA_DIR

Dataset is created by separating 64 frames each (both speech and motion)
Shape
- Speech: (block of frames, 26, 64)
- Motion: (block of frames, 192, 64)
The mean and standard deviation parameters obtained when standardizing the training data are located in . /norm/.

4. train

python train.py [--batch_size] [--epochs] [--lr] [--weight_decay] [--embedding_dimension]
                [--outdir_path] [--device] [--gpu_num] [--speech_path] [--pose_path] [--generator]
                [--gan] [--discriminator] [--lambda_d]

See "Usage" in "text2gesture" for details.

5. predict

python predict.py [--modelpath] [--inputpath] [--outpath]

The argument of --modelpath is set to specifies the folder where the generator model is located
- model is output by train.py and located in ./out/datetime/generator_datetime_weights.pth

6. reshape

python reshape-predict.py [--denorm] [--denormpath] [--datatype] [--npypath] [--outpath]

If you want to undo the normalized data, set the argument of --denorm to 1. In this case, --denormpath and --datatype should be set. (--datatype defaults to train.)
- --denormpath and --datatype are arguments to specify the directory where mean and standard deviation parameters obtained when standardizing the training data are located (Same as /norm/ output path in chapter 3.)
--npypath is set to the folder where the test data is located

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
dataset		dataset
norm		norm
out		out
predict		predict
predict_reshaped		predict_reshaped
test_inputs		test_inputs
.DS_Store		.DS_Store
README.md		README.md
README_ja.md		README_ja.md
create_vector.py		create_vector.py
hierarchy.txt		hierarchy.txt
log_output.py		log_output.py
loss.py		loss.py
model.py		model.py
predict.py		predict.py
prepare_data.py		prepare_data.py
reshape-predict.py		reshape-predict.py
silence.wav		silence.wav
tools.py		tools.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

speech2gesture_PoseGAN

Procedure

1. Download raw data

2. Split dataset

3. Convert the dataset into vectors

4. train

5. predict

6. reshape

About

Uh oh!

Releases

Packages

Languages

GestureGeneration/speech2gesture_PoseGAN

Folders and files

Latest commit

History

Repository files navigation

speech2gesture_PoseGAN

Procedure

1. Download raw data

2. Split dataset

3. Convert the dataset into vectors

4. train

5. predict

6. reshape

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages