forked from sdbds/hallo-for-windows
Merge branch 'fudan-generative-vision:main' into main
Showing 5 changed files with 69 additions and 5 deletions.
@@ -33,7 +33,7 @@
## 📸 Showcase

https://github.com/fudan-generative-vision/hallo/assets/17402682/294e78ef-c60d-4c32-8e3c-7f8d6934c6bd
https://github.com/fudan-generative-vision/hallo/assets/17402682/9d1a0de4-3470-4d38-9e4f-412f517f834c

### 🎬 Honoring Classic Films
@@ -75,6 +75,8 @@ Explore the resources developed by our community to enhance your experience with
- [hallo-webui](https://github.com/daswer123/hallo-webui) - Explore the WebUI created by [@daswer123](https://github.com/daswer123).
- [hallo-for-windows](https://github.com/sdbds/hallo-for-windows) - Utilize Hallo on Windows with the guide by [@sdbds](https://github.com/sdbds).
- [ComfyUI-Hallo](https://github.com/AIFSH/ComfyUI-Hallo) - Integrate Hallo with the ComfyUI tool by [@AIFSH](https://github.com/AIFSH).
- [hallo-docker](https://github.com/ashleykleynhans/hallo-docker) - Docker image for Hallo by [@ashleykleynhans](https://github.com/ashleykleynhans).
- [RunPod Template](https://runpod.io/console/deploy?template=aeyibwyvzy&ref=2xxro4syy) - Deploy Hallo to RunPod by [@ashleykleynhans](https://github.com/ashleykleynhans).

Thanks to all of them.
@@ -292,6 +294,12 @@ Interested individuals are encouraged to contact us at [[email protected]](ma

The development of portrait image animation technologies driven by audio inputs poses social risks, such as the ethical implications of creating realistic portraits that could be misused for deepfakes. To mitigate these risks, it is crucial to establish ethical guidelines and responsible use practices. Privacy and consent concerns also arise from using individuals' images and voices. Addressing these involves transparent data usage policies, informed consent, and safeguarding privacy rights. By addressing these risks and implementing mitigations, the research aims to ensure the responsible and ethical development of this technology.

## 🤗 Acknowledgements

We would like to thank the contributors to the [magic-animate](https://github.com/magic-research/magic-animate), [AnimateDiff](https://github.com/guoyww/AnimateDiff), [ultimatevocalremovergui](https://github.com/Anjok07/ultimatevocalremovergui), [AniPortrait](https://github.com/Zejun-Yang/AniPortrait) and [Moore-AnimateAnyone](https://github.com/MooreThreads/Moore-AnimateAnyone) repositories, for their open research and exploration.

If we missed any open-source projects or related articles, we would like to complement the acknowledgement of this specific work immediately.

## 👏 Community Contributors

Thank you to all the contributors who have helped to make this project better!
@@ -0,0 +1,51 @@
"""
This script is a gradio web ui.
The script takes an image and an audio clip, and lets you configure all the
variables such as cfg_scale, pose_weight, face_weight, lip_weight, etc.
Usage:
This script can be run from the command line with the following command:
python scripts/app.py
"""
import argparse

import gradio as gr
from inference import inference_process


def predict(image, audio, pose_weight, face_weight, lip_weight, face_expand_ratio, progress=gr.Progress(track_tqdm=True)):
    """
    Create a gradio interface with the configs.
    """
    _ = progress
    config = {
        'source_image': image,
        'driving_audio': audio,
        'pose_weight': pose_weight,
        'face_weight': face_weight,
        'lip_weight': lip_weight,
        'face_expand_ratio': face_expand_ratio,
        'config': 'configs/inference/default.yaml',
        'checkpoint': None,
        'output': ".cache/output.mp4"
    }
    args = argparse.Namespace()
    for key, value in config.items():
        setattr(args, key, value)
    return inference_process(args)


app = gr.Interface(
    fn=predict,
    inputs=[
        gr.Image(label="source image (no webp)", type="filepath", format="jpeg"),
        gr.Audio(label="source audio", type="filepath"),
        gr.Number(label="pose weight", value=1.0),
        gr.Number(label="face weight", value=1.0),
        gr.Number(label="lip weight", value=1.0),
        gr.Number(label="face expand ratio", value=1.2),
    ],
    outputs=[gr.Video()],
)
app.launch()
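
Note on the argument packing in `predict`: the function builds a dict and copies it onto an empty `argparse.Namespace` one attribute at a time. The sketch below isolates that step so it can be checked without Gradio or the model. `build_args` is a hypothetical helper, not part of this commit, and `face.jpg` / `speech.wav` are placeholder paths. The default values mirror the `gr.Number` defaults in the diff. It passes the dict to the `Namespace` constructor as keyword arguments, which should give the same result as the `setattr` loop.

```python
import argparse


def build_args(image, audio, pose_weight=1.0, face_weight=1.0,
               lip_weight=1.0, face_expand_ratio=1.2):
    """Hypothetical helper: pack UI values into a Namespace like predict() does."""
    config = {
        "source_image": image,
        "driving_audio": audio,
        "pose_weight": pose_weight,
        "face_weight": face_weight,
        "lip_weight": lip_weight,
        "face_expand_ratio": face_expand_ratio,
        # Fixed values copied from the diff; the UI does not expose them.
        "config": "configs/inference/default.yaml",
        "checkpoint": None,
        "output": ".cache/output.mp4",
    }
    # Namespace accepts keyword arguments, so no setattr loop is needed.
    return argparse.Namespace(**config)


# Placeholder file names, used only to show the resulting attributes.
args = build_args("face.jpg", "speech.wav", lip_weight=1.5)
print(args.source_image, args.lip_weight, args.output)
```

Keeping this step in a small function of its own would let the defaults be tested separately from the UI wiring, if that is wanted.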