HowieMa

Haoyu Ma HowieMa

Research Scientist @ Meta, GenAI | CS Ph.D. @ UC, Irvine

102 followers · 58 following

@uci @seu @meta
Menlo Park, CA, USA
https://howiema.github.io/
https://ai.meta.com/people/926455432572211/haoyu-ma/

Stars

congwei1230 / MoCha-Demo

[NeurIPS 2025 Spotlight] Demo implementation of MoCha: Towards Movie-Grade Talking Character Synthesis

Python · 14 stars · 2 forks · Updated Dec 27, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python · 15,638 stars · 2,476 forks · Updated Mar 5, 2026

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python · 5,761 stars · 506 forks · Updated Oct 27, 2025

showlab / FAR

Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"

Python · 300 stars · 14 forks · Updated Apr 23, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python · 25,331 stars · 1,867 forks · Updated Jul 31, 2025

fudan-generative-vision / hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python · 8,650 stars · 1,120 forks · Updated Sep 14, 2024

deepseek-ai / DeepSeek-V3

Python · 102,278 stars · 16,591 forks · Updated Aug 28, 2025

IDEA-Research / DWPose

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Python · 2,683 stars · 166 forks · Updated Dec 12, 2023

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python · 28,318 stars · 2,628 forks · Updated Mar 21, 2026

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python · 11,857 stars · 1,211 forks · Updated Nov 21, 2025

showlab / ROICtrl

Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation

Python · 110 stars · Updated Apr 16, 2025

junjiehe96 / UniPortrait

[ICCV2025] UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization

Python · 276 stars · 13 forks · Updated May 1, 2025

AILab-CVC / SEED-X

Multimodal Models in Real World

Jupyter Notebook · 557 stars · 23 forks · Updated Feb 24, 2025

rhymes-ai / Allegro

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python · 1,131 stars · 71 forks · Updated Feb 7, 2025

rese1f / aurora

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Python · 141 stars · 6 forks · Updated Jun 4, 2025

baaivision / Emu3

Next-Token Prediction is All You Need

Python · 2,374 stars · 95 forks · Updated Jan 12, 2026

tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook · 6,498 stars · 422 forks · Updated Jun 28, 2024

FireRedTeam / StoryMaker

StoryMaker: Towards consistent characters in text-to-image generation

Python · 722 stars · 60 forks · Updated Dec 2, 2024

ai-forever / MoVQGAN

MoVQGAN - model for the image encoding and reconstruction

Jupyter Notebook · 264 stars · 18 forks · Updated Oct 31, 2023

Adamdad / kat

[ICLR2025] Kolmogorov-Arnold Transformer

Python · 855 stars · 58 forks · Updated Mar 23, 2025

TencentARC / SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python · 887 stars · 69 forks · Updated Oct 11, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python · 1,940 stars · 94 forks · Updated Aug 15, 2024

Jeff-LiangF / streamv2v

Official PyTorch implementation of StreamV2V.

Python · 541 stars · 59 forks · Updated Dec 29, 2025

LukasBommes / mv-extractor

Extract frames and motion vectors from H.264 and MPEG-4 encoded video.

C++ · 393 stars · 74 forks · Updated Oct 14, 2025

PRIV-Creation / Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

1,113 stars · 33 forks · Updated Dec 31, 2024

heheyas / V3D

[T-PAMI 2025] V3D: Video Diffusion Models are Effective 3D Generators

Python · 519 stars · 18 forks · Updated Mar 26, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python · 28,732 stars · 2,916 forks · Updated Apr 30, 2025

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python · 11,926 stars · 879 forks · Updated Jul 18, 2024

showlab / DragAnything

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

Python · 505 stars · 16 forks · Updated Jul 2, 2024

thu-ml / CRM

[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.

Python · 684 stars · 55 forks · Updated Nov 28, 2024
