James Hunter Carter's picture

James Hunter Carter PRO

jameshuntercarter

·

https://www.jameshuntercarter.com

platformkit

AI & ML interests

None yet

Recent Activity

liked a model about 8 hours ago

YatharthS/LinaCodec

liked a model about 21 hours ago

Kijai/LTXV2_comfy

liked a Space 1 day ago

ovi054/ltx-2-Audio-to-Video

View all activity

Organizations

upvoted a paper 2 days ago

End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Paper • 2202.08520 • Published Feb 17, 2022 • 2

upvoted a collection 2 days ago

Audio Spaces

168 items • Updated 2 days ago • 21

upvoted a paper 3 days ago

SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering

Paper • 2508.03448 • Published Aug 5, 2025 • 6

upvoted a collection 9 days ago

SoulX-Podcast

Models of SoulX-Podcast • 5 items • Updated Oct 29, 2025 • 46

upvoted a collection about 1 month ago

Marigold Computer Vision

All things Marigold • 17 items • Updated May 15, 2025 • 24

upvoted a collection about 2 months ago

GLM-4.6V

3 items • Updated Dec 8, 2025 • 48

upvoted a collection 3 months ago

Edit-R1

5 items • Updated Oct 21, 2025 • 8

upvoted 2 papers 4 months ago

DreamOmni2: Multimodal Instruction-based Editing and Generation

Paper • 2510.06679 • Published Oct 8, 2025 • 73

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

Paper • 2509.17627 • Published Sep 22, 2025 • 66

upvoted a collection 5 months ago

MGM-Omni

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech • 18 items • Updated Oct 11, 2025 • 11

upvoted a paper 5 months ago

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators

Paper • 2402.06894 • Published Feb 10, 2024 • 1

upvoted a collection 10 months ago

VACE

VACE: All-in-One Video Creation and Editing • 7 items • Updated May 15, 2025 • 34

upvoted a paper 10 months ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30, 2025 • 138

upvoted a collection 10 months ago

LipSync and Face Operations

23 items • Updated 23 days ago • 62

upvoted a paper 11 months ago

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

Paper • 2502.13995 • Published Feb 19, 2025 • 9

upvoted 4 papers 12 months ago

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation

Paper • 2502.07531 • Published Feb 11, 2025 • 12

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7, 2025 • 106

Stable Flow: Vital Layers for Training-Free Image Editing

Paper • 2411.14430 • Published Nov 21, 2024 • 22

DynVFX: Augmenting Real Videos with Dynamic Content

Paper • 2502.03621 • Published Feb 5, 2025 • 31

upvoted a collection about 1 year ago

Zero-Shot Voice Cloning

TTS models that support zero-shot voice cloning • 8 items • Updated about 1 month ago • 14