cuuupid / idm-vton

Best-in-class clothing virtual try-on in the wild (non-commercial use only)

741.1K runs
Public

cuuupid / research

Deep Research ported to Cog!

9 runs
Public

cuuupid / mel-medarda-tts

TTS with the voice of Mel Medarda from Arcane, trained using Zonos-v0.1

5 runs
Public

cuuupid / zonos

Zonos-v0.1 beta, a SOTA text-to-speech Transformer model with extraordinary expressive range, built by Zyphra.

254 runs
Public

cuuupid / markitdown

Microsoft's tool to convert Office documents, PDFs, images, audio, and more to LLM-ready markdown.

34.1K runs
Public

cuuupid / qwen2-vl-2b

SOTA open-source model for chatting with videos and the newest model in the Qwen family

518 runs
Public

cuuupid / cogvideox-5b

Generate high quality videos from a prompt

2.1K runs
Public

cuuupid / flux-lineart

Flux finetuned for black and white line art.

6.1K runs
Public

cuuupid / gte-qwen2-7b-instruct

Embed text with Qwen2-7b-Instruct

708.2K runs
Public

cuuupid / whisper-webrtc

Talk to an AI "friend" with ultra low latency over WebRTC

3 runs
Public

cuuupid / glm-4v-9b

GLM-4V is a multimodal model released by Tsinghua University that is competitive with GPT-4o and establishes a new SOTA on several benchmarks, including OCR.

86.5K runs
Public

cuuupid / sdxl-lineart

SDXL finetuned on line art

1.7K runs
Public

cuuupid / garden-state-llama

Llama-3-8B finetuned with ReFT to hyperfocus on New Jersey, the Garden State, the best state, the only state!

106 runs
Public

cuuupid / golden-gate-llama

An example using Garden State Llama to ReFT on the Golden Gate Bridge.

32 runs
Public

cuuupid / e5-mistral-7b-instruct

Instruction-tuned E5 text embeddings based on Mistral.

138 runs
Public

cuuupid / sdxl-meow

make meow emojis!

71 runs
Public

cuuupid / marker

Convert scanned or electronic documents to markdown, very very very fast

2.5K runs
Public

cuuupid / seamless_expressive

Translate audio while keeping the style, pronunciation, and tone of the original audio.

788 runs
Public

cuuupid / minicpm-llama3-v-2.5

MiniCPM-Llama3-V 2.5, a new SOTA open-source VLM that surpasses GPT-4V-1106 and Phi-128k on a number of benchmarks.

128 runs
Public