cuuupid / markitdown
Microsoft's tool to convert Office documents, PDFs, images, audio, and more to LLM-ready markdown.
cuuupid / qwen2-vl-2b
SOTA open-source model for chatting with videos and the newest model in the Qwen family
cuuupid / cogvideox-5b
Generate high quality videos from a prompt
cuuupid / idm-vton
Best-in-class clothing virtual try on in the wild (non-commercial use only)
cuuupid / flux-lineart
Flux finetuned for black and white line art.
cuuupid / gte-qwen2-7b-instruct
Embed text with Qwen2-7b-Instruct
cuuupid / whisper-webrtc
Talk to an AI "friend" with ultra low latency over WebRTC
cuuupid / glm-4v-9b
GLM-4V is a multimodal model released by Tsinghua University that is competitive with GPT-4o and establishes a new SOTA on several benchmarks, including OCR.
cuuupid / sdxl-lineart
SDXL finetuned on line art
cuuupid / garden-state-llama
Llama-3-8B finetuned with ReFT to hyperfocus on New Jersey, the Garden State, the best state, the only state!
cuuupid / golden-gate-llama
An example using Garden State Llama to ReFT on the Golden Gate bridge.
cuuupid / e5-mistral-7b-instruct
Finetuned E5 embeddings for instruct based on Mistral.
cuuupid / sdxl-meow
make meow emojis!
cuuupid / marker
Convert scanned or electronic documents to markdown, very very very fast
cuuupid / seamless_expressive
Translate audio while keeping the original style, pronunciation and tone of your original audio.
cuuupid / minicpm-llama3-v-2.5
MiniCPM LLama3-V 2.5, a new SOTA open-source VLM that surpasses GPT-4V-1106 and Phi-128k on a number of benchmarks.