Collections

Use handy tools

These models provide utility functions for working with media like images, audio, and video. They serve as convenient building blocks for media processing pipelines and workflows.

Key capabilities:

  • Background removal - Remove backgrounds from images and videos
  • Content moderation - Detect NSFW images
  • Audio visualization - Convert audio clips to animated waveforms
  • Video editing - Extract frames, add captions, remove background, make cuts via transcript
  • Format conversion - Convert between images, frames, video and GIFs

Our Pick: cjwbw/rembg

For most people, we recommend the cjwbw/rembg model for removing backgrounds from images. It’s blazing fast, easy to use, and produces great results. With over 1.4 million runs, it’s battle-tested and very popular.

Also Great: falcons-ai/nsfw_image_detection

If you need to moderate image content, falcons-ai/nsfw_image_detection is an excellent choice. This fine-tuned vision transformer classifies images as safe-for-work or not-safe-for-work. It’s widely used with over 25 million runs.

Noteworthy

A few other models to consider for common media processing tasks:

Recommended models

smoretalk / rembg-enhance

A background removal model enhanced with ViTMatte.

2.6M runs

lucataco / deep3d

Deep3D: Real-Time end-to-end 2D-to-3D Video Conversion, based on deep learning

421 runs

lucataco / depth-anything-video

Depth Anything on full video files

354 runs

camenduru / bria-rmbg

Remove background from images using BRIA-RMBG-1.4

10.6K runs

lucataco / img-and-audio2video

Take an image and an audio file and create a video clip

2.5K runs

fofr / toolkit

Video toolkit – convert, make GIFs, extract audio

4.3K runs

fictions-ai / autocaption

Automatically add captions to a video

26.1K runs

charlesmccarthy / addwatermark

Add a watermark to your videos using the power of Replicate brought to you from your friends at FullJourney.AI

25.3K runs

jd7h / edit-video-by-editing-text

A pipeline for superfast video editing! Make cuts to a video by editing its transcript.

551 runs

falcons-ai / nsfw_​image_​detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

31.2M runs

chigozienri / image-urls-to-video

Take a list of image URLs as frames and output a video

694 runs

fofr / controlnet-preprocessors

Canny, soft edge, depth, lineart, segmentation, pose, etc

38.5K runs

lucataco / remove-bg

Remove background from an image

4.6M runs

fofr / audio-to-waveform

Create a waveform video from audio

381.3K runs

fofr / video-to-frames

Split a video into frames

12.8K runs

fofr / frames-to-video

Convert a set of frames to a video

1.5K runs

arielreplicate / robust_​video_​matting

extract foreground of a video

48.1K runs

cjwbw / rembg

Remove images background

7.3M runs