tomasmcm / v1olet-marcoroni-go-bruins-merge-7b

Source: v1olet/v1olet_marcoroni-go-bruins-merge-7B ✦ Quant: TheBloke/v1olet_marcoroni-go-bruins-merge-7B-AWQ ✦ Merge AIDC-ai-business/Marcoroni-7B-v3 and rwitz/go-bruins-v2 using slerp merge

  • Public
  • 70 runs
  • Paper
  • License

Input

Output

Run time and cost

This model costs approximately $0.011 to run on Replicate, or 90 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A40 GPU hardware. Predictions typically complete within 19 seconds. The predict time for this model varies significantly based on the inputs.

Readme

12th December 2023

We are ranked 6th on the overall leaderboard and 1st in the 7B leaderboard! 🔥🔥🔥

Merge AIDC-ai-business/Marcoroni-7B-v3 and rwitz/go-bruins-v2 using slerp merge from https://github.com/cg123/mergekit.

config.yaml

slices:
  - sources:
      - model: AIDC-ai-business/Marcoroni-7B-v3
        layer_range: [0, 32]
      - model: rwitz/go-bruins-v2
        layer_range: [0, 32]
merge_method: slerp
base_model: AIDC-ai-business/Marcoroni-7B-v3
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5 
dtype: float16

You can use alpaca template.

template_format = """{system}
### Instruction:
{prompt}

### Response:
"""

Developed by: Trong-Hieu Nguyen-Mau