cappr下载 - cappr源代码下载

cappr

Ai源码

v0.9.6 - fix Llama 3 tokenizer

下载

CAPPR：及时概率后完成

从选择列表中挑选您的LLM。
或计算给定提示的完成的概率，这可能很有用。
从开源LLM中挤出更多。

用法

使用GGUF型号

 from llama_cpp import Llama
from cappr . llama_cpp . classify import predict

model = Llama ( "./TinyLLama-v0.Q8_0.gguf" , verbose = False )

prompt = """Gary told Spongebob a story:
There once was a man from Peru; who dreamed he was eating his shoe. He
woke with a fright, in the middle of the night, to find that his dream
had come true.

The moral of the story is to"""

completions = (
  "look at the bright side" ,
  "use your imagination" ,
  "eat shoes" ,
)

pred = predict ( prompt , completions , model )
print ( pred )
# use your imagination

有关使用GGGUF模型的更多信息，请参见文档的此页面。

使用拥抱面孔变压器模型

 from transformers import AutoModelForCausalLM , AutoTokenizer
from cappr . huggingface . classify import predict

model_name = "gpt2"
model = AutoModelForCausalLM . from_pretrained ( model_name )
tokenizer = AutoTokenizer . from_pretrained ( model_name )

prompt = "Which planet is closer to the Sun: Mercury or Earth?"
completions = ( "Mercury" , "Earth" )

pred = predict ( prompt , completions , model_and_tokenizer = ( model , tokenizer ))
print ( pred )
# Mercury

有关使用transformers模型的更多信息，请参见文档的此页面。

缓存说明节省时间

许多提示从相同的指令开始，例如系统提示以及少数示例输入输出对。与其在通用指令上反复运行模型，不如将其缓存，以使未来的计算更快。

这是一个使用cappr.huggingface.classify.cache_model的示例。

 from transformers import AutoModelForCausalLM , AutoTokenizer
from cappr . huggingface . classify import cache_model , predict

# Load model and tokenizer
model = AutoModelForCausalLM . from_pretrained ( "gpt2" )
tokenizer = AutoTokenizer . from_pretrained ( "gpt2" )
model_and_tokenizer = ( model , tokenizer )

# Create data
prompt_prefix = '''Instructions: complete the sequence.
Here are examples:
A, B, C => D
1, 2, 3 => 4

Complete this sequence:'''

prompts = [ "X, Y =>" , "10, 9, 8 =>" ]
completions = [ "7" , "Z" , "Hi" ]

# Cache prompt_prefix because it's used for all prompts
cached_model_and_tokenizer = cache_model (
    model_and_tokenizer , prompt_prefix
)

# Compute
preds = predict (
    prompts , completions , cached_model_and_tokenizer
)
print ( preds )
# ['Z', '7']

计算令牌级的日志探针

这是一个使用cappr.huggingface.classify.log_probs_conditional的示例。

 from transformers import AutoModelForCausalLM , AutoTokenizer
from cappr . huggingface . classify import log_probs_conditional

# Load model and tokenizer
model = AutoModelForCausalLM . from_pretrained ( "gpt2" )
tokenizer = AutoTokenizer . from_pretrained ( "gpt2" )

# Create data
prompts = [ "x y" , "a b c" ]
completions = [ "z" , "d e" ]

# Compute
log_probs_completions = log_probs_conditional (
    prompts , completions , model_and_tokenizer = ( model , tokenizer )
)

# Outputs (rounded) next to their symbolic representation

print ( log_probs_completions [ 0 ])
# [[-4.5],        [[log Pr(z | x, y)],
#  [-5.6, -3.2]]   [log Pr(d | x, y),    log Pr(e | x, y, d)]]

print ( log_probs_completions [ 1 ])
# [[-9.7],        [[log Pr(z | a, b, c)],
#  [-0.2, -0.03]]  [log Pr(d | a, b, c), log Pr(e | a, b, c, d)]]

使用cappr.utils.classify.agg_log_probs有效地汇总这些日志概况。

有关稍微高级的演示，请参见./demos/huggingface/dpo.ipynb 。

从分步完成中提取最终答案

逐步和经过思考的提示是使LLM``理性''更复杂的任务的高效方法。但是，如果您需要结构化的输出，则逐步完成是笨拙的。给定可能的答案列表，使用CAPPR从这些类型的完成中提取最终答案。

在文档中查看此想法。

分批运行，预测概率

 from transformers import AutoModelForCausalLM , AutoTokenizer
from cappr . huggingface . classify import predict_proba

# Load a model and its tokenizer
model_name = "gpt2"
model = AutoModelForCausalLM . from_pretrained ( model_name )
tokenizer = AutoTokenizer . from_pretrained ( model_name )

prompts = [
    "Stephen Curry is a" ,
    "Martina Navratilova was a" ,
    "Dexter, from the TV Series Dexter's Laboratory, is a" ,
    "LeBron James is a" ,
]

# Each of the prompts could be completed with one of these:
class_names = ( "basketball player" , "tennis player" , "scientist" )
prior =       (      1 / 6 ,                1 / 6 ,            2 / 3    )
# Say I expect most of my data to have scientists

# Run CAPPr
pred_probs = predict_proba (
    prompts = prompts ,
    completions = class_names ,
    model_and_tokenizer = ( model , tokenizer ),
    batch_size = 2 ,  # whatever fits on your CPU/GPU
    prior = prior ,
)

# pred_probs[i,j] = probability that prompts[i] is classified as class_names[j]
print ( pred_probs . round ( 1 ))
# [[0.5 0.3 0.2]
#  [0.3 0.6 0.2]
#  [0.1 0.1 0.8]
#  [0.8 0.2 0. ]]

# For each prompt, which completion is most likely?
pred_class_idxs = pred_probs . argmax ( axis = - 1 )
preds = [ class_names [ pred_class_idx ] for pred_class_idx in pred_class_idxs ]
print ( preds )
# ['basketball player',
#  'tennis player',
#  'scientist',
#  'basketball player']

分批运行，每个提示都有不同的可能完成集

同样，让我们预测概率。

 from transformers import AutoModelForCausalLM , AutoTokenizer
from cappr . huggingface . classify import predict_proba_examples
from cappr import Example

# Load a model and its tokenizer
model_name = "gpt2"
model = AutoModelForCausalLM . from_pretrained ( model_name )
tokenizer = AutoTokenizer . from_pretrained ( model_name )

# Create a sequence of Example objects representing your classification tasks
examples = [
    Example (
        prompt = "Jodie Foster played" ,
        completions = ( "Clarice Starling" , "Trinity in The Matrix" ),
    ),
    Example (
        prompt = "Batman, from Batman: The Animated Series, was played by" ,
        completions = ( "Pete Holmes" , "Kevin Conroy" , "Spongebob!" ),
        prior =      (     1 / 3      ,      2 / 3     ,      0      ),
    ),
]

# Run CAPPr
pred_probs = predict_proba_examples (
    examples , model_and_tokenizer = ( model , tokenizer )
)

# pred_probs[i][j] = probability that examples[i].prompt is classified as
# examples[i].completions[j]
print ([ example_pred_probs . round ( 2 ) for example_pred_probs in pred_probs ])
# [array([0.7, 0.3]),
#  array([0.03, 0.97, 0.  ])]

# For each example, which completion is most likely?
pred_class_idxs = [
    example_pred_probs . argmax () for example_pred_probs in pred_probs
]
preds = [
    example . completions [ pred_class_idx ]
    for example , pred_class_idx in zip ( examples , pred_class_idxs )
]
print ( preds )
# ['Clarice Starling',
#  'Kevin Conroy']

有关更难的分类任务的演示，请参见demos 。

对于CAPPR，GPTQ模型是最具计算机的性能。这些模型与cappr.huggingface.classify兼容。有关使用这些模型的更多信息，请参见文档的此页面。

文档

https://cappr.readthedocs.io

安装

请参阅文档的此页面。

动机

降低工程复杂性。

有关更多信息，请参见文档的此页面。

表现

统计表现

计算性能

它如何工作

您输入一个prompt字符串， end_of_prompt字符串（一个空格或空）和一组候选completion字符串，使字符串 -

{ prompt }{ end_of_prompt }{ completion }

- 自然流动的思想。 CAPPR选择completion ，该完成大多可能通过prompt -

c ompletion
后
迅速的
公关的可行性

- 正如我关于Cross验证的问题的充实。

地方发展

请参阅文档的此页面。

托多

我在这里倾倒毒品：

代码更改

研究实验

随时提出问题

展开

附加信息

版本 v0.9.6 - fix Llama 3 tokenizer
类型 Ai源码
更新时间 2025-07-01
大小 1.62MB
来自于 Github

cappr

CAPPR：及时概率后完成

用法

文档

安装

相关工作

动机

表现

它如何工作

地方发展

托多

c ares

C计划

C驾驶汽车

代号C手游

代号SC

c哩c哩动漫

chat.petals.dev

GPT Prompt Templates

GPTyped

ML stack

awesome free chatgpt

promptl

Google Dorks

shepherd

hidusbf