prompting.validators.reward.dahoas
Module Contents
Classes
- class prompting.validators.reward.dahoas.DahoasRewardModel(path, device)
Bases: prompting.validators.reward.reward.BaseRewardModel
- model_name = 'EleutherAI/gpt-j-6b'
- reward(prompt, completion, name)
- Parameters:
- Return type:
- get_rewards(prompt, completions, name)
- Parameters:
- Return type:
- forward(input_ids=None, past_key_values=None, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, inputs_embeds=None, mc_token_ids=None, labels=None, return_dict=False, output_attentions=False, output_hidden_states=False)
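The listing above gives only signatures, so the following is a minimal sketch of how reward and get_rewards might relate, not the module's actual implementation. StubRewardModel is a hypothetical stand-in that copies the two documented signatures. The word-overlap score, the float return values, and the assumption that get_rewards returns one score per completion in input order are all illustrative; the real class presumably scores text with a model built on model_name and may return tensors instead.

```python
from typing import List


class StubRewardModel:
    """Hypothetical stand-in mirroring the documented method signatures.

    This is NOT DahoasRewardModel; it only illustrates the calling pattern.
    """

    def reward(self, prompt: str, completion: str, name: str) -> float:
        # Placeholder score (assumption): the fraction of distinct prompt
        # words that also appear in the completion.
        prompt_words = set(prompt.lower().split())
        if not prompt_words:
            return 0.0
        completion_words = set(completion.lower().split())
        return len(prompt_words & completion_words) / len(prompt_words)

    def get_rewards(self, prompt: str, completions: List[str], name: str) -> List[float]:
        # Assumed relationship: score each completion independently and
        # return the scores in the same order as the input list.
        return [self.reward(prompt, completion, name) for completion in completions]


if __name__ == "__main__":
    model = StubRewardModel()
    scores = model.get_rewards(
        "what is the capital of France",
        ["the capital is Paris", "no idea"],
        name="example",  # hypothetical label; valid values are not documented here
    )
    print(scores)
```

With the real class, the same calls would be made on an instance constructed as DahoasRewardModel(path, device), where path and device are supplied by the caller.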