MBPP
This is the test set of the Mostly Basic Python Problems (MBPP) dataset, covering problems with IDs in the range [11, 510].
Configuration
Field | Value | Description |
---|---|---|
is_fewshot | | Append few-shot examples before the problem |
run_timeout | | Execution timeout in seconds |
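As a minimal sketch of how the fields above might be assembled, the helper below builds a config dictionary and rejects a non-positive timeout. The function name `make_mbpp_config` and its default values are assumptions made for illustration (the defaults simply mirror the values used in the usage example); they are not part of the sandbox API.

```python
def make_mbpp_config(is_fewshot: bool = True, run_timeout: float = 20) -> dict:
    """Build a config dict for the MBPP dataset.

    Hypothetical helper: the name and defaults are illustrative assumptions.
    """
    if run_timeout <= 0:
        raise ValueError("run_timeout must be a positive number of seconds")
    return {
        "is_fewshot": bool(is_fewshot),    # prepend few-shot examples to each prompt
        "run_timeout": run_timeout,        # execution timeout in seconds
        "dataset_type": "MBPPDataset",
    }


config = make_mbpp_config(is_fewshot=False, run_timeout=10)
```

Validating the timeout on the client side is a design choice of this sketch, so that an obviously invalid value fails before any request is sent.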
Usage
from datasets import load_dataset
import requests

config = {
    'is_fewshot': True,
    'run_timeout': 20,
    'dataset_type': "MBPPDataset"
}

# Get dataset data in sandbox format
data = list(load_dataset("laylarsssss/FusedMBPP", split="test"))

# Request the prompts for the whole dataset
config['provided_data'] = data
prompts = requests.post('http://localhost:8080/get_prompts', json={
    'dataset': 'mbpp',
    'config': config
}).json()

print('please perform model inference on these prompts:')
print('\n'.join([p['prompt'] for p in prompts[:3]]))
print('...')

# your model inference code here
completions = ['' for _ in prompts]

# Submit each completion together with the sample it answers
for completion, sample in zip(completions, data):
    config['provided_data'] = sample
    res = requests.post('http://localhost:8080/submit', json={
        'dataset': 'mbpp',
        'id': '',
        'completion': completion,
        'config': config
    })
    print(f'result: {res.json()}')
    break  # stop after the first sample; remove to evaluate every completion
Note: always put the raw completion in the request. The sandbox extracts the code from it according to the configured mode.
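To illustrate the note above, here is a small sketch of a payload builder that forwards the model output untouched. The helper name `build_submit_payload` and the sample dictionary are hypothetical and exist only for this example; the payload keys are the ones used in the usage code.

```python
def build_submit_payload(completion: str, sample: dict, config: dict) -> dict:
    """Assemble the JSON body for a submit request.

    Hypothetical helper: the completion is passed through unchanged,
    with no client-side code extraction or trimming.
    """
    request_config = dict(config)              # shallow copy; caller's config stays unchanged
    request_config["provided_data"] = sample
    return {
        "dataset": "mbpp",
        "id": "",
        "completion": completion,              # raw model output, surrounding prose included
        "config": request_config,
    }


# Raw model output that mixes prose and code; it is submitted as-is.
raw_completion = "Sure, here is the function:\n\ndef add(a, b):\n    return a + b\n"
sample = {"id": 11}  # placeholder record; real samples come from the dataset
payload = build_submit_payload(raw_completion, sample, {"run_timeout": 20})
```

Copying the config per request, rather than mutating a shared dictionary, is a choice made in this sketch so that each payload can be built independently.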