Spaces:
Running
Running
File size: 43,873 Bytes
f09dece |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 |
{"id": 930, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A baby is crying.", "ground_truth_text": "A baby is crying in distress in short bursts rapidly."}
{"id": 8272, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A man speaks as a door slams.", "ground_truth_text": "Men speak followed by a door shutting."}
{"id": 1272, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A cat meows and a door slams.", "ground_truth_text": "A cat meows followed by the sounds of items clattering."}
{"id": 2733, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A dog barks and growls while people speak.", "ground_truth_text": "A dog barking as a man and a woman are talking followed by water splashing and children talking in the background."}
{"id": 670, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCreate a caption that summarizes this audio. ", "generated_text": "A loud screeching sound followed by people cheering.", "ground_truth_text": "A loud engine revving, then the crowd claps and cheers."}
{"id": 2796, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption would you suggest for this audio? ", "generated_text": "A person is sniffling and sniffing.", "ground_truth_text": "Water trickling followed by a person snickering."}
{"id": 8779, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCaption this audio file for me, please. ", "generated_text": "A chainsaw is revving up and then idling.", "ground_truth_text": "A chainsaw revs then idles and eventually stops."}
{"id": 7437, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A motor revs up and then a hiss is heard.", "ground_truth_text": "Speedboat with water spraying."}
{"id": 1402, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nPlease suggest a caption for the audio I've shared. ", "generated_text": "A dog barks and growls followed by a dog whimpering.", "ground_truth_text": "A dog barks then runs followed by dogs growling."}
{"id": 4934, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCraft a caption that encapsulates this audio. ", "generated_text": "A man and a woman are speaking and a baby is making sounds.", "ground_truth_text": "People are talking as a snake hisses."}
{"id": 907, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you draft a caption for this audio recording? ", "generated_text": "A helicopter flies overhead.", "ground_truth_text": "A helicopter flies close to the ground overhead and flies off."}
{"id": 2187, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A train is moving along the tracks and a bell is ringing.", "ground_truth_text": "Steam train whistle as it approaches on the tracks then recedes."}
{"id": 5944, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A series of electronic beeps.", "ground_truth_text": "A loud, sharp electronic beep is occurring repetitively."}
{"id": 607, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A telephone rings and then stops.", "ground_truth_text": "Phone ringing then the click of being picked up and put back down."}
{"id": 8591, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption would you suggest for this audio? ", "generated_text": "A vehicle engine running and a dog barking.", "ground_truth_text": "Water is running and frog croaks."}
{"id": 6163, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A horse is trotting and birds are chirping.", "ground_truth_text": "Traffic passing in the distance and birds chirp as a running horse gets closer with louder clip-clops."}
{"id": 4560, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you provide a caption for this sound recording? ", "generated_text": "A gentle wind blows and birds chirp.", "ground_truth_text": "Light wind blowing followed by birds chirping as more wind blows."}
{"id": 1526, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you come up with a caption for this audio segment? ", "generated_text": "A large bell is being rung.", "ground_truth_text": "Several bells are chimed."}
{"id": 2348, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCome up with a fitting caption for this audio track. ", "generated_text": "A train is passing by and a dog barks.", "ground_truth_text": "A bus engine idles followed by faint beeping and people conversing in the distance."}
{"id": 366, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCreate a caption that summarizes this audio. ", "generated_text": "A toilet is being flushed and a man speaks.", "ground_truth_text": "Food is frying and being stirred followed by a man talking."}
{"id": 9142, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption would you suggest for this audio? ", "generated_text": "A woman speaks while water runs and food sizzles.", "ground_truth_text": "A soft sizzle of food frying and a scraping noise which followed by a woman with a soft voice saying a few words."}
{"id": 8405, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A dog barks as music plays.", "ground_truth_text": "A dog barks several times, and then music starts to play in the background."}
{"id": 4771, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A woman whistling and tapping on a surface.", "ground_truth_text": "A baby cries and a surface is tapped."}
{"id": 2391, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A large group of people are clapping followed by a man speaking and laughing.", "ground_truth_text": "An audience applause as a man laughs and then speaks."}
{"id": 4317, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nKindly generate a caption for the following audio. ", "generated_text": "A woman speaks followed by a goat bleating.", "ground_truth_text": "A young female voice with goat bleating in the background."}
{"id": 5326, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you come up with a caption for this audio segment? ", "generated_text": "A vehicle is driving by and the wind is blowing.", "ground_truth_text": "Thunder booms in the distance as traffic passes by."}
{"id": 3420, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you provide a caption for this sound recording? ", "generated_text": "A horn is honking while people are talking.", "ground_truth_text": "A horn sounds while people talk in the background."}
{"id": 7749, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A dog barks and growls while children talk.", "ground_truth_text": "Dogs yipping and barking over human chatter."}
{"id": 7948, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you come up with a caption for this audio segment? ", "generated_text": "A metal object is being dragged across a surface followed by a loud bang.", "ground_truth_text": "Some clicking followed by scraping then a loud bang."}
{"id": 5579, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nTranslate the essence of this audio into a caption. ", "generated_text": "A man speaks followed by a metal hit.", "ground_truth_text": "Speaking followed by a bang."}
{"id": 9861, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you provide a caption for this sound recording? ", "generated_text": "A truck engine idling followed by a horn.", "ground_truth_text": "Wind blowing with faint humming of engines followed by a distant horn."}
{"id": 4150, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A woman speaks while metal clinks.", "ground_truth_text": "A woman speaks, metal clinks."}
{"id": 134, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCaption this audio file for me, please. ", "generated_text": "A phone beeps and a man speaks.", "ground_truth_text": "A beep and a tone sounds then an electronic voice talks."}
{"id": 10007, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCraft a caption that encapsulates this audio. ", "generated_text": "A bird caws and caws again.", "ground_truth_text": "Wind noises and seagulls whining."}
{"id": 3932, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A man speaks while birds chirp.", "ground_truth_text": "Bird tweeting in the background and a man talking."}
{"id": 1547, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A horn blares, followed by a man speaking.", "ground_truth_text": "Loud musical horn then vehicle zooming by then male voice."}
{"id": 5263, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you provide a caption for this sound recording? ", "generated_text": "A dog barks and growls and a man speaks.", "ground_truth_text": "Dogs bark and whimper then a woman speaks."}
{"id": 5052, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nDevise a caption that matches the theme of this audio. ", "generated_text": "A man speaks and ducks quack.", "ground_truth_text": "Ducks quack as a woman speaks followed by flapping wings."}
{"id": 3143, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A fly buzzes and then a frog croaks.", "ground_truth_text": "A fly buzzing followed by a gulping sound is repeated, after which a frog croaks."}
{"id": 3282, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you draft a caption for this audio recording? ", "generated_text": "A telephone is ringing and then someone is typing on a keyboard.", "ground_truth_text": "A telephone is ringing and beeping."}
{"id": 4923, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A sizzle followed by a tap.", "ground_truth_text": "Grease sizzles as a condiment is shaken then put down."}
{"id": 4144, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A dog barks followed by a gunshot.", "ground_truth_text": "Pounding hitting sounds followed by a dog barking frantically."}
{"id": 6241, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption best describes the content of this audio? ", "generated_text": "A boat motor runs followed by a man speaking.", "ground_truth_text": "An aircraft flies while wind blows and a man talks."}
{"id": 2281, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCaption this audio file for me, please. ", "generated_text": "A train is moving along the tracks while people are talking.", "ground_truth_text": "Horses and a child speaking in the background."}
{"id": 869, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you draft a caption for this audio recording? ", "generated_text": "A woman laughs and speaks while people laugh in the background.", "ground_truth_text": "A man speaks when someone laughs quickly as the wind blows and ducks are quacking from a distance followed by laughter and a man speaks."}
{"id": 6129, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCome up with a fitting caption for this audio track. ", "generated_text": "A dog growling followed by a hiss and a loud engine revving.", "ground_truth_text": "A frog croaks and a whoosh goes off."}
{"id": 7864, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nDevise a caption that matches the theme of this audio. ", "generated_text": "A man speaks and a spray can is used.", "ground_truth_text": "A man speaks followed by a loud hiss."}
{"id": 3871, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you draft a caption for this audio recording? ", "generated_text": "A cat meows and a man speaks.", "ground_truth_text": "A cat is meowing softly several times then a man speaks."}
{"id": 4153, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCome up with a fitting caption for this audio track. ", "generated_text": "A man speaking while water is splashing.", "ground_truth_text": "A man speaks with some sizzling."}
{"id": 7242, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption would you suggest for this audio? ", "generated_text": "A man speaks and a baby cries.", "ground_truth_text": "A woman talking with snoring in the background."}
{"id": 2458, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A large group of people are cheering and a car is driving by.", "ground_truth_text": "People are talking and cheering then a vehicle speeds off loudly."}
{"id": 6881, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption would you suggest for this audio? ", "generated_text": "A telephone rings.", "ground_truth_text": "Telephone is ringing loudly."}
{"id": 619, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCome up with a fitting caption for this audio track. ", "generated_text": "A woman laughs and a man speaks while a whistle blows.", "ground_truth_text": "A woman is screaming loudly while duck is quacking in background."}
{"id": 2956, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCome up with a fitting caption for this audio track. ", "generated_text": "A woman speaks while a bird chirps and a plate is moved.", "ground_truth_text": "A woman talks while in the background there is the splashing of ducks in the water."}
{"id": 9193, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A machine gun is fired.", "ground_truth_text": "An automatic gunfire is triggered for a long period."}
{"id": 4298, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nKindly generate a caption for the following audio. ", "generated_text": "A man speaks while a bird squawks and a dog barks.", "ground_truth_text": "Rain and hail hit a window while a person speaks."}
{"id": 1483, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCaption this audio file for me, please. ", "generated_text": "A bird is chirping and whistling.", "ground_truth_text": "Whistling with birds in the background."}
{"id": 5009, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nTranslate the essence of this audio into a caption. ", "generated_text": "A man speaks followed by a whistle.", "ground_truth_text": "A man talks and then whistles."}
{"id": 2056, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nKindly generate a caption for the following audio. ", "generated_text": "A bird chirps, then an explosion occurs.", "ground_truth_text": "An explosion take place then birds chirp while an engine runs."}
{"id": 1874, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nTranslate the essence of this audio into a caption. ", "generated_text": "A helicopter is flying nearby.", "ground_truth_text": "A motorboat is cruising on the water."}
{"id": 9736, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A man speaks while food sizzles.", "ground_truth_text": "A man talking followed by clanking against a metal surface as food and oil sizzle."}
{"id": 3712, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A bell rings and a clock ticks.", "ground_truth_text": "An old fashioned bell rings and at the end a click and the ringing stops."}
{"id": 4847, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A motor starts up and then stops.", "ground_truth_text": "An engine running followed by revving and then idling with faint beeping in the background."}
{"id": 8816, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you provide a caption for this sound recording? ", "generated_text": "A train horn blares and people laugh.", "ground_truth_text": "Loud boat engine passing and then laughing."}
{"id": 2389, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you draft a caption for this audio recording? ", "generated_text": "A man is speaking and a bird is chirping.", "ground_truth_text": "Running water with male speech and bird calls."}
{"id": 5542, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCraft a caption that encapsulates this audio. ", "generated_text": "A cat meows and growls followed by a woman speaking.", "ground_truth_text": "A cat meows, followed by a person talking, followed by several meows."}
{"id": 10045, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCraft a caption that encapsulates this audio. ", "generated_text": "A man speaks followed by a large explosion.", "ground_truth_text": "A man speaks and an object bursts."}
{"id": 8153, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A strong wind blows and birds chirp as people talk in the distance.", "ground_truth_text": "Cars pass by as the wind blows heavily and birds chirp followed by multiple people communicating from a distance."}
{"id": 2906, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A man speaks and a computer beeps.", "ground_truth_text": "A computer voice is talking and beeps sound then a man talks at the same time as the computer voice, more beeps and the man gets the last word."}
{"id": 5355, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A man speaking followed by a door closing.", "ground_truth_text": "An adult male speaking followed by a sliding door shutting."}
{"id": 8397, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A telephone ringing.", "ground_truth_text": "Different telephone ringings."}
{"id": 8122, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption best describes the content of this audio? ", "generated_text": "A motor is running.", "ground_truth_text": "A small engine runs continuously."}
{"id": 5103, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCome up with a fitting caption for this audio track. ", "generated_text": "A man speaking followed by a dog barking.", "ground_truth_text": "A woman is speaking and animals barking."}
{"id": 6025, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A car horn blares and a child yells.", "ground_truth_text": "A police car accelerates uses its siren briefly and squeals its tires."}
{"id": 8576, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A car engine revs up and then a door slams.", "ground_truth_text": "An engine starts up and then revs every second."}
{"id": 5884, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCraft a caption that encapsulates this audio. ", "generated_text": "A series of beeps followed by a series of ticking sounds.", "ground_truth_text": "Something is beeping and another object is stomping around."}
{"id": 3482, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCreate a caption that summarizes this audio. ", "generated_text": "A sheep bleats and a bird chirps.", "ground_truth_text": "A goat baaing followed by a person softly talking and heavy footsteps on dirt as a bird chirps in the background."}
{"id": 9331, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A woman sneezes twice.", "ground_truth_text": "A person sneezing."}
{"id": 1917, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nDevise a caption that matches the theme of this audio. ", "generated_text": "A metal clang followed by a man speaking.", "ground_truth_text": "Silverware clinking and man speaking."}
{"id": 10088, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A man is speaking while a dog is barking and growling.", "ground_truth_text": "A man shouts as a horse runs with a dog barking in the background."}
{"id": 2627, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCreate a caption that summarizes this audio. ", "generated_text": "A bird is chirping while the wind is blowing.", "ground_truth_text": "Wind blowing, birds chirp, some horse footsteps on dirt."}
{"id": 1502, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCaption this audio file for me, please. ", "generated_text": "A child speaks while someone types on a keyboard.", "ground_truth_text": "A male child speaks and plastic crumples and crinkles."}
{"id": 3740, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A man speaking followed by a woman speaking and a goat bleating.", "ground_truth_text": "Children and man with sheep sounds."}
{"id": 1857, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you draft a caption for this audio recording? ", "generated_text": "A woman talks and a bird chirps.", "ground_truth_text": "A woman speaks as birds chirp."}
{"id": 5327, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A woman yells and then an explosion occurs.", "ground_truth_text": "Yelling followed by a pop and silence."}
{"id": 1375, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A man speaks and laughs while a bird chirps.", "ground_truth_text": "Men speaking with distant traffic then sprays and men laughing."}
{"id": 3042, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A person burps, then laughs, then burps again.", "ground_truth_text": "A girl burps loudly and other girls laugh, complain and talk."}
{"id": 3062, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption would you suggest for this audio? ", "generated_text": "A baby is crying.", "ground_truth_text": "A baby continuously crying."}
{"id": 6900, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption would you suggest for this audio? ", "generated_text": "A machine gun is firing rapidly.", "ground_truth_text": "A sequence of sustained machine gun fire followed by shorter bursts of rapid gunfire."}
{"id": 7914, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nGenerate an audio caption from this sound file. ", "generated_text": "A train horn blows followed by people talking and laughing.", "ground_truth_text": "A train horn blares loudly several times as people speak and laugh."}
{"id": 9641, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A bus passes by and a person yells.", "ground_truth_text": "N engine followed by a hiss and faint honks of a horn as traffic passes."}
{"id": 9499, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A dog barks, then a knock is heard, followed by music.", "ground_truth_text": "Pigeon cooing, birds chirping and then drum beats."}
{"id": 6537, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCaption this audio file for me, please. ", "generated_text": "A car passes by and then a horn honks followed by speech.", "ground_truth_text": "A vehicle driving by quickly followed by honking and distant murmuring."}
{"id": 2500, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCome up with a fitting caption for this audio track. ", "generated_text": "A vehicle revs its engine and then a person yells.", "ground_truth_text": "A motor vehicle engine is running and revving, tires squeal, an adult male yell, a blast occurs, and metal clangs."}
{"id": 8862, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A vehicle engine revs up and then a man speaks and laughs.", "ground_truth_text": "Engines are revving followed by an engine idling with people talking and giggling."}
{"id": 5050, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nPlease suggest a caption for the audio I've shared. ", "generated_text": "A strong wind blows and water splashes.", "ground_truth_text": "Splashing water accompanied by wind noises."}
{"id": 2050, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nDevise a caption that matches the theme of this audio. ", "generated_text": "A man speaks and then sneezes.", "ground_truth_text": "A woman talks and someone belches loudly."}
{"id": 10098, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption best describes the content of this audio? ", "generated_text": "A man speaks while a door slams.", "ground_truth_text": "A man is speaking and opens a door."}
{"id": 10206, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption would you suggest for this audio? ", "generated_text": "A man speaks followed by beeping.", "ground_truth_text": "A man speaks while electronics beeps."}
{"id": 10109, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCaption this audio file for me, please. ", "generated_text": "A man speaking followed by a car horn.", "ground_truth_text": "A man is speaking and a car horn honks."}
{"id": 9166, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A woman speaks followed by a man speaking and then a bell ringing.", "ground_truth_text": "A man speaking briefly followed by woman speaking alongside an electronic bell dinging with another man speaking then a seagull squawking."}
{"id": 3823, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nKindly generate a caption for the following audio. ", "generated_text": "A woman is speaking over music and a man is speaking over music.", "ground_truth_text": "People loudly talking with a bell ringing."}
{"id": 10137, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nGenerate an audio caption from this sound file. ", "generated_text": "A woman speaks and pans are clanged.", "ground_truth_text": "A woman talks while tapping plastic and wood objects."}
{"id": 3231, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCaption this audio file for me, please. ", "generated_text": "A crowd is talking and a motorcycle is revving its engine.", "ground_truth_text": "A crowd of people talking with traffic ambiance in the background followed by a vehicle engine revving."}
{"id": 10406, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCraft a caption that encapsulates this audio. ", "generated_text": "A bird whistles and then a person whistles.", "ground_truth_text": "A bird whistle, and then a person whistles."}
{"id": 10287, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A dog snoring and then barking.", "ground_truth_text": "A soft, breathy snore in the foreground, while a child yells distantly in the background."}
{"id": 10514, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A helicopter is flying above.", "ground_truth_text": "Pressured air blowing."}
{"id": 11241, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nTranslate the essence of this audio into a caption. ", "generated_text": "A vehicle engine starts up and revs.", "ground_truth_text": "Clanking followed by engine starting and revving."}
{"id": 10720, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nKindly generate a caption for the following audio. ", "generated_text": "A wind blowing followed by a metal clank.", "ground_truth_text": "The wind is blowing, rustling is present, and metal is clanging."}
{"id": 11082, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCreate a caption that summarizes this audio. ", "generated_text": "A woman speaking followed by a siren.", "ground_truth_text": "White noise an woman speaking followed by whistling."}
{"id": 11406, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you come up with a caption for this audio segment? ", "generated_text": "A man speaks as a train passes by.", "ground_truth_text": "Waves splashing, then a woman speaks."}
{"id": 11411, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCome up with a fitting caption for this audio track. ", "generated_text": "A man speaking followed by a loud thunder.", "ground_truth_text": "A woman and a men talking into a microphone as the waves are heard coming from the ocean."}
{"id": 11673, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A truck engine runs and a horn honks.", "ground_truth_text": "Motorcycle engine runs in idle followed by a horn beeping."}
{"id": 10565, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCaption this audio file for me, please. ", "generated_text": "A child laughing and music playing, followed by a child speaking.", "ground_truth_text": "Laughter followed by music and a child speaking."}
{"id": 10931, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you come up with a caption for this audio segment? ", "generated_text": "A man speaks followed by an explosion and a fire burning.", "ground_truth_text": "Someone speaks then an explosion occurs."}
{"id": 11445, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A toilet is flushed and water is running.", "ground_truth_text": "A toilet flushing followed by a door slamming open."}
{"id": 10586, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWhat caption would you suggest for this audio? ", "generated_text": "A helicopter is flying nearby while a man and a woman are talking.", "ground_truth_text": "A man talking in the background as a vehicle horn honks in the distance followed by a faint emergency siren going off and another man talking in the foreground as wind moderately blows into a microphone."}
{"id": 11989, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you provide a caption for this sound recording? ", "generated_text": "A man speaks while food is being cooked.", "ground_truth_text": "A man speaks followed by sizzling of food in a pan then he speaks again."}
{"id": 12214, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A wind blows and a vehicle skids.", "ground_truth_text": "The wind is blowing, an engine runs, and tires squeal."}
{"id": 12172, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nTranslate the essence of this audio into a caption. ", "generated_text": "A cat meows and people laugh.", "ground_truth_text": "A cat is caterwauling while several people laugh."}
{"id": 12179, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you provide a caption for this sound recording? ", "generated_text": "A train horn blows followed by a train whistle.", "ground_truth_text": "A large horn followed by a chime."}
{"id": 12219, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A cat meows and a child speaks.", "ground_truth_text": "An animal groans then a child speaks."}
{"id": 12547, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A person burps and then laughs.", "ground_truth_text": "People laughing with something snoring."}
{"id": 11984, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you draft a caption for this audio recording? ", "generated_text": "A man speaks followed by a loud explosion and people screaming.", "ground_truth_text": "A man speaks followed by explosions and crashing with others speaking and screaming."}
{"id": 12276, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A sheep bleats, then a man speaks.", "ground_truth_text": "Goat bleating followed by a man speaking."}
{"id": 12536, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nPlease suggest a caption for the audio I've shared. ", "generated_text": "A woman is wailing and people are cheering.", "ground_truth_text": "A slightly, muffled cat meowing is followed by people laughing and loud applause."}
{"id": 12311, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A man speaks followed by a hiss of air.", "ground_truth_text": "A man speaking followed by faucet water flowing and hitting a metallic surface."}
{"id": 12989, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nTranslate the essence of this audio into a caption. ", "generated_text": "A horse neighs and people talk.", "ground_truth_text": "Sheep bleat as a man laughs and a woman speaks while people talking in the background."}
{"id": 12901, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you provide a caption for this sound recording? ", "generated_text": "A man speaks while a horse neighs.", "ground_truth_text": "Bees are buzzing as a man speaks followed by a woman speaking."}
{"id": 13231, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nI need a caption for this piece of audio; can you help? ", "generated_text": "A river is flowing and a man speaks.", "ground_truth_text": "A flowing stream then a man speaks."}
{"id": 13085, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCraft a caption that encapsulates this audio. ", "generated_text": "A clock ticking followed by a whistle blowing.", "ground_truth_text": "Opening clatter with soft quite bees buzzing."}
{"id": 13354, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A train whistle blows followed by people talking.", "ground_truth_text": "Hissing with some rustling and people speaking followed by a train whistle blowing."}
{"id": 13228, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nKindly generate a caption for the following audio. ", "generated_text": "A vehicle is moving fast and people are cheering.", "ground_truth_text": "A vehicle is driving by while people are shouting."}
{"id": 13431, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nDevise a caption that matches the theme of this audio. ", "generated_text": "A child is crying and a woman is speaking.", "ground_truth_text": "A child cries and yells and an adult speaks in response."}
{"id": 13080, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A man speaking followed by a woman speaking and a hissing sound.", "ground_truth_text": "People talk nearby as a sprayer sprays liquid, and then a machine hums in the distance."}
{"id": 13378, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A man speaks followed by a large explosion and people laughing.", "ground_truth_text": "A loud burst followed by men speaking and laughing."}
{"id": 13473, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A horn honks followed by a man speaking.", "ground_truth_text": "Vehicle horn followed by man talking."}
{"id": 13855, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nKindly generate a caption for the following audio. ", "generated_text": "A man talks and whistles.", "ground_truth_text": "A woman is speaking then whistles."}
{"id": 13332, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCraft a caption that encapsulates this audio. ", "generated_text": "A car engine revs up, a woman screams, and a man speaks.", "ground_truth_text": "A high pitched whine with animals bleating and a person talking."}
{"id": 13951, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCreate a caption that summarizes this audio. ", "generated_text": "A cat meows and a child speaks.", "ground_truth_text": "An animal groans then a child speaks."}
{"id": 14057, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nHow would you caption this audio snippet? ", "generated_text": "A baby is crying and laughing.", "ground_truth_text": "Screaming followed by goat noises."}
{"id": 13970, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCreate a caption that summarizes this audio. ", "generated_text": "A woman is speaking while a train is moving nearby.", "ground_truth_text": "Steam hisses nearby as many people talk."}
{"id": 13591, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCould you come up with a caption for this audio segment? ", "generated_text": "A group of people are talking and laughing while a bird is chirping.", "ground_truth_text": "A crowd is talking and horse neighs a couple of times."}
{"id": 14312, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCreate a caption that summarizes this audio. ", "generated_text": "A man talks while a clock ticks.", "ground_truth_text": "A man is speaking while birds chirp in the background."}
{"id": 14716, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nGenerate an audio caption from this sound file. ", "generated_text": "A horn honks twice.", "ground_truth_text": "A car horn honks three times as birds chirp in the distance."}
{"id": 14556, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nWould you generate a caption based on this audio? ", "generated_text": "A man speaks and a whistle blows.", "ground_truth_text": "A person talking with chime sound and a person whistling."}
{"id": 15009, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A train blows its horn and passes by.", "ground_truth_text": "A toy helicopter starting up as wind blows moderately into a microphone."}
{"id": 14268, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nCan you formulate a caption for the given audio piece?", "generated_text": "A man speaks and laughs while a baby laughs and a man snores.", "ground_truth_text": "A man singing before laughing followed by a goat baaing then snorting a few times as a dog barks in the background."}
{"id": 14657, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nListen and generate a caption for the provided audio. ", "generated_text": "A woman speaks and a child speaks while paper crinkles.", "ground_truth_text": "A woman talks nearby, and then a child speaks as paper crinkles."}
{"id": 14989, "lang_q": "en", "lang_g": "en", "lang_o": "en", "original_question": "<audio>\nDevise a caption that matches the theme of this audio. ", "generated_text": "A large engine is running and then a small engine starts up.", "ground_truth_text": "A soft whirring sound."}
|