Refresh: ok @ 2026-06-15T12:31:10Z · Bisect: idle

transformers · integration-test failure triage

Generated 2026-06-13T12:30:56Z · window 2026-06-07 to 2026-06-13 (7 daily runs, ≥5/7 intersection)

TL;DR

Regression-day clustering (historical first-failure)

For every persistent failure we walked the daily CI dataset backwards to find the first day on which it appeared as failing. The table below groups failures by that day; a large bucket likely indicates a fleet-wide regression from a single landed PR.
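The backward walk and the per-day grouping described above can be sketched roughly as follows. This is a minimal illustration, not the dashboard's actual code: the function names, the `history` layout (day to `{test_id: "pass" | "fail"}`), and the rule that a failure lands in the `unknown` bucket when the history runs out before a passing run is seen are all assumptions made for the example. The 5-of-7 threshold mirrors the "≥5/7 intersection" noted in the header.

```python
from collections import Counter
from datetime import date


def persistent_failures(history, window_days, min_days=5):
    """Return test ids that fail on at least `min_days` of `window_days`.

    `history` is a hypothetical layout: {day: {test_id: "pass" | "fail"}}.
    """
    counts = Counter(
        test_id
        for day in window_days
        for test_id, status in history.get(day, {}).items()
        if status == "fail"
    )
    return {test_id for test_id, n in counts.items() if n >= min_days}


def first_failure_day(test_id, history):
    """Walk from the newest day backwards to the start of the failing streak.

    Returns None (assumed to mean "unknown") when no passing run is found
    before the recorded history ends.
    """
    first = None
    for day in sorted(history, reverse=True):
        status = history[day].get(test_id)
        if status == "fail":
            first = day
        elif status == "pass":
            return first
        # Days with no record for this test are skipped.
    return None


def cluster_by_first_failure(test_ids, history):
    """Count persistent failures per first-failure day."""
    buckets = Counter()
    for test_id in test_ids:
        day = first_failure_day(test_id, history)
        buckets[day.isoformat() if day else "unknown"] += 1
    return buckets


# Toy data with made-up test ids: "a" starts failing on 2026-06-09,
# "b" fails on every recorded day, "c" fails only on the last day.
days = [date(2026, 6, d) for d in range(7, 14)]
history = {
    day: {
        "a": "fail" if day >= date(2026, 6, 9) else "pass",
        "b": "fail",
        "c": "fail" if day == days[-1] else "pass",
    }
    for day in days
}
persistent = persistent_failures(history, days)
print(sorted(persistent))
print(cluster_by_first_failure(persistent, history))
```

Under these assumptions, a test that fails on every recorded day has no datable first failure, so a bucket labelled `unknown` would collect exactly such cases.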

first-failure day | failures | share
unknown | 733 | 100.0%

Top regression days — failure breakdown

unknown — 733 failures

Failure-mode mix: output_mismatch 487 · other 153 · OOM 57 · import_or_config 22 · load_error 12 · cuda_runtime 2 · 161 distinct models touched.

model | failures | sample mode | sample trace excerpt
whisper | 42 | other | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same
musicgen_melody | 16 | output_mismatch | (line 1307) AssertionError: Tensor-likes are not close!
generation | 15 | output_mismatch | (line 3215) AssertionError: Lists differ: ['Tel[23 chars]key. Sure, here\'s one for you:\n\nWhy did the[67 chars]s"!'] != ['Tel[23 chars]key. Why did the monkey go to the doctor?…
gemma | 14 | output_mismatch | (line 337) AssertionError: Lists differ: ['Hel[196 chars]tdi 105bhp.\nI have a problem with the engine [37 chars]the'] != ['Hel[196 chars]tdi 110bhp.\nI have a problem with the e…
dac | 12 | output_mismatch | (line 819) AssertionError: Tensor-likes are not close!
edgetam | 12 | import_or_config | (line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
glm46v | 12 | other | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking argumen…
glm_ocr | 12 | other | (line 456) assert [151331, 1513..., 151343, ...] == [59248, 59250...6, 59280, ...]
gemma3 | 10 | output_mismatch | (line 874) AssertionError: 'DynamicSlidingWindowLayer' unexpectedly found in 'DynamicCache(layers=[DynamicSlidingWindowLayer, DynamicSlidingWindowLayer, DynamicSlidingWindowLayer…
gemma3n | 10 | output_mismatch | (line 1196) AssertionError: Lists differ: [' and I find it very relaxing. I also lik[112 chars]re'"] != [" and the people are so friendly. I'm so [93 chars]re'"]
vision_encoder_decoder | 10 | output_mismatch | (line 1352) AssertionError: Tensor-likes are not close!
cohere2_vision | 8 | output_mismatch | (line 687) AssertionError: False is not true : Actual logits: tensor([2.3711, 1.6689, 1.8389, 1.9785, 1.9121], dtype=torch.float16)
… and 149 more models
All 733 failures in this bucket
modelgputestfailure_modedays_seentrace excerpt
bambamultitest_simple_batched_generate_with_paddingOOM7/7(line 779) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 8.00 GiB. GPU 0 has a total capacity of 22.30 GiB of which 3.37 GiB is free. Process 87938 has 18.93 GiB memory in use. Of the allocated memory 1…
bambamultitest_simple_generateOOM7/7(line 779) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 4.00 GiB. GPU 0 has a total capacity of 22.30 GiB of which 3.28 GiB is free. Process 87938 has 19.01 GiB memory in use. Of the allocated memory 1…
bambasingletest_simple_batched_generate_with_paddingOOM7/7(line 779) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 8.00 GiB. GPU 0 has a total capacity of 22.30 GiB of which 3.50 GiB is free. Process 98882 has 18.80 GiB memory in use. Of the allocated memory 1…
bambasingletest_simple_generateOOM7/7(line 779) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 4.00 GiB. GPU 0 has a total capacity of 22.30 GiB of which 3.42 GiB is free. Process 98882 has 18.88 GiB memory in use. Of the allocated memory 1…
cohere2_visionmultitest_model_integration_generate_chat_templateOOM6/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 5.86 GiB. GPU 1 has a total capacity of 22.30 GiB of which 1.48 GiB is free. Process 270159 has 20.82 GiB memory in use. Of the allocated memory …
cohere2_visionsingletest_model_integration_generate_chat_templateOOM6/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 5.86 GiB. GPU 0 has a total capacity of 22.30 GiB of which 720.69 MiB is free. Process 70312 has 21.59 GiB memory in use. Of the allocated memory…
cwmmultitest_cwm_sliding_window_long_sequenceOOM7/7(line 255) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 102.00 MiB. GPU 1 has a total capacity of 22.30 GiB of which 46.69 MiB is free. Process 795802 has 22.25 GiB memory in use. Of the allocated memo…
deepseek_v2multitest_batch_fa2OOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 704.00 KiB is free. Process 381415 has 22.29 GiB memory in use. Of the allocated memo…
deepseek_v2multitest_deepseek_v2_liteOOM7/7(line 5095) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 21.10 GiB. GPU 0 has a total capacity of 22.30 GiB of which 140.69 MiB is free. Process 381415 has 22.16 GiB memory in use. Of the allocated mem…
deepseek_v2multitest_logits_eagerOOM7/7(line 5095) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 21.10 GiB. GPU 0 has a total capacity of 22.30 GiB of which 140.69 MiB is free. Process 381415 has 22.16 GiB memory in use. Of the allocated mem…
deepseek_v2singletest_batch_fa2OOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 2.69 MiB is free. Process 183760 has 22.29 GiB memory in use. Of the allocated memory…
deepseek_v2singletest_deepseek_v2_liteOOM7/7(line 5095) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 21.10 GiB. GPU 0 has a total capacity of 22.30 GiB of which 222.69 MiB is free. Process 183760 has 22.08 GiB memory in use. Of the allocated mem…
deepseek_v2singletest_logits_eagerOOM7/7(line 5095) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 21.10 GiB. GPU 0 has a total capacity of 22.30 GiB of which 222.69 MiB is free. Process 183760 has 22.08 GiB memory in use. Of the allocated mem…
deepseek_vl_hybridmultitest_model_text_generation_batchedOOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 86.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 38.69 MiB is free. Process 587134 has 22.26 GiB memory in use. Of the allocated memo…
deepseek_vl_hybridmultitest_model_text_generation_with_multi_imageOOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 18.69 MiB is free. Process 587134 has 22.28 GiB memory in use. Of the allocated memo…
emu3multitest_model_generationOOM7/7(line 92) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 2.10 GiB. GPU 0 has a total capacity of 22.30 GiB of which 694.69 MiB is free. Process 322396 has 21.62 GiB memory in use. Of the allocated memory…
emu3multitest_model_generation_batchedOOM7/7(line 5095) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 9.38 GiB. GPU 0 has a total capacity of 22.30 GiB of which 1.00 GiB is free. Process 322396 has 21.29 GiB memory in use. Of the allocated memory…
emu3multitest_model_generation_multi_imageOOM7/7(line 5095) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 9.71 GiB. GPU 0 has a total capacity of 22.30 GiB of which 1.00 GiB is free. Process 322396 has 21.29 GiB memory in use. Of the allocated memory…
emu3singletest_model_generation_batchedOOM7/7(line 2397) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 458.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 378.69 MiB is free. Process 361351 has 21.93 GiB memory in use. Of the allocated me…
emu3singletest_model_generation_multi_imageOOM7/7(line 5095) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 9.66 GiB. GPU 0 has a total capacity of 22.30 GiB of which 376.69 MiB is free. Process 361351 has 21.93 GiB memory in use. Of the allocated memo…
exaone4multitest_model_generation_beyond_sliding_windowOOM7/7(line 291) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 220.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 18.69 MiB is free. Process 1249037 has 22.28 GiB memory in use. Of the allocated mem…
exaone4multitest_model_generation_eagerOOM7/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1000.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 992.69 MiB is free. Process 1249037 has 21.33 GiB memory in use. Of the allocated m…
exaone4multitest_model_generation_sdpaOOM7/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1000.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 992.69 MiB is free. Process 1249037 has 21.33 GiB memory in use. Of the allocated m…
exaone4multitest_model_logitsOOM7/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1000.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 994.69 MiB is free. Process 1249037 has 21.32 GiB memory in use. Of the allocated m…
exaone4_5multitest_model_generation_image_textOOM7/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.46 GiB. GPU 0 has a total capacity of 22.30 GiB of which 1.46 GiB is free. Process 343938 has 20.83 GiB memory in use. Of the allocated memory …
exaone4_5multitest_model_generation_text_onlyOOM7/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.46 GiB. GPU 0 has a total capacity of 22.30 GiB of which 1.46 GiB is free. Process 343938 has 20.84 GiB memory in use. Of the allocated memory …
gemma4multitest_export_text_onlyOOM7/7(line 2301) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 4.38 GiB. GPU 0 has a total capacity of 22.30 GiB of which 2.67 GiB is free. Process 455140 has 19.62 GiB memory in use. Of the allocated memory…
gemma4singletest_export_text_onlyOOM7/7(line 2301) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 4.38 GiB. GPU 0 has a total capacity of 22.30 GiB of which 2.80 GiB is free. Process 433771 has 19.49 GiB memory in use. Of the allocated memory…
glmmultitest_model_9b_fp16OOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 32.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 22.69 MiB is free. Process 1362136 has 22.27 GiB memory in use. Of the allocated mem…
glmmultitest_model_9b_sdpaOOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.16 GiB. GPU 0 has a total capacity of 22.30 GiB of which 22.69 MiB is free. Process 1362136 has 22.27 GiB memory in use. Of the allocated memo…
glmsingletest_model_9b_fp16OOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 214.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 74.69 MiB is free. Process 1317689 has 22.22 GiB memory in use. Of the allocated me…
glmsingletest_model_9b_sdpaOOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.16 GiB. GPU 0 has a total capacity of 22.30 GiB of which 74.69 MiB is free. Process 1317689 has 22.22 GiB memory in use. Of the allocated memo…
glm4_moemultitest_compile_static_cacheOOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 120.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 118.69 MiB is free. Process 535825 has 22.18 GiB memory in use. Of the allocated mem…
glm4_moesingletest_compile_static_cacheOOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 120.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 74.69 MiB is free. Process 1341191 has 22.22 GiB memory in use. Of the allocated mem…
glm4_moe_litemultitest_compile_static_cacheOOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 8.69 MiB is free. Process 574492 has 22.29 GiB memory in use. Of the allocated memory…
glm4_moe_litesingletest_compile_static_cacheOOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 12.69 MiB is free. Process 348066 has 22.28 GiB memory in use. Of the allocated memor…
llavamultitest_pixtralOOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 40.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 6.69 MiB is free. Process 1522550 has 22.29 GiB memory in use. Of the allocated memor…
llavamultitest_pixtral_4bitOOM7/7(line 5095) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 7.77 GiB. GPU 0 has a total capacity of 22.30 GiB of which 6.69 MiB is free. Process 1522550 has 22.29 GiB memory in use. Of the allocated memor…
llavasingletest_pixtralOOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 140.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 86.69 MiB is free. Process 1603048 has 22.21 GiB memory in use. Of the allocated mem…
llavasingletest_pixtral_4bitOOM7/7(line 5095) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 7.77 GiB. GPU 0 has a total capacity of 22.30 GiB of which 36.69 MiB is free. Process 1603048 has 22.26 GiB memory in use. Of the allocated memo…
mamba2multitest_batched_equivalence_with_cacheOOM7/7(line 532) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 12.00 GiB. GPU 0 has a total capacity of 22.30 GiB of which 8.03 GiB is free. Process 732234 has 14.26 GiB memory in use. Of the allocated memory…
mamba2multitest_batched_equivalence_without_cacheOOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 64.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 46.69 MiB is free. Process 732234 has 22.25 GiB memory in use. Of the allocated memo…
mamba2multitest_mamba2_mixer_train_vs_eval_equivalenceOOM7/7(line 370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 12.69 MiB is free. Process 732234 has 22.28 GiB memory in use. Of the allocated memor…
mamba2multitest_simple_generateOOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 256.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 12.69 MiB is free. Process 732234 has 22.28 GiB memory in use. Of the allocated mem…
mamba2singletest_batched_equivalence_with_cacheOOM7/7(line 532) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 12.00 GiB. GPU 0 has a total capacity of 22.30 GiB of which 8.03 GiB is free. Process 1646561 has 14.27 GiB memory in use. Of the allocated memor…
mamba2singletest_batched_equivalence_without_cacheOOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 64.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 46.69 MiB is free. Process 1646561 has 22.25 GiB memory in use. Of the allocated mem…
mamba2singletest_mamba2_mixer_train_vs_eval_equivalenceOOM7/7(line 370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 12.69 MiB is free. Process 1646561 has 22.28 GiB memory in use. Of the allocated memo…
mamba2singletest_simple_generateOOM7/7(line 1370) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 256.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 12.69 MiB is free. Process 1646561 has 22.28 GiB memory in use. Of the allocated me…
moshimultitest_moshiko_greedy_unconditional_fp32OOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 34.00 MiB. GPU 1 has a total capacity of 22.30 GiB of which 16.69 MiB is free. Process 127291 has 22.28 GiB memory in use. Of the allocated memor…
olmomultitest_model_7b_logitsOOM6/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 172.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 116.69 MiB is free. Process 565912 has 22.18 GiB memory in use. Of the allocated mem…
olmosingletest_model_7b_logitsOOM6/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 64.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 30.69 MiB is free. Process 193520 has 22.27 GiB memory in use. Of the allocated memor…
phimoesingletest_model_phimoe_instruct_logitsOOM6/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.56 GiB. GPU 0 has a total capacity of 22.30 GiB of which 814.69 MiB is free. Process 329492 has 21.50 GiB memory in use. Of the allocated memor…
phimoesingletest_phimoe_instruct_generationOOM5/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.56 GiB. GPU 0 has a total capacity of 22.30 GiB of which 812.69 MiB is free. Process 329492 has 21.50 GiB memory in use. Of the allocated memor…
phimoesingletest_phimoe_instruct_with_static_cacheOOM5/7(line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.56 GiB. GPU 0 has a total capacity of 22.30 GiB of which 812.69 MiB is free. Process 329492 has 21.50 GiB memory in use. Of the allocated memor…
pi0multitest_train_pi0_base_liberoOOM7/7(line 785) torch.OutOfMemoryError: Caught OutOfMemoryError in replica 0 on device 0.
pi0singletest_train_pi0_base_liberoOOM7/7(line 193) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 18.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 6.69 MiB is free. Process 777692 has 22.29 GiB memory in use. Of the allocated memory…
qwen3_vl_moemultitest_small_model_integration_test_expandOOM7/7(line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 768.00 MiB. GPU 1 has a total capacity of 22.30 GiB of which 240.69 MiB is free. Process 643790 has 22.06 GiB memory in use. Of the allocated mem…
generationmultitest_validate_assistantcuda_runtime7/7(line 1909) torch.AcceleratorError: CUDA error: device-side assert triggered
generationsingletest_validate_assistantcuda_runtime7/7(line 1909) torch.AcceleratorError: CUDA error: device-side assert triggered
cwmmultitest_cwm_integrationimport_or_config7/7(line 1968) AttributeError: 'CwmDecoderLayer' object has no attribute 'attention_type'
cwmsingletest_cwm_integrationimport_or_config6/7(line 1968) AttributeError: 'CwmDecoderLayer' object has no attribute 'attention_type'
edgetammultitest_inference_batched_images_batched_boxesimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetammultitest_inference_mask_generation_batched_images_batched_points_multi_pointsimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetammultitest_inference_mask_generation_batched_images_multi_pointsimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetammultitest_inference_mask_generation_from_existing_points_and_maskimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetammultitest_inference_mask_generation_one_point_multimaskimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetammultitest_inference_mask_generation_one_point_no_multimaskimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetamsingletest_inference_batched_images_batched_boxesimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetamsingletest_inference_mask_generation_batched_images_batched_points_multi_pointsimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetamsingletest_inference_mask_generation_batched_images_multi_pointsimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetamsingletest_inference_mask_generation_from_existing_points_and_maskimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetamsingletest_inference_mask_generation_one_point_multimaskimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
edgetamsingletest_inference_mask_generation_one_point_no_multimaskimport_or_config7/7(line 249) TypeError: FeatureListNet.forward() got an unexpected keyword argument 'original_sizes'
emu3multitest_model_generate_imagesimport_or_config7/7(line 1968) AttributeError: 'Emu3ForConditionalGeneration' object has no attribute 'vocabulary_mapping'
emu3singletest_model_generate_imagesimport_or_config7/7(line 1968) AttributeError: 'Emu3ForConditionalGeneration' object has no attribute 'vocabulary_mapping'
generationmultitest_green_red_watermark_generationimport_or_config7/7(line 665) AttributeError: 'dict' object has no attribute 'validate'
generationsingletest_green_red_watermark_generationimport_or_config7/7(line 665) AttributeError: 'dict' object has no attribute 'validate'
kosmos2multitest_inference_interpolate_pos_encodingimport_or_config7/7(line 777) AttributeError: 'NoneType' object has no attribute 'last_hidden_state'
kosmos2singletest_inference_interpolate_pos_encodingimport_or_config7/7(line 777) AttributeError: 'NoneType' object has no attribute 'last_hidden_state'
nemotronmultitest_nemotron_8b_generation_fa2import_or_config6/7(line 1725) ImportError: FlashAttention2 has been toggled on, but it cannot be used due to the following error: the package for FlashAttention2 doesn't seem to be installed.
nemotronsingletest_nemotron_8b_generation_fa2import_or_config7/7(line 1725) ImportError: FlashAttention2 has been toggled on, but it cannot be used due to the following error: the package for FlashAttention2 doesn't seem to be installed.
deepseek_v4multitest_v4_flash_dequantized_chat_seven_promptsload_error5/7(line 501) ValueError: The current `device_map` had weights offloaded to the disk, which needed to be re-saved. This is either because the weights are not in `safetensors` format, or because the model uses an internal …
deepseek_v4multitest_v4_flash_dequantized_generationload_error5/7(line 501) ValueError: The current `device_map` had weights offloaded to the disk, which needed to be re-saved. This is either because the weights are not in `safetensors` format, or because the model uses an internal …
deepseek_v4singletest_v4_flash_dequantized_chat_seven_promptsload_error6/7(line 501) ValueError: The current `device_map` had weights offloaded to the disk, which needed to be re-saved. This is either because the weights are not in `safetensors` format, or because the model uses an internal …
deepseek_v4singletest_v4_flash_dequantized_generationload_error6/7(line 501) ValueError: The current `device_map` had weights offloaded to the disk, which needed to be re-saved. This is either because the weights are not in `safetensors` format, or because the model uses an internal …
jais2multitest_model_generationload_error7/7(line 503) OSError: You are trying to access a gated repo.
jais2multitest_model_logitsload_error7/7(line 503) OSError: You are trying to access a gated repo.
jais2singletest_model_generationload_error7/7(line 503) OSError: You are trying to access a gated repo.
jais2singletest_model_logitsload_error7/7(line 503) OSError: You are trying to access a gated repo.
qwen3_moemultitest_model_15b_a2b_generationload_error7/7(line 74) ValueError: Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modul…
qwen3_moemultitest_model_15b_a2b_logitsload_error7/7(line 74) ValueError: Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modul…
qwen3_moemultitest_model_15b_a2b_long_prompt_sdpaload_error7/7(line 74) ValueError: Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modul…
qwen3_moemultitest_speculative_generationload_error7/7(line 74) ValueError: Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modul…
audioflamingo3multitest_fixture_batched_matchesother7/7(line 2935) RuntimeError: expected scalar type Float but found BFloat16
audioflamingo3multitest_fixture_single_matchesother7/7(line 2935) RuntimeError: expected scalar type Float but found BFloat16
audioflamingo3singletest_fixture_batched_matchesother7/7(line 2935) RuntimeError: expected scalar type Float but found BFloat16
audioflamingo3singletest_fixture_single_matchesother7/7(line 2935) RuntimeError: expected scalar type Float but found BFloat16
bitnetmultitest_model_generationother7/7(line 309) RuntimeError: expected mat1 and mat2 to have the same dtype, but got: c10::BFloat16 != unsigned char
bitnetmultitest_model_logitsother7/7(line 309) RuntimeError: expected m1 and m2 to have the same dtype, but got: c10::BFloat16 != unsigned char
bitnetsingletest_model_generationother7/7(line 309) RuntimeError: expected mat1 and mat2 to have the same dtype, but got: c10::BFloat16 != unsigned char
bitnetsingletest_model_logitsother7/7(line 309) RuntimeError: expected m1 and m2 to have the same dtype, but got: c10::BFloat16 != unsigned char
bltmultitest_model_logitsother5/7(line 2567) RuntimeError: Expected all tensors to be on the same device, but got index is on cuda:0, different from other tensors on cuda:1 (when checking argument in method wrapper_CUDA__index_select)
bltmultitest_model_logits_bf16other5/7(line 2567) RuntimeError: Expected all tensors to be on the same device, but got index is on cuda:0, different from other tensors on cuda:1 (when checking argument in method wrapper_CUDA__index_select)
bridgetowermultitest_constrastive_learningother7/7(line 421) ValueError: Unrecognized model in BridgeTower/bridgetower-large-itm-mlm-itc. Should have a `model_type` key in its config.json.
bridgetowermultitest_image_and_text_retrievalother7/7(line 421) ValueError: Unrecognized model in BridgeTower/bridgetower-base-itm-mlm. Should have a `model_type` key in its config.json.
bridgetowermultitest_masked_language_modelingother7/7(line 421) ValueError: Unrecognized model in BridgeTower/bridgetower-base-itm-mlm. Should have a `model_type` key in its config.json.
bridgetowersingletest_constrastive_learningother7/7(line 421) ValueError: Unrecognized model in BridgeTower/bridgetower-large-itm-mlm-itc. Should have a `model_type` key in its config.json.
bridgetowersingletest_image_and_text_retrievalother7/7(line 421) ValueError: Unrecognized model in BridgeTower/bridgetower-base-itm-mlm. Should have a `model_type` key in its config.json.
bridgetowersingletest_masked_language_modelingother7/7(line 421) ValueError: Unrecognized model in BridgeTower/bridgetower-base-itm-mlm. Should have a `model_type` key in its config.json.
clvpmultitest_full_model_integrationother7/7(line 1310) RuntimeError: The expanded size of the tensor (2) must match the existing size (3) at non-singleton dimension 0. Target sizes: [2]. Tensor sizes: [3]
clvpsingletest_full_model_integrationother7/7(line 1310) RuntimeError: The expanded size of the tensor (2) must match the existing size (3) at non-singleton dimension 0. Target sizes: [2]. Tensor sizes: [3]
cohere2_visionmultitest_model_forward_visionother6/7(line 488) OSError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/root/repos/moe/engines/command_a+_bf16'. Use `repo_type` argument if needed.
cohere2_visionmultitest_model_generate_visionother6/7(line 488) OSError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/root/repos/moe/engines/command_a+_bf16'. Use `repo_type` argument if needed.
cohere2_visionsingletest_model_forward_visionother6/7(line 488) OSError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/root/repos/moe/engines/command_a+_bf16'. Use `repo_type` argument if needed.
cohere2_visionsingletest_model_generate_visionother6/7(line 488) OSError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/root/repos/moe/engines/command_a+_bf16'. Use `repo_type` argument if needed.
colqwen2multitest_model_integration_testother7/7(line 110) ValueError: images must be an image, list of images or list of list of images
colqwen2singletest_model_integration_testother7/7(line 110) ValueError: images must be an image, list of images or list of list of images
dbrxmultitest_tiny_model_logitsother7/7(line 146) huggingface_hub.errors.StrictDataclassFieldValidationError: Validation error for field 'moe_jitter_eps':
dbrxsingletest_tiny_model_logitsother7/7(line 146) huggingface_hub.errors.StrictDataclassFieldValidationError: Validation error for field 'moe_jitter_eps':
deepseek_v3singletest_compile_static_cacheother7/7(line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report!
deepseek_vlmultitest_model_text_generationother7/7(line 67) RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
deepseek_vlmultitest_model_text_generation_batchedother7/7(line 67) RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
deepseek_vlmultitest_model_text_generation_with_multi_imageother7/7(line 67) RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
deepseek_vl_hybridmultitest_model_text_generationother7/7(line 67) RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
deepseek_vl_hybridsingletest_model_text_generation_with_multi_imageother7/7(line 468) RuntimeError: You can't move a model that has some modules offloaded to cpu or disk.
flex_olmomultitest_model_7b_greedy_generationother7/7(line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report!
gemma2multitest_model_2b_pipeline_bf16_flex_attentionother7/7Cannot retrieve error message.
gemma2singletest_model_2b_pipeline_bf16_flex_attentionother7/7Cannot retrieve error message.
generationmultitest_cache_device_map_with_vision_layer_device_mapother7/7(line 1632) ValueError: The device_map provided does not give any device for the following parameters: model.vision_tower.embeddings.patch_embedding.weight, model.vision_tower.embeddings.patch_embedding.bias, model.vis…
generationmultitest_generate_multi_accelerator_causal_maskother7/7(line 1632) ValueError: The device_map provided does not give any device for the following parameters: model.visual.patch_embed.proj.weight, model.visual.blocks.0.norm1.weight, model.visual.blocks.0.norm1.bias, model.v…
generationsingletest_cache_device_map_with_vision_layer_device_mapother7/7(line 1632) ValueError: The device_map provided does not give any device for the following parameters: model.vision_tower.embeddings.patch_embedding.weight, model.vision_tower.embeddings.patch_embedding.bias, model.vis…
gitmultitest_inference_image_captioningother7/7(line 4194) UnboundLocalError: local variable 'output' referenced before assignment
gitsingletest_inference_image_captioningother7/7(line 4194) UnboundLocalError: local variable 'output' referenced before assignment
| glm46v | multi | test_small_model_integration_test | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | multi | test_small_model_integration_test_batch | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | multi | test_small_model_integration_test_batch_different_resolutions | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | multi | test_small_model_integration_test_batch_wo_image | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | multi | test_small_model_integration_test_expand | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | multi | test_small_model_integration_test_with_video | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got torch.cuda.HalfTensor instead (while checking arguments for embedding) |
| glm46v | single | test_small_model_integration_test | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | single | test_small_model_integration_test_batch | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | single | test_small_model_integration_test_batch_different_resolutions | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | single | test_small_model_integration_test_batch_wo_image | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | single | test_small_model_integration_test_expand | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got CUDABFloat16Type instead (while checking arguments for embedding) |
| glm46v | single | test_small_model_integration_test_with_video | other | 7/7 | (line 2567) RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got torch.cuda.HalfTensor instead (while checking arguments for embedding) |
| glm4v_moe | multi | test_small_model_integration_test_batch | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| glm4v_moe | multi | test_small_model_integration_test_with_video | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| glm_ocr | multi | test_small_model_integration_test | other | 7/7 | (line 456) assert [151331, 1513..., 151343, ...] == [59248, 59250...6, 59280, ...] |
| glm_ocr | single | test_small_model_integration_test | other | 7/7 | (line 456) assert [151331, 1513..., 151343, ...] == [59248, 59250...6, 59280, ...] |
| hunyuan_v1_moe | multi | test_model_generation | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| hyperclovax | multi | test_model_seed_think_14b_bf16 | other | 7/7 | (line 1319) ValueError: There are one or more stop strings, either in the arguments to `generate` or in the model's generation config, but we could not locate a tokenizer. When generating with stop strings, you must pa… |
| hyperclovax | single | test_model_seed_think_14b_bf16 | other | 7/7 | (line 1319) ValueError: There are one or more stop strings, either in the arguments to `generate` or in the model's generation config, but we could not locate a tokenizer. When generating with stop strings, you must pa… |
| janus | multi | test_model_text_generation | other | 7/7 | (line 1735) ValueError: Image features and image tokens do not match, tokens: 0, features: 1179648 |
| janus | multi | test_model_text_generation_with_multi_image | other | 7/7 | (line 1735) ValueError: Image features and image tokens do not match, tokens: 0, features: 2359296 |
| janus | single | test_model_text_generation | other | 7/7 | (line 1735) ValueError: Image features and image tokens do not match, tokens: 0, features: 1179648 |
| janus | single | test_model_text_generation_with_multi_image | other | 7/7 | (line 1735) ValueError: Image features and image tokens do not match, tokens: 0, features: 2359296 |
| mistral4 | multi | test_mistral_small_4_generation | other | 7/7 | (line 6741) RuntimeError: Expected mat_a to be Float32, BFloat16 or Float16 matrix, got Float8_e4m3fn |
| mistral4 | multi | test_mistral_small_4_logits | other | 7/7 | (line 6741) RuntimeError: Expected mat_a to be Float32, BFloat16 or Float16 matrix, got Float8_e4m3fn |
| mistral4 | single | test_mistral_small_4_generation | other | 7/7 | (line 6741) RuntimeError: Expected mat_a to be Float32, BFloat16 or Float16 matrix, got Float8_e4m3fn |
| mistral4 | single | test_mistral_small_4_logits | other | 7/7 | (line 6741) RuntimeError: Expected mat_a to be Float32, BFloat16 or Float16 matrix, got Float8_e4m3fn |
| modernvbert | multi | test_masked_lm_inference | other | 7/7 | (line 835) huggingface_hub.errors.RepositoryNotFoundError: 404 Client Error. (Request ID: Root=1-6a2cd2c7-474585012cf449656746018f;6a94d3ec-b383-4df4-9312-21dd1b5b085c) |
| modernvbert | single | test_masked_lm_inference | other | 7/7 | (line 835) huggingface_hub.errors.RepositoryNotFoundError: 404 Client Error. (Request ID: Root=1-6a2cd1d6-5b0260645b34c8982407b11d;430a3b4b-e895-4d83-8777-c2d396c8cbe6) |
| mpt | multi | test_generation | other | 7/7 | (line 469) OSError: mosaicml/mpt-7b is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' |
| mpt | multi | test_generation_8k | other | 7/7 | (line 469) OSError: mosaicml/mpt-7b-8k is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' |
| mpt | multi | test_generation_batched | other | 7/7 | (line 469) OSError: mosaicml/mpt-7b is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' |
| mpt | multi | test_model_logits | other | 7/7 | (line 469) OSError: mosaicml/mpt-7b is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' |
| mpt | single | test_generation | other | 7/7 | (line 469) OSError: mosaicml/mpt-7b is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' |
| mpt | single | test_generation_8k | other | 7/7 | (line 469) OSError: mosaicml/mpt-7b-8k is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' |
| mpt | single | test_generation_batched | other | 7/7 | (line 469) OSError: mosaicml/mpt-7b is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' |
| mpt | single | test_model_logits | other | 7/7 | (line 469) OSError: mosaicml/mpt-7b is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' |
| musicflamingo | multi | test_fixture_batched_matches | other | 6/7 | (line 2935) RuntimeError: expected scalar type Float but found BFloat16 |
| musicflamingo | multi | test_fixture_single_matches | other | 6/7 | (line 2935) RuntimeError: expected scalar type Float but found BFloat16 |
| musicflamingo | single | test_fixture_batched_matches | other | 6/7 | (line 2935) RuntimeError: expected scalar type Float but found BFloat16 |
| musicflamingo | single | test_fixture_single_matches | other | 6/7 | (line 2935) RuntimeError: expected scalar type Float but found BFloat16 |
| olmo | single | test_model_1b_logits | other | 6/7 | (line 2567) RuntimeError: Expected all tensors to be on the same device, but got index is on cpu, different from other tensors on cuda:0 (when checking argument in method wrapper_CUDA__index_select) |
| peft_integration | multi | test_hotswap_with_compile_and_higher_rank_works | other | 7/7 | (line 278) RuntimeError: You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report! |
| peft_integration | multi | test_hotswap_with_compile_and_lower_rank_works | other | 7/7 | (line 278) RuntimeError: You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report! |
| peft_integration | multi | test_hotswap_without_compile_and_with_higher_rank_works | other | 7/7 | (line 278) RuntimeError: You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report! |
| peft_integration | multi | test_hotswap_without_compile_and_with_lower_rank_works | other | 7/7 | (line 278) RuntimeError: You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report! |
| peft_integration | single | test_hotswap_with_compile_and_higher_rank_works | other | 7/7 | (line 278) RuntimeError: You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report! |
| peft_integration | single | test_hotswap_with_compile_and_lower_rank_works | other | 7/7 | (line 278) RuntimeError: You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report! |
| peft_integration | single | test_hotswap_without_compile_and_with_higher_rank_works | other | 7/7 | (line 278) RuntimeError: You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report! |
| peft_integration | single | test_hotswap_without_compile_and_with_lower_rank_works | other | 7/7 | (line 278) RuntimeError: You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report! |
| pegasus | multi | test_device_map | other | 7/7 | (line 334) RuntimeError: Expected all tensors to be on the same device, but got other is on cuda:1, different from other tensors on cuda:0 (when checking argument in method wrapper_CUDA__equal) |
| pegasus | multi | test_pegasus_xsum_summary | other | 7/7 | (line 350) assert torch.Size([2, 422]) == (2, 421) |
| pegasus | single | test_pegasus_xsum_summary | other | 7/7 | (line 350) assert torch.Size([2, 422]) == (2, 421) |
| persimmon | multi | test_model_8b_chat_logits | other | 6/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| phi3 | multi | test_export_static_cache | other | 6/7 | (line 1318) torch._dynamo.exc.Unsupported: Data-dependent branching |
| phi3 | single | test_export_static_cache | other | 7/7 | (line 1318) torch._dynamo.exc.Unsupported: Data-dependent branching |
| phimoe | multi | test_model_phimoe_instruct_logits | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| phimoe | multi | test_phimoe_instruct_generation | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| phimoe | multi | test_phimoe_instruct_with_static_cache | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| pvt_v2 | multi | test_inference_model | other | 7/7 | (line 4194) UnboundLocalError: local variable 'output' referenced before assignment |
| pvt_v2 | single | test_inference_model | other | 7/7 | (line 4194) UnboundLocalError: local variable 'output' referenced before assignment |
| qwen2_moe | multi | test_speculative_generation | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen2_moe | single | test_speculative_generation | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_omni_moe | multi | test_small_model_integration_test | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_omni_moe | multi | test_small_model_integration_test_batch | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_omni_moe | multi | test_small_model_integration_test_multiturn | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_omni_moe | multi | test_small_model_integration_test_w_audio | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_vl_moe | multi | test_small_model_integration_test | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_vl_moe | multi | test_small_model_integration_test_batch | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_vl_moe | multi | test_small_model_integration_test_batch_different_resolutions | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_vl_moe | multi | test_small_model_integration_test_batch_wo_image | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_vl_moe | multi | test_small_model_integration_test_expand_with_video | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| qwen3_vl_moe | multi | test_small_model_integration_test_with_video | other | 7/7 | (line 273) RuntimeError: We encountered some issues during automatic conversion of the weights. For details look at the `CONVERSION` entries of the above report! |
| seamless_m4t | multi | test_speech_to_speech_model | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t | multi | test_speech_to_text_model | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t | multi | test_to_rus_speech | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t | single | test_speech_to_speech_model | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t | single | test_speech_to_text_model | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t | single | test_to_rus_speech | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t_v2 | multi | test_speech_to_speech_model | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t_v2 | multi | test_speech_to_text_model | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t_v2 | multi | test_to_rus_speech | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t_v2 | single | test_speech_to_speech_model | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t_v2 | single | test_speech_to_text_model | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| seamless_m4t_v2 | single | test_to_rus_speech | other | 7/7 | (line 281) ValueError: Invalid input type. Must be a single audio or a list of audio |
| superpoint | multi | test_inference | other | 7/7 | (line 4194) UnboundLocalError: local variable 'output' referenced before assignment |
| superpoint | single | test_inference | other | 7/7 | (line 4194) UnboundLocalError: local variable 'output' referenced before assignment |
| vision_encoder_decoder | multi | test_forward_pass | other | 7/7 | (line 781) huggingface_hub.errors.RemoteEntryNotFoundError: 404 Client Error. (Request ID: Root=1-6a2ce203-064ca2350fd9eab00e952f84;8f588202-712b-49e0-9778-84672de20df7) |
| vision_encoder_decoder | multi | test_generation | other | 7/7 | (line 781) huggingface_hub.errors.RemoteEntryNotFoundError: 404 Client Error. (Request ID: Root=1-6a2ce205-134140366003d1700b15e4df;557daa74-b8c9-48ce-832c-22e2a0f869b5) |
| vision_encoder_decoder | single | test_forward_pass | other | 7/7 | (line 781) huggingface_hub.errors.RemoteEntryNotFoundError: 404 Client Error. (Request ID: Root=1-6a2ce1c5-0234cefe47ace15f2f3ce1c8;6d3423f2-eaa5-4917-84a6-32685f9735a6) |
| vision_encoder_decoder | single | test_generation | other | 7/7 | (line 781) huggingface_hub.errors.RemoteEntryNotFoundError: 404 Client Error. (Request ID: Root=1-6a2ce1c7-1b90e6b26cd0852c5e451d52;da762efd-bac2-4006-8263-d949916d16b8) |
| whisper | multi | test_distil_token_timestamp_generation | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | multi | test_large_batched_generation | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | multi | test_large_generation | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | multi | test_large_generation_multilingual | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | multi | test_large_timestamp_generation | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | multi | test_small_longform_timestamps_generation | other | 7/7 | (line 1882) KeyError: 0 |
| whisper | multi | test_speculative_decoding_distil | other | 7/7 | (line 323) UnboundLocalError: local variable 'is_updated' referenced before assignment |
| whisper | multi | test_tiny_longform_timestamps_generation | other | 7/7 | (line 1698) KeyError: 0 |
| whisper | multi | test_tiny_static_generation_long_form | other | 7/7 | (line 3098) RuntimeError: The size of tensor a (352) must match the size of tensor b (354) at non-singleton dimension 1 |
| whisper | multi | test_tiny_timestamp_generation | other | 7/7 | (line 4176) IndexError: list index out of range |
| whisper | multi | test_whisper_longform_single_batch | other | 7/7 | (line 294) TypeError: '>=' not supported between instances of 'list' and 'int' |
| whisper | single | test_distil_token_timestamp_generation | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | single | test_large_batched_generation | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | single | test_large_generation | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | single | test_large_generation_multilingual | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | single | test_large_timestamp_generation | other | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | single | test_small_longform_timestamps_generation | other | 7/7 | (line 1882) KeyError: 0 |
| whisper | single | test_speculative_decoding_distil | other | 7/7 | (line 323) UnboundLocalError: local variable 'is_updated' referenced before assignment |
| whisper | single | test_tiny_longform_timestamps_generation | other | 7/7 | (line 1698) KeyError: 0 |
| whisper | single | test_tiny_static_generation_long_form | other | 7/7 | (line 3098) RuntimeError: The size of tensor a (352) must match the size of tensor b (354) at non-singleton dimension 1 |
| whisper | single | test_tiny_timestamp_generation | other | 7/7 | (line 4176) IndexError: list index out of range |
| whisper | single | test_whisper_longform_single_batch | other | 7/7 | (line 294) TypeError: '>=' not supported between instances of 'list' and 'int' |
| aya_vision | multi | test_small_model_integration_generate_chat_template | output_mismatch | 7/7 | (line 355) AssertionError: 'The [29 chars] two cats resting on a bright pink blanket spread across a red' != 'The [29 chars] two cats resting on a bright pink blanket. The cats,' |
| aya_vision | single | test_small_model_integration_generate_chat_template | output_mismatch | 7/7 | (line 355) AssertionError: 'The [29 chars] two cats resting on a bright pink blanket spread across a red' != 'The [29 chars] two cats resting on a bright pink blanket. The cats,' |
| big_bird | multi | test_fill_mask | output_mismatch | 7/7 | (line 906) AssertionError: '' != 'happiness' |
| big_bird | single | test_fill_mask | output_mismatch | 7/7 | (line 906) AssertionError: '' != 'happiness' |
| blip_2 | multi | test_inference_t5 | output_mismatch | 7/7 | (line 1616) AssertionError: Lists differ: [0, 2335, 1556, 28, 1782, 30, 8, 2608, 1] != [0, 3, 9, 2335, 19, 1556, 28, 160, 1782, 30, 8, 2608, 1] |
| blip_2 | multi | test_inference_t5_batched_beam_search | output_mismatch | 7/7 | (line 1671) AssertionError: Lists differ: [0, 3, 9, 2335, 19, 3823, 30, 8, 2608, 28, 160, 1782, 1] != [0, 3, 9, 2335, 19, 1556, 28, 160, 1782, 30, 8, 2608, 1] |
| blip_2 | multi | test_inference_t5_multi_accelerator | output_mismatch | 7/7 | (line 1740) AssertionError: Lists differ: [0, 2335, 1556, 28, 1782, 30, 8, 2608, 1] != [0, 3, 9, 2335, 19, 1556, 28, 160, 1782, 30, 8, 2608, 1] |
| blip_2 | single | test_inference_t5 | output_mismatch | 6/7 | (line 1616) AssertionError: Lists differ: [0, 2335, 1556, 28, 1782, 30, 8, 2608, 1] != [0, 3, 9, 2335, 19, 1556, 28, 160, 1782, 30, 8, 2608, 1] |
| blip_2 | single | test_inference_t5_batched_beam_search | output_mismatch | 6/7 | (line 1671) AssertionError: Lists differ: [0, 3, 9, 2335, 19, 3823, 30, 8, 2608, 28, 160, 1782, 1] != [0, 3, 9, 2335, 19, 1556, 28, 160, 1782, 30, 8, 2608, 1] |
| bloom | multi | test_batch_generated_text | output_mismatch | 7/7 | (line 621) AssertionError: Lists differ: ['Hello what is', 'Running a quick test with the followi[54 chars]the'] != ['Hello what is the best way to get the data from the se[127 chars]on2'] |
| bloom | multi | test_batch_generation_padding | output_mismatch | 7/7 | (line 586) AssertionError: Lists differ: [5941[15 chars]632, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,[82 chars]0, 0] != [5941[15 chars]632, 419, 682, 15, 473, 912, 267, 40704, 15, 1[186 chars] 912] |
| bloom | multi | test_simple_generation | output_mismatch | 7/7 | (line 539) AssertionError: 'I en[58 chars] play. I am a very active person, and I am a v[75 chars]am a' != 'I en[58 chars] play with the kids. I am a very active person[86 chars]nd I' |
| bloom | single | test_batch_generated_text | output_mismatch | 7/7 | (line 621) AssertionError: Lists differ: ['Hello what is', 'Running a quick test with the followi[54 chars]the'] != ['Hello what is the best way to get the data from the se[127 chars]on2'] |
| bloom | single | test_batch_generation_padding | output_mismatch | 7/7 | (line 586) AssertionError: Lists differ: [5941[15 chars]632, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,[82 chars]0, 0] != [5941[15 chars]632, 419, 682, 15, 473, 912, 267, 40704, 15, 1[186 chars] 912] |
| bloom | single | test_simple_generation | output_mismatch | 7/7 | (line 539) AssertionError: 'I en[58 chars] play. I am a very active person, and I am a v[75 chars]am a' != 'I en[58 chars] play with the kids. I am a very active person[86 chars]nd I' |
| chameleon | multi | test_model_7b | output_mismatch | 7/7 | (line 399) AssertionError: Lists differ: ['Des[115 chars] dot representing the position of the star Alp[92 chars]ted'] != ['Des[115 chars] dot in the center representing the North Star[99 chars]the'] |
| chameleon | multi | test_model_7b_batched | output_mismatch | 7/7 | (line 445) AssertionError: Lists differ: ['Des[115 chars] dot representing the position of the star Alp[309 chars]on.'] != ['Des[115 chars] dot in the center representing the star Alpha[154 chars]The'] |
| chameleon | multi | test_model_7b_multi_image | output_mismatch | 7/7 | (line 469) AssertionError: Lists differ: ['Wha[74 chars]een the night sky and the internet. The first [115 chars]The'] != ['Wha[74 chars]een two celestial objects, the stars and the c[113 chars]map'] |
| chameleon | single | test_model_7b | output_mismatch | 6/7 | (line 399) AssertionError: Lists differ: ['Des[115 chars] dot representing the position of the star Alp[92 chars]ted'] != ['Des[115 chars] dot in the center representing the North Star[99 chars]the'] |
| chameleon | single | test_model_7b_batched | output_mismatch | 6/7 | (line 445) AssertionError: Lists differ: ['Des[115 chars] dot representing the position of the star Alp[309 chars]on.'] != ['Des[115 chars] dot in the center representing the star Alpha[154 chars]The'] |
| chameleon | single | test_model_7b_multi_image | output_mismatch | 6/7 | (line 469) AssertionError: Lists differ: ['Wha[74 chars]een the night sky and the internet. The first [115 chars]The'] != ['Wha[74 chars]een two celestial objects, the stars and the c[113 chars]map'] |
| clvp | multi | test_conditional_encoder | output_mismatch | 7/7 | (line 552) AssertionError: Tensor-likes are not close! |
| clvp | single | test_conditional_encoder | output_mismatch | 7/7 | (line 552) AssertionError: Tensor-likes are not close! |
| cohere2_vision | multi | test_model_integration_forward | output_mismatch | 6/7 | (line 687) AssertionError: False is not true : Actual logits: tensor([2.3711, 1.6689, 1.8389, 1.9785, 1.9121], dtype=torch.float16) |
| cohere2_vision | single | test_model_integration_forward | output_mismatch | 6/7 | (line 687) AssertionError: False is not true : Actual logits: tensor([2.3711, 1.6689, 1.8389, 1.9785, 1.9121], dtype=torch.float16) |
| colqwen2 | multi | test_model_integration_test_2 | output_mismatch | 7/7 | (line 400) AssertionError: Expected scores tensor([[16.3750, 10.9375, 14.7500], |
| colqwen2 | single | test_model_integration_test_2 | output_mismatch | 7/7 | (line 400) AssertionError: Expected scores tensor([[16.3750, 10.9375, 14.7500], |
| convnextv2 | multi | test_inference_image_classification_head | output_mismatch | 7/7 | (line 308) AssertionError: Tensor-likes are not close! |
| convnextv2 | single | test_inference_image_classification_head | output_mismatch | 7/7 | (line 308) AssertionError: Tensor-likes are not close! |
| cvt | multi | test_inference_image_classification_head | output_mismatch | 7/7 | (line 271) AssertionError: Tensor-likes are not close! |
| cvt | single | test_inference_image_classification_head | output_mismatch | 7/7 | (line 271) AssertionError: Tensor-likes are not close! |
| cwm | single | test_cwm_sliding_window_long_sequence | output_mismatch | 6/7 | (line 182) AssertionError: Tensor-likes are not close! |
| dab_detr | multi | test_inference_object_detection_head | output_mismatch | 7/7 | (line 805) AssertionError: Tensor-likes are not close! |
| dab_detr | single | test_inference_object_detection_head | output_mismatch | 7/7 | (line 805) AssertionError: Tensor-likes are not close! |
| dac | multi | test_integration_0_dac_16khz | output_mismatch | 7/7 | (line 819) AssertionError: Tensor-likes are not close! |
| dac | multi | test_integration_1_dac_24khz | output_mismatch | 7/7 | (line 813) AssertionError: Scalars are not close! |
| dac | multi | test_integration_2_dac_44khz | output_mismatch | 7/7 | (line 825) AssertionError: Scalars are not close! |
| dac | multi | test_integration_batch_0_dac_16khz | output_mismatch | 7/7 | (line 870) AssertionError: Scalars are not close! |
| dac | multi | test_integration_batch_1_dac_24khz | output_mismatch | 7/7 | (line 876) AssertionError: Tensor-likes are not close! |
| dac | multi | test_integration_batch_2_dac_44khz | output_mismatch | 7/7 | (line 876) AssertionError: Tensor-likes are not close! |
| dac | single | test_integration_0_dac_16khz | output_mismatch | 6/7 | (line 819) AssertionError: Tensor-likes are not close! |
| dac | single | test_integration_1_dac_24khz | output_mismatch | 6/7 | (line 813) AssertionError: Scalars are not close! |
| dac | single | test_integration_2_dac_44khz | output_mismatch | 6/7 | (line 825) AssertionError: Scalars are not close! |
| dac | single | test_integration_batch_0_dac_16khz | output_mismatch | 6/7 | (line 870) AssertionError: Scalars are not close! |
| dac | single | test_integration_batch_1_dac_24khz | output_mismatch | 6/7 | (line 876) AssertionError: Tensor-likes are not close! |
| dac | single | test_integration_batch_2_dac_44khz | output_mismatch | 6/7 | (line 876) AssertionError: Tensor-likes are not close! |
| deepseek_v3 | multi | test_compile_static_cache | output_mismatch | 7/7 | (line 424) AssertionError: Lists differ: ['Sim[41 chars]that Frojekecdytesాలు sicʰtinaccianntuala bre[327 chars]rew'] != ['Sim[41 chars]that aportersh455elike injection tactics-altit[355 chars]ick'] |
| deepseek_vl | single | test_model_text_generation_batched | output_mismatch | 7/7 | (line 147) AssertionError: Lists differ: ['You[222 chars]tant:The image depicts a snowy landscape with [367 chars]the'] != ['You[222 chars]tant:What is a cat, a cat, a cat, a cat, a cat[329 chars]the'] |
| deepseek_vl_hybrid | single | test_model_text_generation_batched | output_mismatch | 7/7 | (line 370) AssertionError: Lists differ: ['You[224 chars]nt:\nThe image depicts a fluffy, light brown a[371 chars]he '] != ['You[224 chars]nt:\nA fluffy animal in a fluffyThe image,The [329 chars]he '] |
| depth_anything | multi | test_inference | output_mismatch | 7/7 | (line 259) AssertionError: Tensor-likes are not close! |
| depth_anything | single | test_inference | output_mismatch | 7/7 | (line 259) AssertionError: Tensor-likes are not close! |
| dia | multi | test_dia_model_integration_generate_audio_context | output_mismatch | 7/7 | (line 732) AssertionError: Tensor-likes are not equal! |
| dia | single | test_dia_model_integration_generate_audio_context | output_mismatch | 7/7 | (line 732) AssertionError: Tensor-likes are not equal! |
| diffllama | multi | test_compile_static_cache | output_mismatch | 7/7 | (line 484) AssertionError: Lists differ: ['Sim[41 chars]that 1) the speed of light is constant in all [301 chars]y p'] != ['Sim[41 chars]that 2.5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 '[133 chars]a a'] |
| diffllama | single | test_compile_static_cache | output_mismatch | 7/7 | (line 484) AssertionError: Lists differ: ['Sim[41 chars]that 1) the speed of light is constant in all [301 chars]y p'] != ['Sim[41 chars]that 2.5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 '[133 chars]a a'] |
| efficientnet | multi | test_inference_image_classification_head | output_mismatch | 7/7 | (line 259) AssertionError: Tensor-likes are not close! |
| efficientnet | single | test_inference_image_classification_head | output_mismatch | 7/7 | (line 259) AssertionError: Tensor-likes are not close! |
| emu3 | single | test_model_generation | output_mismatch | 7/7 | (line 363) AssertionError: Lists differ: ['USE[85 chars]ANT: The image captures a moment of tranquilit[145 chars] in'] != ['USE[85 chars]ANT: 1. The image is a 1.你好!'] |
| eomt_dinov3 | multi | test_inference_bf16 | output_mismatch | 7/7 | (line 310) AssertionError: Tensor-likes are not close! |
| eomt_dinov3 | single | test_inference_bf16 | output_mismatch | 7/7 | (line 310) AssertionError: Tensor-likes are not close! |
| evolla | multi | test_inference_natural_language_protein_reasoning | output_mismatch | 7/7 | (line 364) AssertionError: 'This protein' not found in 'systemYouareanAIexpertthatcanansweranyquestionsaboutprotein.userWhatisthefunctionofthisprotein?assistantĊThisĠproteinĠisĠaĠcriticalĠenzymeĠinvolvedĠinĠtheĠmetabol… |
| evolla | single | test_inference_natural_language_protein_reasoning | output_mismatch | 7/7 | (line 364) AssertionError: 'This protein' not found in 'systemYouareanAIexpertthatcanansweranyquestionsaboutprotein.userWhatisthefunctionofthisprotein?assistantĊThisĠproteinĠisĠaĠcriticalĠenzymeĠinvolvedĠinĠtheĠmetabol… |
| exaone4 | single | test_model_generation_beyond_sliding_window | output_mismatch | 7/7 | (line 160) AssertionError: " Thi[46 chars] and the atmosphere is so relaxing. I'm gratef[47 chars]. It" != " Thi[46 chars] and I'm grateful for the opportunity to exper[26 chars]reak" |
| exaone4 | single | test_model_logits | output_mismatch | 7/7 | (line 99) AssertionError: Tensor-likes are not close! |
| exaone_moe | multi | test_model_logits | output_mismatch | 7/7 | (line 120) AssertionError: Tensor-likes are not close! |
| exaone_moe | single | test_model_logits | output_mismatch | 7/7 | (line 120) AssertionError: Tensor-likes are not close! |
| falcon_h1 | multi | test_falcon_h1_hard | output_mismatch | 7/7 | (line 470) AssertionError: 'user\nTell me about the french revolutio[1920 chars]ct**' != "user\nTell me about the french revolutio[1929 chars]n6. " |
| falcon_h1 | single | test_falcon_h1_hard | output_mismatch | 7/7 | (line 470) AssertionError: 'user\nTell me about the french revolutio[1920 chars]ct**' != "user\nTell me about the french revolutio[1929 chars]n6. " |
| falcon_mamba | multi | test_batched_generation | output_mismatch | 7/7 | (line 488) AssertionError: Lists differ: ['Hello today I will be talking about the “Theory of Rela[161 chars]bal'] != ['Hello today I am going to talk about the “Theory of Rel[159 chars]bal'] |
| falcon_mamba | multi | test_generation_4bit | output_mismatch | 7/7 | (line 438) AssertionError: 'Hello today I\'m going to be talking about the "A" in the "A-B' != "Hello today Iava,\n\nI'm sorry to hear that you're having trouble with the " |
| falcon_mamba | multi | test_generation_fp16 | output_mismatch | 7/7 | (line 423) AssertionError: 'Hello today I am going to talk about the “Theory of Re[27 chars]n.\n' != 'Hello today Iava,\n\nI am writing to you today to disc[49 chars]tyle' |
| falcon_mamba | multi | test_generation_torch_compile | output_mismatch | 7/7 | (line 451) AssertionError: 'Hello today I am going to talk about the “Theory of Re[27 chars]n.\n' != 'Hello today Iava,\n\nI am writing to you today to disc[49 chars]tyle' |
| falcon_mamba | single | test_batched_generation | output_mismatch | 7/7 | (line 488) AssertionError: Lists differ: ['Hello today I will be talking about the “Theory of Rela[161 chars]bal'] != ['Hello today I am going to talk about the “Theory of Rel[159 chars]bal'] |
| falcon_mamba | single | test_generation_4bit | output_mismatch | 7/7 | (line 438) AssertionError: 'Hello today I\'m going to be talking about the "A" in the "A-B' != "Hello today Iava,\n\nI'm sorry to hear that you're having trouble with the " |
| falcon_mamba | single | test_generation_fp16 | output_mismatch | 7/7 | (line 423) AssertionError: 'Hello today I am going to talk about the “Theory of Re[27 chars]n.\n' != 'Hello today Iava,\n\nI am writing to you today to disc[49 chars]tyle' |
| falcon_mamba | single | test_generation_torch_compile | output_mismatch | 7/7 | (line 451) AssertionError: 'Hello today I am going to talk about the “Theory of Re[27 chars]n.\n' != 'Hello today Iava,\n\nI am writing to you today to disc[49 chars]tyle' |
| fastspeech2_conformer | multi | test_training_integration | output_mismatch | 7/7 | (line 453) AssertionError: Tensor-likes are not close! |
| fastspeech2_conformer | single | test_training_integration | output_mismatch | 7/7 | (line 453) AssertionError: Tensor-likes are not close! |
| flava | multi | test_inference_with_itm_labels | output_mismatch | 7/7 | (line 1223) AssertionError: The values for attribute 'shape' do not match: torch.Size([1, 2]) != torch.Size([2, 2]). |
flavamultitest_inferenceoutput_mismatch7/7(line 899) AssertionError: -1352.535400390625 != -1352.4685 within 4 places (0.06690039062505093 difference)
flavasingletest_inference_with_itm_labelsoutput_mismatch7/7(line 1223) AssertionError: The values for attribute 'shape' do not match: torch.Size([1, 2]) != torch.Size([2, 2]).
flavasingletest_inferenceoutput_mismatch7/7(line 899) AssertionError: -1352.535400390625 != -1352.4685 within 4 places (0.06690039062505093 difference)
flex_olmomultitest_model_7b_logitsoutput_mismatch7/7(line 87) AssertionError: Tensor-likes are not close!
flex_olmosingletest_model_7b_logitsoutput_mismatch7/7(line 87) AssertionError: Tensor-likes are not close!
florence2multitest_large_model_inference_eageroutput_mismatch7/7(line 470) AssertionError: Lists differ: [[2, [144 chars], 5, 2014, 6, 8, 11, 5, 3618, 6, 89, 32, 3980,[51 chars], 2]] != [[2, [144 chars], 5, 921, 6, 8, 11, 5, 3618, 6, 89, 32, 1104, [44 chars], 2]]
florence2singletest_large_model_inference_eageroutput_mismatch7/7(line 470) AssertionError: Lists differ: [[2, [144 chars], 5, 2014, 6, 8, 11, 5, 3618, 6, 89, 32, 3980,[51 chars], 2]] != [[2, [144 chars], 5, 921, 6, 8, 11, 5, 3618, 6, 89, 32, 1104, [44 chars], 2]]
fsmtmultitest_inference_no_headoutput_mismatch7/7(line 484) AssertionError: Tensor-likes are not close!
fsmtmultitest_translation_direct_0_en_ruoutput_mismatch7/7(line 517) AssertionError:
fsmtmultitest_translation_direct_1_ru_enoutput_mismatch7/7(line 517) AssertionError:
fsmtsingletest_inference_no_headoutput_mismatch7/7(line 484) AssertionError: Tensor-likes are not close!
fsmtsingletest_translation_direct_0_en_ruoutput_mismatch7/7(line 517) AssertionError:
fsmtsingletest_translation_direct_1_ru_enoutput_mismatch7/7(line 517) AssertionError:
fuyumultitest_greedy_generationoutput_mismatch7/7(line 295) AssertionError: '\x04 A bus parked on the side of a road.' != 'A blue bus parked on the side of a road.'
fuyusingletest_greedy_generationoutput_mismatch7/7(line 295) AssertionError: '\x04 A bus parked on the side of a road.' != 'A blue bus parked on the side of a road.'
gemmamultitest_compile_static_cacheoutput_mismatch7/7(line 337) AssertionError: Lists differ: ['Hel[196 chars]tdi 105bhp.\nI have a problem with the engine [37 chars]the'] != ['Hel[196 chars]tdi 110bhp.\nI have a problem with the engine [49 chars]ugh']
gemmamultitest_export_static_cacheoutput_mismatch7/7(line 414) AssertionError: Lists differ: ['Hel[87 chars] in the 1990s. I have been looking on the internet and I have'] != ['Hel[87 chars] in the 1990s. I have looked on the internet and I have found']
gemmamultitest_model_2b_4bitoutput_mismatch7/7(line 190) AssertionError: Lists differ: ['Hel[118 chars] you a few of my favorite and most used brushes.\n\nI"] != ['Hel[118 chars] you my experience with the new wattpad wattpa[38 chars]pad"]
gemmamultitest_model_7b_4bitoutput_mismatch7/7(line 317) AssertionError: Lists differ: ['Hel[59 chars]ke a "self balancing" robot. I have', 'Hi toda[76 chars] of'] != ['Hel[59 chars]ke a program that will take a number and then'[93 chars]!:)']
gemmamultitest_model_7b_bf16output_mismatch7/7(line 258) AssertionError: Lists differ: ['Hel[59 chars]ke a small game. I have a few questions', 'Hi [86 chars]and'] != ['Hel[59 chars]ke a game in which you have to get a', 'Hi tod[83 chars]and']
gemmamultitest_model_7b_fp16output_mismatch7/7(line 228) AssertionError: Lists differ: ['Hel[27 chars]a 1995 4.0L 4x4. I', 'Hi today I am going to s[51 chars] 3D'] != ['Hel[27 chars]a 1999 4.0L 4x4. I', 'Hi today I am going to s[51 chars] 3D']
gemmamultitest_model_7b_fp16_static_cacheoutput_mismatch7/7(line 288) AssertionError: Lists differ: ['Hel[29 chars]1995 4.0L 4x4. I', 'Hi today I am going to sho[49 chars] 3D'] != ['Hel[29 chars]1995 3000gt SL. I have a', 'Hi today I am goin[57 chars] 3D']
gemmasingletest_compile_static_cacheoutput_mismatch7/7(line 337) AssertionError: Lists differ: ['Hel[196 chars]tdi 105bhp.\nI have a problem with the engine [37 chars]the'] != ['Hel[196 chars]tdi 110bhp.\nI have a problem with the engine [49 chars]ugh']
gemmasingletest_export_static_cacheoutput_mismatch7/7(line 414) AssertionError: Lists differ: ['Hel[87 chars] in the 1990s. I have been looking on the internet and I have'] != ['Hel[87 chars] in the 1990s. I have looked on the internet and I have found']
gemmasingletest_model_2b_4bitoutput_mismatch7/7(line 190) AssertionError: Lists differ: ['Hel[118 chars] you a few of my favorite and most used brushes.\n\nI"] != ['Hel[118 chars] you my experience with the new wattpad wattpa[38 chars]pad"]
gemmasingletest_model_7b_4bitoutput_mismatch7/7(line 317) AssertionError: Lists differ: ['Hel[59 chars]ke a "self balancing" robot. I have', 'Hi toda[76 chars] of'] != ['Hel[59 chars]ke a program that will take a number and then'[93 chars]!:)']
gemmasingletest_model_7b_bf16output_mismatch7/7(line 258) AssertionError: Lists differ: ['Hel[59 chars]ke a small game. I have a few questions', 'Hi [86 chars]and'] != ['Hel[59 chars]ke a game in which you have to get a', 'Hi tod[83 chars]and']
gemmasingletest_model_7b_fp16output_mismatch7/7(line 228) AssertionError: Lists differ: ['Hel[27 chars]a 1995 4.0L 4x4. I', 'Hi today I am going to s[51 chars] 3D'] != ['Hel[27 chars]a 1999 4.0L 4x4. I', 'Hi today I am going to s[51 chars] 3D']
gemmasingletest_model_7b_fp16_static_cacheoutput_mismatch7/7(line 288) AssertionError: Lists differ: ['Hel[27 chars]a 1999 4.0L 4x4. I', 'Hi today I am going to s[51 chars] 3D'] != ['Hel[27 chars]a 1995 3000gt SL. I have a', 'Hi today I am go[59 chars] 3D']
gemma2multitest_model_2b_pipeline_bf16_flex_attentionoutput_mismatch7/7(line 2876) Failed: (subprocess) AssertionError: "Hi t[26 chars]ng about the 10 best anime of all time.\n\n1" != "Hi t[26 chars]ng about the 10 most powerful characters in the Naruto series."
gemma2singletest_model_2b_pipeline_bf16_flex_attentionoutput_mismatch7/7(line 2876) Failed: (subprocess) AssertionError: "Hi t[26 chars]ng about the 10 best anime of all time.\n\n1" != "Hi t[26 chars]ng about the 10 most powerful characters in the Naruto series."
gemma3multitest_dynamic_sliding_window_is_defaultoutput_mismatch7/7(line 874) AssertionError: 'DynamicSlidingWindowLayer' unexpectedly found in 'DynamicCache(layers=[DynamicSlidingWindowLayer, DynamicSlidingWindowLayer, DynamicSlidingWindowLayer, DynamicSlidingWindowLayer, DynamicSlid…
gemma3multitest_model_1b_text_onlyoutput_mismatch7/7(line 728) AssertionError: Lists differ: ['Wri[48 chars]data streams, a boundless flow,\nA silent worl[63 chars]ing'] != ['Wri[48 chars]data flows, a silent stream,\nInto the neural [51 chars],\n']
gemma3multitest_model_4b_batchoutput_mismatch7/7(line 548) AssertionError: Lists differ: ['use[149 chars]with turquoise water and a blue sky in the bac[227 chars]own"] != ['use[149 chars]with clear turquoise water and a blue sky in t[231 chars]own"]
gemma3multitest_model_4b_batch_cropsoutput_mismatch7/7(line 663) AssertionError: Lists differ: ['user\nYou are a helpful assistant.\n\nHe[674 chars]h a'] != ["user\nYou are a helpful assistant.\n\nHe[674 chars]h a']
gemma3multitest_model_4b_cropsoutput_mismatch7/7(line 590) AssertionError: Lists differ: ["user\nYou are a helpful assistant.\n\nHe[268 chars]the"] != ['user\nYou are a helpful assistant.\n\nHe[268 chars]the']
gemma3singletest_dynamic_sliding_window_is_defaultoutput_mismatch7/7(line 874) AssertionError: 'DynamicSlidingWindowLayer' unexpectedly found in 'DynamicCache(layers=[DynamicSlidingWindowLayer, DynamicSlidingWindowLayer, DynamicSlidingWindowLayer, DynamicSlidingWindowLayer, DynamicSlid…
gemma3singletest_model_1b_text_onlyoutput_mismatch7/7(line 728) AssertionError: Lists differ: ['Wri[48 chars]data streams, a boundless flow,\nA silent worl[63 chars]ing'] != ['Wri[48 chars]data flows, a silent stream,\nInto the neural [51 chars],\n']
gemma3singletest_model_4b_batchoutput_mismatch7/7(line 548) AssertionError: Lists differ: ['use[149 chars]with turquoise water and a blue sky in the bac[227 chars]own"] != ['use[149 chars]with clear turquoise water and a blue sky in t[231 chars]own"]
gemma3singletest_model_4b_batch_cropsoutput_mismatch7/7(line 663) AssertionError: Lists differ: ['user\nYou are a helpful assistant.\n\nHe[674 chars]h a'] != ["user\nYou are a helpful assistant.\n\nHe[674 chars]h a']
gemma3singletest_model_4b_cropsoutput_mismatch7/7(line 590) AssertionError: Lists differ: ["user\nYou are a helpful assistant.\n\nHe[268 chars]the"] != ['user\nYou are a helpful assistant.\n\nHe[268 chars]the']
gemma3nmultitest_generation_beyond_sliding_windowoutput_mismatch7/7(line 1196) AssertionError: Lists differ: [' and I find it very relaxing. I also lik[112 chars]re'"] != [" and the people are so friendly. I'm so [93 chars]re'"]
gemma3nmultitest_model_4b_batchoutput_mismatch7/7(line 1083) AssertionError: Lists differ: ['use[196 chars]ewer and has its tongue', "user\nYou are a hel[193 chars]cow"] != ['use[196 chars]ewer with its head slightly', "user\nYou are a[197 chars]cow"]
gemma3nmultitest_model_4b_bf16output_mismatch7/7(line 998) AssertionError: Lists differ: ['use[149 chars]to a turquoise ocean. The cow is facing the vi[31 chars]ned'] != ['use[149 chars]to a clear blue ocean. The cow is facing the v[25 chars]tly']
gemma3nmultitest_model_4b_imageoutput_mismatch7/7(line 1110) AssertionError: Lists differ: ['use[149 chars]to a turquoise ocean. The cow is facing the vi[31 chars]ned'] != ['use[149 chars]to a clear blue ocean. The cow is facing the v[25 chars]tly']
gemma3nmultitest_model_4b_multiimageoutput_mismatch7/7(line 1151) AssertionError: Lists differ: ['use[140 chars]n district. Here are the key elements:\n\n* **A prominent red'] != ['use[140 chars]n district. Here are some of the key elements:\n\n* **A']
gemma3nsingletest_generation_beyond_sliding_windowoutput_mismatch7/7(line 1196) AssertionError: Lists differ: [' and I find it very relaxing. I also lik[112 chars]re'"] != [" and the people are so friendly. I'm so [93 chars]re'"]
gemma3nsingletest_model_4b_batchoutput_mismatch7/7(line 1083) AssertionError: Lists differ: ['use[196 chars]ewer and has its tongue', "user\nYou are a hel[193 chars]cow"] != ['use[196 chars]ewer with its head slightly', "user\nYou are a[197 chars]cow"]
gemma3nsingletest_model_4b_bf16output_mismatch7/7(line 998) AssertionError: Lists differ: ['use[149 chars]to a turquoise ocean. The cow is facing the vi[31 chars]ned'] != ['use[149 chars]to a clear blue ocean. The cow is facing the v[25 chars]tly']
gemma3nsingletest_model_4b_imageoutput_mismatch7/7(line 1110) AssertionError: Lists differ: ['use[149 chars]to a turquoise ocean. The cow is facing the vi[31 chars]ned'] != ['use[149 chars]to a clear blue ocean. The cow is facing the v[25 chars]tly']
gemma3nsingletest_model_4b_multiimageoutput_mismatch7/7(line 1151) AssertionError: Lists differ: ['use[140 chars]n district. Here are the key elements:\n\n* **A prominent red'] != ['use[140 chars]n district. Here are some of the key elements:\n\n* **A']
gemma4multitest_model_multiimageoutput_mismatch7/7(line 742) AssertionError: Lists differ: ['Bas[66 chars]und & Street Scene:**\n* **Roadway:** There is an'] != ['Bas[66 chars]und & Street Scene:**\n* **Traffic Sign:** The most prominent']
gemma4multitest_model_with_imageoutput_mismatch7/7(line 655) AssertionError: Lists differ: ['Thi[61 chars] beach** with the **ocean** in the background under a **clear'] != ['Thi[61 chars] beach** with the **ocean and a blue sky** in the background']
gemma4multitest_model_with_image_batchoutput_mismatch7/7(line 706) AssertionError: Lists differ: ['Thi[81 chars]ocean** in the background under a **clear', "N[102 chars] on"] != ['Thi[81 chars]ocean and a blue sky** in the background', 'No[127 chars]lue']
gemma4singletest_model_multiimageoutput_mismatch7/7(line 742) AssertionError: Lists differ: ['Bas[66 chars]und & Street Scene:**\n* **Roadway:** There is an'] != ['Bas[66 chars]und & Street Scene:**\n* **Traffic Sign:** The most prominent']
gemma4singletest_model_with_imageoutput_mismatch7/7(line 655) AssertionError: Lists differ: ['Thi[61 chars] beach** with the **ocean** in the background under a **clear'] != ['Thi[61 chars] beach** with the **ocean and a blue sky** in the background']
gemma4singletest_model_with_image_batchoutput_mismatch7/7(line 706) AssertionError: Lists differ: ['Thi[81 chars]ocean** in the background under a **clear', "N[102 chars] on"] != ['Thi[81 chars]ocean and a blue sky** in the background', 'No[127 chars]lue']
generationmultitest_TopH_example_integrationoutput_mismatch7/7(line 3215) AssertionError: Lists differ: ['Tel[23 chars]key. Sure, here\'s one for you:\n\nWhy did the[67 chars]s"!'] != ['Tel[23 chars]key. Why did the monkey go to the doctor? Beca[34 chars]c"!']
generationmultitest_assisted_generation_early_exitoutput_mismatch7/7(line 4077) AssertionError: Lists differ: ['Ali[20 chars]ng a game of poker. Alice has a pair of 7s and Bob has a pair'] != ['Ali[20 chars]ng a game of poker. Alice has a pair of 8s and Bob has a pair']
generationmultitest_beam_search_advanced_stopping_criteriaoutput_mismatch7/7(line 681) AssertionError: True is not false
generationmultitest_beam_search_early_stop_heuristicoutput_mismatch7/7(line 2965) AssertionError: "<|us[317 chars]}\\).\nThe sum of 3 and 5 is \\(3 + 5 = 8\\).\[40 chars]\\)." != "<|us[317 chars]}\\)."
generationsingletest_TopH_example_integrationoutput_mismatch7/7(line 3215) AssertionError: Lists differ: ['Tel[23 chars]key. Sure, here\'s one for you:\n\nWhy did the[67 chars]s"!'] != ['Tel[23 chars]key. Why did the monkey go to the doctor? Beca[34 chars]c"!']
generationsingletest_assisted_generation_early_exitoutput_mismatch7/7(line 4077) AssertionError: Lists differ: ['Ali[20 chars]ng a game of poker. Alice has a pair of 7s and Bob has a pair'] != ['Ali[20 chars]ng a game of poker. Alice has a pair of 8s and Bob has a pair']
generationsingletest_beam_search_advanced_stopping_criteriaoutput_mismatch7/7(line 681) AssertionError: True is not false
generationsingletest_beam_search_early_stop_heuristicoutput_mismatch7/7(line 2965) AssertionError: "<|us[317 chars]}\\).\nThe sum of 3 and 5 is \\(3 + 5 = 8\\).\[40 chars]\\)." != "<|us[317 chars]}\\)."
glmmultitest_model_9b_eageroutput_mismatch7/7(line 133) AssertionError: Lists differ: ['Hel[140 chars]ou how to make a simple and easy to make a DIY paper flower.'] != ['Hel[140 chars]ou how to make a simple and easy to make a DIY paper lantern.']
glmsingletest_model_9b_eageroutput_mismatch7/7(line 133) AssertionError: Lists differ: ['Hel[140 chars]ou how to make a simple and easy to make a DIY paper flower.'] != ['Hel[140 chars]ou how to make a simple and easy to make a DIY paper lantern.']
glm_imagemultitest_image_to_image_generationoutput_mismatch7/7(line 687) AssertionError: False is not true : Expected first 30 tokens:
glm_imagesingletest_image_to_image_generationoutput_mismatch7/7(line 687) AssertionError: False is not true : Expected first 30 tokens:
glm_ocrmultitest_small_model_integration_test_batchoutput_mismatch7/7(line 503) AssertionError: Lists differ: ['\n<|image|><|image|><|image|><|image|><|[14885 chars]ia.'] != ["\nWhat kind of dog is this?\n<think>Got [256 chars]t's"]
glm_ocrmultitest_small_model_integration_test_batch_different_resolutionsoutput_mismatch7/7(line 631) AssertionError: Lists differ: ['\n<|image|><|image|><|image|><|image|><|[10983 chars]at.'] != ["\nWhat kind of dog is this?\n<think>Got [258 chars]but"]
glm_ocrmultitest_small_model_integration_test_batch_wo_imageoutput_mismatch7/7(line 603) AssertionError: Lists differ: ['\n<|image|><|image|><|image|><|image|><|[7469 chars]Ai."] != ["\nWhat kind of dog is this?\n<think>Got [267 chars]ion']
glm_ocrmultitest_small_model_integration_test_expandoutput_mismatch7/7(line 575) AssertionError: Lists differ: ['\n<|image|><|image|><|image|><|image|><|[14840 chars]d a'] != ["\nWhat kind of dog is this?\n<think>Got [267 chars]lly"]
glm_ocrmultitest_small_model_integration_test_with_videooutput_mismatch7/7(line 541) AssertionError: Lists differ: ['\n<|begin_of_video|><|image|><|image|><|[50804 chars]rt.'] != ["\n012345Describe this video.\n<think>Got[114 chars]irt"]
glm_ocrsingletest_small_model_integration_test_batchoutput_mismatch7/7(line 503) AssertionError: Lists differ: ['\n<|image|><|image|><|image|><|image|><|[14885 chars]ia.'] != ["\nWhat kind of dog is this?\n<think>Got [256 chars]t's"]
glm_ocrsingletest_small_model_integration_test_batch_different_resolutionsoutput_mismatch7/7(line 631) AssertionError: Lists differ: ['\n<|image|><|image|><|image|><|image|><|[10983 chars]at.'] != ["\nWhat kind of dog is this?\n<think>Got [258 chars]but"]
glm_ocrsingletest_small_model_integration_test_batch_wo_imageoutput_mismatch7/7(line 603) AssertionError: Lists differ: ['\n<|image|><|image|><|image|><|image|><|[7469 chars]Ai."] != ["\nWhat kind of dog is this?\n<think>Got [267 chars]ion']
glm_ocrsingletest_small_model_integration_test_expandoutput_mismatch7/7(line 575) AssertionError: Lists differ: ['\n<|image|><|image|><|image|><|image|><|[14840 chars]d a'] != ["\nWhat kind of dog is this?\n<think>Got [267 chars]lly"]
glm_ocrsingletest_small_model_integration_test_with_videooutput_mismatch7/7(line 541) AssertionError: Lists differ: ['\n<|begin_of_video|><|image|><|image|><|[50804 chars]rt.'] != ["\n012345Describe this video.\n<think>Got[114 chars]irt"]
got_ocr2multitest_small_model_integration_test_got_ocr_formatoutput_mismatch7/7(line 210) AssertionError: 'R\\&D' != '\\title{\nR'
got_ocr2singletest_small_model_integration_test_got_ocr_formatoutput_mismatch7/7(line 210) AssertionError: 'R\\&D' != '\\title{\nR'
granitemultitest_model_3b_logits_bf16output_mismatch7/7(line 687) AssertionError: False is not true
granitesingletest_model_3b_logits_bf16output_mismatch7/7(line 687) AssertionError: False is not true
grounding_dinomultitest_cross_attention_maskoutput_mismatch7/7(line 787) AssertionError: Tensor-likes are not close!
grounding_dinomultitest_grounding_dino_lossoutput_mismatch7/7(line 869) AssertionError: Scalars are not close!
grounding_dinomultitest_inference_object_detection_headoutput_mismatch7/7(line 678) AssertionError: Tensor-likes are not close!
grounding_dinomultitest_inference_object_detection_head_equivalence_cpu_acceleratoroutput_mismatch7/7(line 745) AssertionError: Tensor-likes are not close!
grounding_dinosingletest_cross_attention_maskoutput_mismatch7/7(line 787) AssertionError: Tensor-likes are not close!
grounding_dinosingletest_grounding_dino_lossoutput_mismatch7/7(line 869) AssertionError: Scalars are not close!
grounding_dinosingletest_inference_object_detection_headoutput_mismatch7/7(line 678) AssertionError: Tensor-likes are not close!
grounding_dinosingletest_inference_object_detection_head_equivalence_cpu_acceleratoroutput_mismatch7/7(line 745) AssertionError: Tensor-likes are not close!
heliummultitest_model_2boutput_mismatch7/7(line 73) AssertionError: Lists differ: ['Hel[51 chars]have been working on a new project for a while now and I have'] != ['Hel[51 chars]have been working on a new project for a while now, and I']
heliumsingletest_model_2boutput_mismatch7/7(line 73) AssertionError: Lists differ: ['Hel[51 chars]have been working on a new project for a while now and I have'] != ['Hel[51 chars]have been working on a new project for a while now, and I']
hieramultitest_inference_image_classification_headoutput_mismatch7/7(line 560) AssertionError: Tensor-likes are not close!
hierasingletest_inference_image_classification_headoutput_mismatch7/7(line 560) AssertionError: Tensor-likes are not close!
higgs_audio_v2multitest_batched_inferenceoutput_mismatch7/7(line 1399) AssertionError: Tensor-likes are not equal!
higgs_audio_v2multitest_multi_speaker_smart_voiceoutput_mismatch7/7(line 758) AssertionError: Tensor-likes are not equal!
higgs_audio_v2multitest_multi_speaker_voice_cloningoutput_mismatch7/7(line 1098) AssertionError: Tensor-likes are not equal!
higgs_audio_v2multitest_zero_shot_voice_cloningoutput_mismatch7/7(line 931) AssertionError: Tensor-likes are not equal!
higgs_audio_v2singletest_batched_inferenceoutput_mismatch7/7(line 1399) AssertionError: Tensor-likes are not equal!
higgs_audio_v2singletest_multi_speaker_smart_voiceoutput_mismatch7/7(line 758) AssertionError: Tensor-likes are not equal!
higgs_audio_v2singletest_multi_speaker_voice_cloningoutput_mismatch7/7(line 1098) AssertionError: Tensor-likes are not equal!
higgs_audio_v2singletest_zero_shot_voice_cloningoutput_mismatch7/7(line 931) AssertionError: Tensor-likes are not equal!
instructblipmultitest_inference_flant5_xloutput_mismatch7/7(line 718) AssertionError: Lists differ: [0, 3[68 chars]459, 9256, 16, 8, 2214, 13, 3, 9, 3164, 690, 2[500 chars]5, 1] != [0, 3[68 chars]459, 4049, 16, 8, 2214, 13, 3, 9, 3164, 690, 2[295 chars]5, 1]
instructblipsingletest_inference_flant5_xloutput_mismatch7/7(line 718) AssertionError: Lists differ: [0, 3[68 chars]459, 9256, 16, 8, 2214, 13, 3, 9, 3164, 690, 2[500 chars]5, 1] != [0, 3[68 chars]459, 4049, 16, 8, 2214, 13, 3, 9, 3164, 690, 2[295 chars]5, 1]
instructblipvideomultitest_inference_vicuna_7boutput_mismatch7/7(line 671) AssertionError: 'Expl[43 chars]a baby girl wearing glasses is reading a book on the bed 1' != 'Expl[43 chars]a baby girl wearing glasses is reading a book on the bed 1080p'
instructblipvideosingletest_inference_vicuna_7boutput_mismatch7/7(line 671) AssertionError: 'Expl[43 chars]a baby girl wearing glasses is reading a book on the bed 1' != 'Expl[43 chars]a baby girl wearing glasses is reading a book on the bed 1080p'
internvlmultitest_llama_small_model_integration_forwardoutput_mismatch7/7(line 687) AssertionError: False is not true : Actual logits: tensor([ -9.8750, -0.4954, 1.4580, -10.3281, -10.3359], dtype=torch.float16)
internvlmultitest_llama_small_model_integration_generate_text_onlyoutput_mismatch7/7(line 714) AssertionError: "Autu[14 chars],\nNature's breath, a season's sigh,\nSilent woods awake." != "Autu[14 chars],\nNature's breath, a silent sigh,\nWinter's chill approaches."
internvlsingletest_llama_small_model_integration_forwardoutput_mismatch7/7(line 687) AssertionError: False is not true : Actual logits: tensor([ -9.8750, -0.4954, 1.4580, -10.3281, -10.3359], dtype=torch.float16)
internvlsingletest_llama_small_model_integration_generate_text_onlyoutput_mismatch7/7(line 714) AssertionError: "Autu[14 chars],\nNature's breath, a season's sigh,\nSilent woods awake." != "Autu[14 chars],\nNature's breath, a silent sigh,\nWinter's chill approaches."
jambamultitest_simple_batched_generate_with_paddingoutput_mismatch7/7(line 576) AssertionError: "<|startoftext|>Tell me a story<|pad|><|p[50 chars]t I'" != '<|pad|><|pad|><|pad|><|pad|><|pad|><|pad[76 chars]ates'
jambasingletest_simple_batched_generate_with_paddingoutput_mismatch7/7(line 576) AssertionError: "<|startoftext|>Tell me a story<|pad|><|p[50 chars]t I'" != '<|pad|><|pad|><|pad|><|pad|><|pad|><|pad[76 chars]ates'
kosmos2multitest_snowman_image_captioningoutput_mismatch7/7(line 79) AssertionError:
kosmos2multitest_snowman_image_captioning_batchoutput_mismatch7/7(line 712) AssertionError: Lists differ: ['<gr[35 chars]ail: A snowman is sitting in front of a fire, [575 chars]t>.'] != ['<gr[35 chars]ail: The image features a snowman sitting by<p[836 chars]t>.']
kosmos2singletest_snowman_image_captioningoutput_mismatch7/7(line 79) AssertionError:
kosmos2singletest_snowman_image_captioning_batchoutput_mismatch7/7(line 712) AssertionError: Lists differ: ['<gr[35 chars]ail: A snowman is sitting in front of a fire, [575 chars]t>.'] != ['<gr[35 chars]ail: The image features a snowman sitting by<p[836 chars]t>.']
kosmos2_5multitest_eageroutput_mismatch7/7(line 578) AssertionError: Lists differ: ['<bb[216 chars]<y_650></bbox>COOKIE DOH SAUCES\n<bbox><x_788>[452 chars]0\n'] != ['<bb[216 chars]<y_651></bbox>COOKIE DOH SAUCES\n<bbox><x_788>[452 chars]0\n']
kosmos2_5singletest_eageroutput_mismatch7/7(line 578) AssertionError: Lists differ: ['<bb[216 chars]<y_650></bbox>COOKIE DOH SAUCES\n<bbox><x_788>[452 chars]0\n'] != ['<bb[216 chars]<y_651></bbox>COOKIE DOH SAUCES\n<bbox><x_788>[452 chars]0\n']
layoutlmv2multitest_processor_case_1output_mismatch7/7(line 675) AssertionError: Sequences differ: "[CLS[522 chars]t itc ' s new fmcg businesses are the fastest [829 chars]PAD]" != "[CLS[522 chars]t itc's new fmcg businesses are the fastest gr[827 chars]PAD]"
layoutlmv2multitest_processor_case_4output_mismatch7/7(line 675) AssertionError: Sequences differ: "[CLS] what ' s his name? [SEP] 11 : 14 to 11 : 39 a[1108 chars]SEP]" != "[CLS] what's his name? [SEP] 11 : 14 to 11 : 39 a. [1106 chars]SEP]"
layoutlmv2multitest_processor_case_5output_mismatch7/7(line 675) AssertionError: Sequences differ: "[CLS] what ' s his name? [SEP] hello world [SEP]" != "[CLS] what's his name? [SEP] hello world [SEP]"
layoutlmv2singletest_processor_case_1output_mismatch7/7(line 675) AssertionError: Sequences differ: "[CLS[522 chars]t itc ' s new fmcg businesses are the fastest [829 chars]PAD]" != "[CLS[522 chars]t itc's new fmcg businesses are the fastest gr[827 chars]PAD]"
layoutlmv2singletest_processor_case_4output_mismatch7/7(line 675) AssertionError: Sequences differ: "[CLS] what ' s his name? [SEP] 11 : 14 to 11 : 39 a[1108 chars]SEP]" != "[CLS] what's his name? [SEP] 11 : 14 to 11 : 39 a. [1106 chars]SEP]"
layoutlmv2singletest_processor_case_5output_mismatch7/7(line 675) AssertionError: Sequences differ: "[CLS] what ' s his name? [SEP] hello world [SEP]" != "[CLS] what's his name? [SEP] hello world [SEP]"
lfm2_moemultitest_model_1a8b_batched_chat_generationoutput_mismatch7/7(line 223) AssertionError: Lists differ: ['Who are you? (AI) designed to assist? \nI am an AI ass[192 chars]ial'] != ['Who are you? (as AI) created by? \nI am an artificial [200 chars]ish']
lfm2_moesingletest_model_1a8b_batched_chat_generationoutput_mismatch7/7(line 223) AssertionError: Lists differ: ['Who are you? (AI) designed to assist? \nI am an AI ass[192 chars]ial'] != ['Who are you? (as AI) created by? \nI am an artificial [200 chars]ish']
lfm2_vlmultitest_integration_testoutput_mismatch7/7(line 246) AssertionError: 'In t[53 chars]. They are both very relaxed and comfortable. [14 chars]grey' != 'In t[53 chars]. There are also two remote controls on the blanket.\n\n\n\n'
lfm2_vlmultitest_integration_test_high_resolutionoutput_mismatch7/7(line 354) AssertionError: 'In t[52 chars]ymbol of freedom and democracy. It stands tall on a small' != 'In t[52 chars]ymbol of freedom and democracy. It stands on Liberty Island in'
lfm2_vlsingletest_integration_testoutput_mismatch7/7(line 246) AssertionError: 'In t[53 chars]. They are both very relaxed and comfortable. [14 chars]grey' != 'In t[53 chars]. There are also two remote controls on the blanket.\n\n\n\n'
lfm2_vlsingletest_integration_test_high_resolutionoutput_mismatch7/7(line 354) AssertionError: 'In t[52 chars]ymbol of freedom and democracy. It stands tall on a small' != 'In t[52 chars]ymbol of freedom and democracy. It stands on Liberty Island in'
llamamultitest_llama_3_1_hardoutput_mismatch7/7(line 96) AssertionError: 'Tell[74 chars]ical social and political upheaval in France t[552 chars] the' != 'Tell[74 chars]ical political and social upheaval in France t[558 chars]nshr'
llamasingletest_llama_3_1_hardoutput_mismatch7/7(line 96) AssertionError: 'Tell[74 chars]ical social and political upheaval in France t[552 chars] the' != 'Tell[74 chars]ical political and social upheaval in France t[558 chars]nshr'
llavamultitest_batched_generationoutput_mismatch7/7(line 566) AssertionError: Lists differ: ["\n [134 chars] one image and a", '\nUSER: Describe the image[210 chars]ama'] != ["\n [134 chars] one and a yellow", '\nUSER: Describe the imag[211 chars]ama']
llavamultitest_pixtral_batchedoutput_mismatch7/7(line 724) AssertionError: Lists differ: ['Wha[97 chars]mage?A narrow dirt path is surrounded by grass[74 chars]ue.'] != ['Wha[97 chars]mage?The image depicts a narrow, winding dirt [175 chars]ere']
llavasingletest_batched_generationoutput_mismatch7/7(line 566) AssertionError: Lists differ: ["\n [134 chars] one image and a", '\nUSER: Describe the image[210 chars]ama'] != ["\n [134 chars] one and a yellow", '\nUSER: Describe the imag[211 chars]ama']
llavasingletest_pixtral_batchedoutput_mismatch7/7(line 724) AssertionError: Lists differ: ['Wha[97 chars]mage?A narrow dirt path is surrounded by grass[74 chars]ue.'] != ['Wha[97 chars]mage?The image depicts a narrow, winding dirt [175 chars]ere']
llava_nextmultitest_small_model_integration_testoutput_mismatch7/7(line 172) AssertionError: assert False
llava_nextsingletest_small_model_integration_testoutput_mismatch7/7(line 172) AssertionError: assert False
llava_next_videomultitest_small_model_integration_testoutput_mismatch7/7(line 388) AssertionError: 'USER[154 chars]hile wearing a pair of glasses that are too la[24 chars] are' != 'USER[154 chars]hile another child is attempting to read the s[45 chars]eems'
llava_next_videomultitest_small_model_integration_test_batch_matches_singleoutput_mismatch7/7(line 480) AssertionError: 'USER[154 chars]hile another child is attempting to read the s[96 chars]e to' != 'USER[154 chars]hile wearing a pair of glasses that are too la[69 chars]g it'
llava_next_videosingletest_small_model_integration_testoutput_mismatch7/7(line 388) AssertionError: 'USER[154 chars]hile wearing a pair of glasses that are too la[24 chars] are' != 'USER[154 chars]hile another child is attempting to read the s[45 chars]eems'
llava_next_videosingletest_small_model_integration_test_batch_matches_singleoutput_mismatch7/7(line 480) AssertionError: 'USER[154 chars]hile another child is attempting to read the s[96 chars]e to' != 'USER[154 chars]hile wearing a pair of glasses that are too la[69 chars]g it'
longt5 | multi | test_inference_hidden_states | output_mismatch | 7/7 | (line 1225) AssertionError: Tensor-likes are not close!
longt5 | multi | test_summarization | output_mismatch | 7/7 | (line 1194) AssertionError: Lists differ: ['background : coronary artery disease ( ca[601 chars]red'] != ['sss thessass:ss andss toss ofss fillssess[171 chars]se,']
longt5 | single | test_inference_hidden_states | output_mismatch | 7/7 | (line 1225) AssertionError: Tensor-likes are not close!
longt5 | single | test_summarization | output_mismatch | 7/7 | (line 1194) AssertionError: Lists differ: ['background : coronary artery disease ( ca[601 chars]red'] != ['sss thessass:ss andss toss ofss fillssess[171 chars]se,']
luke | multi | test_inference_base_model | output_mismatch | 7/7 | (line 905) AssertionError: Tensor-likes are not close!
luke | multi | test_inference_large_model | output_mismatch | 7/7 | (line 940) AssertionError: Tensor-likes are not close!
luke | single | test_inference_base_model | output_mismatch | 7/7 | (line 905) AssertionError: Tensor-likes are not close!
luke | single | test_inference_large_model | output_mismatch | 7/7 | (line 940) AssertionError: Tensor-likes are not close!
lw_detr | multi | test_inference_object_detection_head_tiny | output_mismatch | 7/7 | (line 690) AssertionError: Tensor-likes are not close!
lw_detr | multi | test_inference_object_detection_head_xlarge | output_mismatch | 7/7 | (line 766) AssertionError: Tensor-likes are not close!
lw_detr | single | test_inference_object_detection_head_tiny | output_mismatch | 7/7 | (line 690) AssertionError: Tensor-likes are not close!
lw_detr | single | test_inference_object_detection_head_xlarge | output_mismatch | 7/7 | (line 766) AssertionError: Tensor-likes are not close!
m2m_100 | multi | test_seq_to_seq_generation | output_mismatch | 7/7 | (line 397) AssertionError: assert ['</s>__en__T... France.</s>'] == ['</s> __en__... France.</s>']
m2m_100 | single | test_seq_to_seq_generation | output_mismatch | 7/7 | (line 397) AssertionError: assert ['</s>__en__T... France.</s>'] == ['</s> __en__... France.</s>']
mimi | multi | test_integration | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
mimi | multi | test_integration_longform | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
mimi | single | test_integration | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
mimi | single | test_integration_longform | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
minimax | multi | test_small_model_logits | output_mismatch | 7/7 | (line 233) AssertionError: Tensor-likes are not close!
minimax | single | test_small_model_logits | output_mismatch | 7/7 | (line 233) AssertionError: Tensor-likes are not close!
ministral | multi | test_model_8b_generation | output_mismatch | 7/7 | (line 116) AssertionError: 'My favourite condiment is 100% natural, 100% organic, 100% free of' != 'MyfavouritecondimentisĊĠĠĠĠJoined:Ġ2018-01-01,Ġ12'
ministral | multi | test_model_8b_logits | output_mismatch | 7/7 | (line 93) AssertionError: Tensor-likes are not close!
ministral | single | test_model_8b_generation | output_mismatch | 7/7 | (line 116) AssertionError: 'My favourite condiment is 100% natural, 100% organic, 100% free of' != 'MyfavouritecondimentisĊĠĠĠĠJoined:Ġ2018-01-01,Ġ12'
ministral | single | test_model_8b_logits | output_mismatch | 7/7 | (line 93) AssertionError: Tensor-likes are not close!
ministral3 | multi | test_model_3b_generation | output_mismatch | 7/7 | (line 130) AssertionError: 'My favourite condiment is icing sugar. I[47 chars]fles' != "My favourite condiment is 100% pure oliv[46 chars]t in"
ministral3 | multi | test_model_3b_logits | output_mismatch | 7/7 | (line 102) AssertionError: Tensor-likes are not close!
ministral3 | single | test_model_3b_generation | output_mismatch | 7/7 | (line 130) AssertionError: 'My favourite condiment is icing sugar. I[47 chars]fles' != "My favourite condiment is 100% pure oliv[46 chars]t in"
ministral3 | single | test_model_3b_logits | output_mismatch | 7/7 | (line 102) AssertionError: Tensor-likes are not close!
mistral | multi | test_model_7b_logits | output_mismatch | 7/7 | (line 112) AssertionError: Tensor-likes are not close!
mistral | multi | test_speculative_generation | output_mismatch | 7/7 | (line 207) AssertionError: 'My f[18 chars] is 100% ketchup. I’m not a fan of mustard, relish' != 'My f[18 chars] is 100% mayonnaise. I’m not a fan of the fancy stuff with all'
mistral | single | test_model_7b_logits | output_mismatch | 7/7 | (line 112) AssertionError: Tensor-likes are not close!
mistral | single | test_speculative_generation | output_mismatch | 7/7 | (line 207) AssertionError: 'My f[18 chars] is 100% ketchup. I’m not a fan of mustard, relish' != 'My f[18 chars] is 100% mayonnaise. I’m not a fan of the fancy stuff with all'
mistral3 | multi | test_mistral3_integration_batched_generate | output_mismatch | 7/7 | (line 362) AssertionError: ' to write a short story based on this ima[70 chars]e pl' != 'Calm waters reflect\nWooden path to dista[26 chars]oods'
mistral3 | multi | test_mistral3_integration_batched_generate_multi_image | output_mismatch | 7/7 | (line 438) AssertionError: ' to write a short story based on this im[81 chars]ched' != "Calm waters reflect\nWooden path to dist[29 chars]hold"
mistral3 | multi | test_mistral3_integration_generate | output_mismatch | 7/7 | (line 309) AssertionError: 'The [14 chars] two tabby cats lying on a pink surface, which[21 chars]h or' != 'The [14 chars] two cats lying on a pink surface, which appea[21 chars] bed'
mistral3 | single | test_mistral3_integration_batched_generate | output_mismatch | 7/7 | (line 362) AssertionError: ' to write a short story based on this ima[70 chars]e pl' != 'Calm waters reflect\nWooden path to dista[26 chars]oods'
mistral3 | single | test_mistral3_integration_batched_generate_multi_image | output_mismatch | 7/7 | (line 438) AssertionError: ' to write a short story based on this im[81 chars]ched' != "Calm waters reflect\nWooden path to dist[29 chars]hold"
mistral3 | single | test_mistral3_integration_generate | output_mismatch | 7/7 | (line 309) AssertionError: 'The [14 chars] two tabby cats lying on a pink surface, which[21 chars]h or' != 'The [14 chars] two cats lying on a pink surface, which appea[21 chars] bed'
mixtral | multi | test_small_model_logits | output_mismatch | 7/7 | (line 143) AssertionError: Tensor-likes are not close!
mixtral | multi | test_small_model_logits_batched | output_mismatch | 7/7 | (line 188) AssertionError: Tensor-likes are not close!
mixtral | single | test_small_model_logits | output_mismatch | 7/7 | (line 143) AssertionError: Tensor-likes are not close!
mixtral | single | test_small_model_logits_batched | output_mismatch | 7/7 | (line 188) AssertionError: Tensor-likes are not close!
mllama | multi | test_11b_model_integration_batched_generate | output_mismatch | 7/7 | (line 643) AssertionError: 'If I[43 chars]d be: "I\'m not a fan of long exposure, but I\[21 chars]".\\' != 'If I[43 chars]d be:.\\nA dock in the lake.\\nA mountain in t[27 chars]ure.'
mllama | multi | test_11b_model_integration_forward | output_mismatch | 7/7 | (line 687) AssertionError: False is not true : Actual logits: tensor([ 6.5938, 4.4062, 3.0938, -0.3105, 1.8906], dtype=torch.bfloat16)
mllama | multi | test_11b_model_integration_generate | output_mismatch | 7/7 | (line 510) AssertionError: 'If I[43 chars]d be: "I\'m not a fan of long exposure, but I\[21 chars]".\\' != 'If I[43 chars]d be:.\\nA dock in the lake.\\nA mountain in t[27 chars]ure.'
mllama | multi | test_11b_model_integration_multi_image_generate | output_mismatch | 7/7 | (line 724) AssertionError: 'The image shows a red octagonal stop sign w[59 chars]to a' != 'This image shows a long wooden dock extendi[67 chars]ling'
mllama | single | test_11b_model_integration_batched_generate | output_mismatch | 7/7 | (line 643) AssertionError: 'If I[43 chars]d be: "I\'m not a fan of long exposure, but I\[21 chars]".\\' != 'If I[43 chars]d be:.\\nA dock in the lake.\\nA mountain in t[27 chars]ure.'
mllama | single | test_11b_model_integration_forward | output_mismatch | 7/7 | (line 687) AssertionError: False is not true : Actual logits: tensor([ 6.5938, 4.4062, 3.0938, -0.3105, 1.8906], dtype=torch.bfloat16)
mllama | single | test_11b_model_integration_generate | output_mismatch | 7/7 | (line 510) AssertionError: 'If I[43 chars]d be: "I\'m not a fan of long exposure, but I\[21 chars]".\\' != 'If I[43 chars]d be:.\\nA dock in the lake.\\nA mountain in t[27 chars]ure.'
mllama | single | test_11b_model_integration_multi_image_generate | output_mismatch | 7/7 | (line 724) AssertionError: 'The image shows a red octagonal stop sign w[59 chars]to a' != 'This image shows a long wooden dock extendi[67 chars]ling'
mluke | multi | test_entity_classification_no_padding_or_truncation | output_mismatch | 7/7 | (line 453) AssertionError: '<s> Japanese is an<s> East Asian language<s> spoken by about[40 chars]</s>' != '<s> Japanese is an<ent>East Asian language<ent>spoken by abo[42 chars]</s>'
mluke | multi | test_entity_pair_classification_no_padding_or_truncation | output_mismatch | 7/7 | (line 507) AssertionError: '<s><s> Japanese<s> is an East Asian language [64 chars]</s>' != '<s><ent>Japanese<ent>is an East Asian languag[68 chars]</s>'
mluke | multi | test_entity_span_classification_no_padding_or_truncation | output_mismatch | 7/7 | (line 572) AssertionError: '<s> [33 chars]e spoken by about 128 million people, primarily in Japan .</s>' != '<s> [33 chars]e spoken by about 128 million people, primarily in Japan.</s>'
mluke | single | test_entity_classification_no_padding_or_truncation | output_mismatch | 7/7 | (line 453) AssertionError: '<s> Japanese is an<s> East Asian language<s> spoken by about[40 chars]</s>' != '<s> Japanese is an<ent>East Asian language<ent>spoken by abo[42 chars]</s>'
mluke | single | test_entity_pair_classification_no_padding_or_truncation | output_mismatch | 7/7 | (line 507) AssertionError: '<s><s> Japanese<s> is an East Asian language [64 chars]</s>' != '<s><ent>Japanese<ent>is an East Asian languag[68 chars]</s>'
mluke | single | test_entity_span_classification_no_padding_or_truncation | output_mismatch | 7/7 | (line 572) AssertionError: '<s> [33 chars]e spoken by about 128 million people, primarily in Japan .</s>' != '<s> [33 chars]e spoken by about 128 million people, primarily in Japan.</s>'
mm_grounding_dino | multi | test_inference_object_detection_head | output_mismatch | 7/7 | (line 672) AssertionError: Tensor-likes are not close!
mm_grounding_dino | multi | test_inference_object_detection_head_equivalence_cpu_gpu | output_mismatch | 7/7 | (line 738) AssertionError: Tensor-likes are not close!
mm_grounding_dino | multi | test_mm_grounding_dino_loss | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
mm_grounding_dino | single | test_inference_object_detection_head | output_mismatch | 7/7 | (line 672) AssertionError: Tensor-likes are not close!
mm_grounding_dino | single | test_inference_object_detection_head_equivalence_cpu_gpu | output_mismatch | 7/7 | (line 738) AssertionError: Tensor-likes are not close!
mm_grounding_dino | single | test_mm_grounding_dino_loss | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
moonshine_streaming | multi | test_medium_logits_batch | output_mismatch | 7/7 | (line 605) AssertionError: Tensor-likes are not close!
moonshine_streaming | multi | test_small_logits_batch | output_mismatch | 7/7 | (line 572) AssertionError: Tensor-likes are not close!
moonshine_streaming | single | test_medium_logits_batch | output_mismatch | 7/7 | (line 605) AssertionError: Tensor-likes are not close!
moonshine_streaming | single | test_small_logits_batch | output_mismatch | 7/7 | (line 572) AssertionError: Tensor-likes are not close!
moshi | multi | test_moshika_greedy_unconditional_fp16 | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
moshi | multi | test_moshiko_greedy_unconditional_fp16 | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
moshi | multi | test_moshiko_greedy_unconditional_fp16_eager | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
moshi | single | test_moshika_greedy_unconditional_fp16 | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
moshi | single | test_moshiko_greedy_unconditional_fp16 | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
moshi | single | test_moshiko_greedy_unconditional_fp16_eager | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
moshi | single | test_moshiko_greedy_unconditional_fp32 | output_mismatch | 7/7 | (line 687) AssertionError: False is not true
musicgen | multi | test_generate_text_prompt_sampling | output_mismatch | 7/7 | (line 1262) AssertionError: Tensor-likes are not close!
musicgen | multi | test_generate_unconditional_sampling | output_mismatch | 7/7 | (line 1179) AssertionError: Tensor-likes are not close!
musicgen | single | test_generate_text_prompt_sampling | output_mismatch | 7/7 | (line 1262) AssertionError: Tensor-likes are not close!
musicgen | single | test_generate_unconditional_sampling | output_mismatch | 7/7 | (line 1179) AssertionError: Tensor-likes are not close!
musicgen_melody | multi | test_generate_text_audio_prompt | output_mismatch | 6/7 | (line 1307) AssertionError: Tensor-likes are not close!
musicgen_melody | multi | test_generate_text_prompt_greedy | output_mismatch | 6/7 | (line 1219) AssertionError: Tensor-likes are not close!
musicgen_melody | multi | test_generate_text_prompt_greedy_with_classifier_free_guidance | output_mismatch | 6/7 | (line 1247) AssertionError: Tensor-likes are not close!
musicgen_melody | multi | test_generate_text_prompt_sampling | output_mismatch | 6/7 | (line 1282) AssertionError: Tensor-likes are not close!
musicgen_melody | multi | test_generate_unconditional_greedy | output_mismatch | 6/7 | (line 1167) AssertionError: Tensor-likes are not close!
musicgen_melody | multi | test_generate_unconditional_sampling | output_mismatch | 6/7 | (line 1192) AssertionError: Tensor-likes are not close!
musicgen_melody | multi | test_generate_text_audio_prompt | output_mismatch | 6/7 | (line 1376) AssertionError: Tensor-likes are not close!
musicgen_melody | multi | test_generate_unconditional_greedy | output_mismatch | 6/7 | (line 1344) AssertionError: Tensor-likes are not close!
musicgen_melody | single | test_generate_text_audio_prompt | output_mismatch | 7/7 | (line 1307) AssertionError: Tensor-likes are not close!
musicgen_melody | single | test_generate_text_prompt_greedy | output_mismatch | 7/7 | (line 1219) AssertionError: Tensor-likes are not close!
musicgen_melody | single | test_generate_text_prompt_greedy_with_classifier_free_guidance | output_mismatch | 7/7 | (line 1247) AssertionError: Tensor-likes are not close!
musicgen_melody | single | test_generate_text_prompt_sampling | output_mismatch | 7/7 | (line 1282) AssertionError: Tensor-likes are not close!
musicgen_melody | single | test_generate_unconditional_greedy | output_mismatch | 7/7 | (line 1167) AssertionError: Tensor-likes are not close!
musicgen_melody | single | test_generate_unconditional_sampling | output_mismatch | 7/7 | (line 1192) AssertionError: Tensor-likes are not close!
musicgen_melody | single | test_generate_text_audio_prompt | output_mismatch | 7/7 | (line 1376) AssertionError: Tensor-likes are not close!
musicgen_melody | single | test_generate_unconditional_greedy | output_mismatch | 7/7 | (line 1344) AssertionError: Tensor-likes are not close!
nemotron | multi | test_nemotron_8b_generation_eager | output_mismatch | 6/7 | (line 103) AssertionError: Lists differ: ['Wha[46 chars]er: Jupiter\n\nWhat is the answer'] != ['Wha[46 chars]er: Jupiter\n\nWhat is the answer: What is the name of the 19']
nemotron | single | test_nemotron_8b_generation_eager | output_mismatch | 7/7 | (line 103) AssertionError: Lists differ: ['Wha[46 chars]er: Jupiter\n\nWhat is the answer'] != ['Wha[46 chars]er: Jupiter\n\nWhat is the answer: What is the name of the 19']
nllb_moe | multi | test_inference_logits | output_mismatch | 7/7 | (line 399) AssertionError: Tensor-likes are not close!
nllb_moe | single | test_inference_logits | output_mismatch | 7/7 | (line 399) AssertionError: Tensor-likes are not close!
olmo | multi | test_export_static_cache | output_mismatch | 6/7 | (line 338) AssertionError: Lists differ: ['Sim[41 chars]that \nthe speed of light is the same in all r[35 chars]ght'] != ['Sim[41 chars]that .1.\nThe theory of relativity states tha[18 chars] of']
olmo | multi | test_model_7b_greedy_generation | output_mismatch | 6/7 | (line 242) AssertionError: 'Simp[40 chars]that \nthe speed of light is the same for all [232 chars]\n\n' != 'Simp[40 chars]that .1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1[20 chars].1.1'
olmo | single | test_export_static_cache | output_mismatch | 6/7 | (line 338) AssertionError: Lists differ: ['Sim[41 chars]that \nthe speed of light is the same in all r[35 chars]ght'] != ['Sim[41 chars]that .1.\nThe theory of relativity states tha[18 chars] of']
olmo | single | test_model_7b_greedy_generation | output_mismatch | 6/7 | (line 242) AssertionError: 'Simp[40 chars]that \nthe speed of light is the same for all [232 chars]\n\n' != 'Simp[40 chars]that .1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1.1[20 chars].1.1'
olmo2 | multi | test_model_1b_logits_bfloat16 | output_mismatch | 7/7 | (line 214) AssertionError: Tensor-likes are not close!
olmo2 | single | test_model_1b_logits_bfloat16 | output_mismatch | 7/7 | (line 214) AssertionError: Tensor-likes are not close!
olmo3 | multi | test_model_7b_logits | output_mismatch | 7/7 | (line 196) AssertionError: Tensor-likes are not close!
olmo3 | single | test_model_7b_logits | output_mismatch | 7/7 | (line 196) AssertionError: Tensor-likes are not close!
olmoe | multi | test_model_7b_logits | output_mismatch | 7/7 | (line 217) AssertionError: Tensor-likes are not close!
olmoe | single | test_model_7b_logits | output_mismatch | 7/7 | (line 217) AssertionError: Tensor-likes are not close!
oneformer | multi | test_inference_no_head | output_mismatch | 6/7 | (line 507) AssertionError: Tensor-likes are not close!
oneformer | multi | test_inference_universal_segmentation_head | output_mismatch | 6/7 | (line 549) AssertionError: Tensor-likes are not close!
oneformer | single | test_inference_no_head | output_mismatch | 7/7 | (line 507) AssertionError: Tensor-likes are not close!
oneformer | single | test_inference_universal_segmentation_head | output_mismatch | 7/7 | (line 549) AssertionError: Tensor-likes are not close!
opt | multi | test_inference_no_head | output_mismatch | 7/7 | (line 357) AssertionError: tensor([[-0.2883, -1.9219, -0.3079],
opt | single | test_inference_no_head | output_mismatch | 7/7 | (line 357) AssertionError: tensor([[-0.2883, -1.9219, -0.3079],
ovis2 | multi | test_small_model_integration_test_batch_different_resolutions | output_mismatch | 7/7 | (line 355) AssertionError: Lists differ: ['sys[81 chars]ant\n', 'system\nYou are a helpful assistant.\[139 chars]et.'] != ['sys[81 chars]ant\nAnswer: I see a brown dog standing on a w[224 chars]et.']
ovis2 | single | test_small_model_integration_test_batch_different_resolutions | output_mismatch | 7/7 | (line 355) AssertionError: Lists differ: ['sys[81 chars]ant\n', 'system\nYou are a helpful assistant.\[139 chars]et.'] != ['sys[81 chars]ant\nAnswer: I see a brown dog standing on a w[224 chars]et.']
owlvit | multi | test_inference_interpolate_pos_encoding | output_mismatch | 7/7 | (line 683) AssertionError: Tensor-likes are not close!
owlvit | multi | test_inference_object_detection | output_mismatch | 7/7 | (line 800) AssertionError: Tensor-likes are not close!
owlvit | multi | test_inference_one_shot_object_detection | output_mismatch | 7/7 | (line 843) AssertionError: Tensor-likes are not close!
owlvit | single | test_inference_interpolate_pos_encoding | output_mismatch | 7/7 | (line 683) AssertionError: Tensor-likes are not close!
owlvit | single | test_inference_object_detection | output_mismatch | 7/7 | (line 800) AssertionError: Tensor-likes are not close!
owlvit | single | test_inference_one_shot_object_detection | output_mismatch | 7/7 | (line 843) AssertionError: Tensor-likes are not close!
persimmon | multi | test_model_8b_chat_greedy_generation | output_mismatch | 6/7 | (line 131) AssertionError: 'huma[58 chars]ept: The theory of relativity states that the [80 chars]ion.' != 'huma[58 chars]ept: the speed of light in a vacuum is the sam[33 chars]ence'
persimmon | single | test_model_8b_chat_greedy_generation | output_mismatch | 6/7 | (line 131) AssertionError: 'huma[58 chars]ept: The theory of relativity states that the [80 chars]ion.' != 'huma[58 chars]ept: the speed of light in a vacuum is the sam[33 chars]ence'
persimmon | single | test_model_8b_chat_logits | output_mismatch | 6/7 | (line 99) AssertionError: Tensor-likes are not close!
pixio | multi | test_inference_no_head | output_mismatch | 7/7 | (line 277) AssertionError: Tensor-likes are not close!
pixio | single | test_inference_no_head | output_mismatch | 7/7 | (line 277) AssertionError: Tensor-likes are not close!
plbart | multi | test_fill_mask | output_mismatch | 7/7 | (line 444) AssertionError: '0 0 the 0 the 0 the 0 the 0 the 0 the 0 the 0 the 0' != '0 0 the 0 the 0 the 0 the 0 the 0 the 0 the 0 the'
plbart | multi | test_java_cs_generate_batch | output_mismatch | 7/7 | (line 379) AssertionError: assert ['public int ...turn a * b *'] == ['public int ...rn a * b * c']
plbart | multi | test_java_cs_generate_one | output_mismatch | 7/7 | (line 370) AssertionError: 'public int maximum(int a, int b, int c){return Math.Max(' != 'public int maximum(int a, int b, int c){return Math.Max(a'
plbart | single | test_fill_mask | output_mismatch | 7/7 | (line 444) AssertionError: '0 0 the 0 the 0 the 0 the 0 the 0 the 0 the 0 the 0' != '0 0 the 0 the 0 the 0 the 0 the 0 the 0 the 0 the'
plbart | single | test_java_cs_generate_batch | output_mismatch | 7/7 | (line 379) AssertionError: assert ['public int ...turn a * b *'] == ['public int ...rn a * b * c']
plbart | single | test_java_cs_generate_one | output_mismatch | 7/7 | (line 370) AssertionError: 'public int maximum(int a, int b, int c){return Math.Max(' != 'public int maximum(int a, int b, int c){return Math.Max(a'
pvt | multi | test_inference_image_classification | output_mismatch | 7/7 | (line 257) AssertionError: Tensor-likes are not close!
pvt | multi | test_inference_model | output_mismatch | 7/7 | (line 284) AssertionError: Tensor-likes are not close!
pvt | single | test_inference_image_classification | output_mismatch | 7/7 | (line 257) AssertionError: Tensor-likes are not close!
pvt | single | test_inference_model | output_mismatch | 7/7 | (line 284) AssertionError: Tensor-likes are not close!
pvt_v2 | multi | test_inference_image_classification | output_mismatch | 7/7 | (line 275) AssertionError: Tensor-likes are not close!
pvt_v2 | single | test_inference_image_classification | output_mismatch | 7/7 | (line 275) AssertionError: Tensor-likes are not close!
qwen2_5_omni | multi | test_small_model_integration_test | output_mismatch | 7/7 | (line 692) AssertionError: "syst[108 chars]d is glass shattering, and the dog is a Labrador Retriever." != "syst[108 chars]d is a glass shattering. The dog in the pictur[22 chars]ver."
qwen2_5_omni | multi | test_small_model_integration_test_batch | output_mismatch | 7/7 | (line 734) AssertionError: Lists differ: ["sys[109 chars]d is glass shattering, and the dog is a Labrad[185 chars]er."] != ["sys[109 chars]d is a glass shattering. The dog in the pictur[211 chars]er."]
qwen2_5_omni | single | test_small_model_integration_test | output_mismatch | 7/7 | (line 692) AssertionError: "syst[108 chars]d is glass shattering, and the dog is a Labrador Retriever." != "syst[108 chars]d is a glass shattering. The dog in the pictur[22 chars]ver."
qwen2_5_omni | single | test_small_model_integration_test_batch | output_mismatch | 7/7 | (line 734) AssertionError: Lists differ: ["sys[109 chars]d is glass shattering, and the dog is a Labrad[185 chars]er."] != ["sys[109 chars]d is a glass shattering. The dog in the pictur[211 chars]er."]
qwen2_5_vl | multi | test_small_model_integration_test_batch_wo_image | output_mismatch | 7/7 | (line 611) AssertionError: Lists differ: ['sys[298 chars]en, a large language model created by Alibaba [84 chars]and'] != ['sys[298 chars]en, an AI language model created by Alibaba Cl[96 chars]on,']
qwen2_5_vl | single | test_small_model_integration_test_batch_wo_image | output_mismatch | 7/7 | (line 611) AssertionError: Lists differ: ['sys[298 chars]en, a large language model created by Alibaba [84 chars]and'] != ['sys[298 chars]en, an AI language model created by Alibaba Cl[96 chars]on,']
qwen2_moe | multi | test_model_a2_7b_logits | output_mismatch | 7/7 | (line 147) AssertionError: Tensor-likes are not close!
qwen2_moe | single | test_model_a2_7b_logits | output_mismatch | 7/7 | (line 147) AssertionError: Tensor-likes are not close!
qwen3 | multi | test_model_600m_logits | output_mismatch | 7/7 | (line 92) AssertionError: Tensor-likes are not close!
qwen3 | multi | test_speculative_generation | output_mismatch | 7/7 | (line 198) AssertionError: 'My f[22 chars]100% beef, 100% beef, 100% beef.' != 'My f[22 chars]100% vegetable oil. It has a rich, creamy text[19 chars]utty'
qwen3 | single | test_model_600m_logits | output_mismatch | 7/7 | (line 92) AssertionError: Tensor-likes are not close!
qwen3 | single | test_speculative_generation | output_mismatch | 7/7 | (line 198) AssertionError: 'My f[22 chars]100% beef, 100% beef, 100% beef.' != 'My f[22 chars]100% vegetable oil. It has a rich, creamy text[19 chars]utty'
qwen3_5 | multi | test_model_video_generation | output_mismatch | 7/7 | (line 845) AssertionError: Lists differ: [248045, 846, 198, 27, 15, 13, 18, 6283, 29, 248053] != [248045, 846, 198, 248053, 27, 15, 13, 18, 6283, 29]
qwen3_5 | multi | test_model_video_generation_batch | output_mismatch | 7/7 | (line 897) AssertionError: Lists differ: [248045, 846, 198, 27, 15, 13, 18, 6283, 29, 248053] != [248045, 846, 198, 248053, 27, 15, 13, 18, 6283, 29]
qwen3_5 | single | test_model_video_generation | output_mismatch | 7/7 | (line 845) AssertionError: Lists differ: [248045, 846, 198, 27, 15, 13, 18, 6283, 29, 248053] != [248045, 846, 198, 248053, 27, 15, 13, 18, 6283, 29]
qwen3_5 | single | test_model_video_generation_batch | output_mismatch | 7/7 | (line 897) AssertionError: Lists differ: [248045, 846, 198, 27, 15, 13, 18, 6283, 29, 248053] != [248045, 846, 198, 248053, 27, 15, 13, 18, 6283, 29]
qwen3_omni_moe | single | test_small_model_integration_test_batch | output_mismatch | 7/7 | (line 823) AssertionError: Lists differ: ["use[99 chars]ation, here is a breakdown of what you're hear[187 chars]n\n"] != ["use[99 chars]ation provided:\n\nThe sound you hear is the d[191 chars]hed"]
qwen3_omni_moe | single | test_small_model_integration_test_w_audio | output_mismatch | 7/7 | (line 911) AssertionError: 'syst[223 chars]derstand spoken content, and I can also make inferences about' != 'syst[223 chars]derstand spoken content, and I can also process and respond to'
qwen3_vl_moe | single | test_small_model_integration_test_batch | output_mismatch | 7/7 | (line 446) AssertionError: Lists differ: ["use[92 chars]'s a wild cat species native to the grasslands[182 chars]ons"] != ["use[92 chars]'s a small wild cat native to the grasslands a[178 chars]ons"]
rag | single | test_rag_sequence_generate_batch | output_mismatch | 7/7 | (line 948) AssertionError: Lists differ: [' michael gross', ' monday 17 , 2018', ' te[96 chars]ndo'] != [' albert einstein', ' june 22 , 2018', ' am[85 chars]' 8']
rag | single | test_rag_sequence_generate_batch_from_context_input_ids | output_mismatch | 7/7 | (line 1000) AssertionError: Lists differ: [' michael gross', ' monday 17 , 2018', ' te[96 chars]ndo'] != [' albert einstein', ' june 22 , 2018', ' am[85 chars]' 8']
rag | single | test_rag_sequence_generate_beam | output_mismatch | 7/7 | (line 892) AssertionError: '" in the United States. "People Need Love"[155 chars]hit.' != '"She\'s My Kind of Girl" was released thro[257 chars]nts.'
rag | single | test_rag_token_generate_beam | output_mismatch | 7/7 | (line 854) AssertionError: '"She[14 chars] Girl' != '"She[14 chars] Girl" was released through Epic Records in Ja[179 chars]ses"'
recurrent_gemma | multi | test_2b_generate | output_mismatch | 7/7 | (line 157) AssertionError: Lists differ: ['Hel[325 chars]oday the 19th of June 2019, I was in the offic[256 chars] to'] != ['Hel[325 chars]oday is a new app that allows you to make mone[256 chars]app']
recurrent_gemma | multi | test_2b_sample | output_mismatch | 7/7 | (line 195) AssertionError: Lists differ: ['Wha[24 chars]Deep Learning (or deep learning) is one of the[107 chars]ple'] != ['Wha[24 chars]Deep learning is the next frontier in computer[98 chars] is']
recurrent_gemma | multi | test_longer_than_window | output_mismatch | 7/7 | (line 243) AssertionError: Lists differ: [' Jean-Philippe Guillet said, "We have no[245 chars]eo.'] != [" Robin's comments follow claims by two m[249 chars]the"]
recurrent_gemma | multi | test_model_2b_8bit | output_mismatch | 7/7 | (line 222) AssertionError: Lists differ: ['Hel[26 chars] the effects of the environment on the human b[124 chars]aur"] != ['Hel[26 chars] the topic of "The impact of social media on t[102 chars] 3D"]
recurrent_gemma | single | test_2b_generate | output_mismatch | 7/7 | (line 157) AssertionError: Lists differ: ['Hel[325 chars]oday the 19th of June 2019, I was in the offic[256 chars] to'] != ['Hel[325 chars]oday is a new app that allows you to make mone[256 chars]app']
recurrent_gemma | single | test_2b_sample | output_mismatch | 7/7 | (line 195) AssertionError: Lists differ: ['Wha[24 chars]Deep Learning (or deep learning) is one of the[107 chars]ple'] != ['Wha[24 chars]Deep learning is the next frontier in computer[98 chars] is']
recurrent_gemma | single | test_longer_than_window | output_mismatch | 7/7 | (line 243) AssertionError: Lists differ: [' Jean-Philippe Guillet said, "We have no[245 chars]eo.'] != [" Robin's comments follow claims by two m[249 chars]the"]
recurrent_gemma | single | test_model_2b_8bit | output_mismatch | 7/7 | (line 222) AssertionError: Lists differ: ['Hel[26 chars] the effects of the environment on the human b[124 chars]aur"] != ['Hel[26 chars] the topic of "The impact of social media on t[102 chars] 3D"]
reformer | multi | test_pretrained_generate_crime_and_punish | output_mismatch | 7/7 | (line 1370) AssertionError: 'A fe[36 chars]is ideas, so attentively two or three thousand roubles, and' != 'A fe[36 chars]is ideas, at the first entrance. He was positively for an inst'
reformer | single | test_pretrained_generate_crime_and_punish | output_mismatch | 7/7 | (line 1370) AssertionError: 'A fe[36 chars]is ideas, so attentively two or three thousand roubles, and' != 'A fe[36 chars]is ideas, at the first entrance. He was positively for an inst'
regnet | multi | test_inference_image_classification_head | output_mismatch | 7/7 | (line 243) AssertionError: Tensor-likes are not close!
regnet | single | test_inference_image_classification_head | output_mismatch | 7/7 | (line 243) AssertionError: Tensor-likes are not close!
resnet | multi | test_inference_image_classification_head | output_mismatch | 7/7 | (line 291) AssertionError: Tensor-likes are not close!
resnet | single | test_inference_image_classification_head | output_mismatch | 7/7 | (line 291) AssertionError: Tensor-likes are not close!
seed_oss | multi | test_model_36b_eager | output_mismatch | 7/7 | (line 95) AssertionError: Lists differ: ['How[132 chars]ing to use the ByteDance-Seed dataset for my research. I have'] != ['How[132 chars]ing to run the code on the <beginning of the code>seed']
seed_oss | multi | test_model_36b_sdpa | output_mismatch | 7/7 | (line 114) AssertionError: Lists differ: ['How[132 chars]ing to use the ByteDance-Seed dataset for my research. I have'] != ['How[132 chars]ing to run the code on the <beginning of the code>seed']
seed_oss | single | test_model_36b_eager | output_mismatch | 7/7 | (line 95) AssertionError: Lists differ: ['How[132 chars]ing to use the ByteDance-Seed dataset for my research. I have'] != ['How[132 chars]ing to run the code on the <beginning of the code>seed']
seed_oss | single | test_model_36b_sdpa | output_mismatch | 7/7 | (line 114) AssertionError: Lists differ: ['How[132 chars]ing to use the ByteDance-Seed dataset for my research. I have'] != ['How[132 chars]ing to run the code on the <beginning of the code>seed']
smollm3 | multi | test_export_static_cache | output_mismatch | 7/7 | (line 198) AssertionError: 'Gravity is the force that pulls objects [69 chars] and' != ["Gravity is the force that pulls objects[85 chars] of"]
smollm3 | multi | test_model_3b_logits | output_mismatch | 7/7 | (line 89) AssertionError: Tensor-likes are not close!
smollm3 | single | test_export_static_cache | output_mismatch | 7/7 | (line 198) AssertionError: 'Gravity is the force that pulls objects [69 chars] and' != ["Gravity is the force that pulls objects[85 chars] of"]
smollm3 | single | test_model_3b_logits | output_mismatch | 7/7 | (line 89) AssertionError: Tensor-likes are not close!
stablelm | multi | test_model_stablelm_3b_4e1t_logits | output_mismatch | 7/7 | (line 65) AssertionError: Tensor-likes are not close!
stablelm | multi | test_model_tiny_random_stablelm_2_logits | output_mismatch | 7/7 | (line 98) AssertionError: Tensor-likes are not close!
stablelm | single | test_model_stablelm_3b_4e1t_logits | output_mismatch | 7/7 | (line 65) AssertionError: Tensor-likes are not close!
stablelm | single | test_model_tiny_random_stablelm_2_logits | output_mismatch | 7/7 | (line 98) AssertionError: Tensor-likes are not close!
starcoder2 | multi | test_starcoder2_batched_generation_4bit | output_mismatch | 7/7 | (line 152) AssertionError: Lists differ: ['Hel[188 chars]of', 'def hello_world():\n\treturn "Hello Worl[95 chars]ute'] != ['Hel[188 chars]of', "def hello_world(): hello_world():\n r[117 chars]'})"]
starcoder2 | multi | test_starcoder2_batched_generation_eager | output_mismatch | 7/7 | (line 99) AssertionError: Lists differ: ['Hel[223 chars]ld():\n\treturn 'Hello World!'\n\n@app.route('[72 chars]app"] != ['Hel[223 chars]ld(): hello_world():\n return 'Hello World![87 chars]n\n"]
starcoder2 | multi | test_starcoder2_batched_generation_sdpa | output_mismatch | 7/7 | (line 79) AssertionError: Lists differ: ['Hel[223 chars]ld():\n\treturn 'Hello World!'\n\n@app.route('[72 chars]app"] != ['Hel[223 chars]ld(): hello_world():\n return 'Hello World![87 chars]n\n"]
starcoder2 | single | test_starcoder2_batched_generation_4bit | output_mismatch | 7/7 | (line 152) AssertionError: Lists differ: ['Hel[188 chars]of', 'def hello_world():\n\treturn "Hello Worl[95 chars]ute'] != ['Hel[188 chars]of', "def hello_world(): hello_world():\n r[117 chars]'})"]
starcoder2 | single | test_starcoder2_batched_generation_eager | output_mismatch | 7/7 | (line 99) AssertionError: Lists differ: ['Hel[223 chars]ld():\n\treturn 'Hello World!'\n\n@app.route('[72 chars]app"] != ['Hel[223 chars]ld(): hello_world():\n return 'Hello World![87 chars]n\n"]
starcoder2 | single | test_starcoder2_batched_generation_sdpa | output_mismatch | 7/7 | (line 79) AssertionError: Lists differ: ['Hel[223 chars]ld():\n\treturn 'Hello World!'\n\n@app.route('[72 chars]app"] != ['Hel[223 chars]ld(): hello_world():\n return 'Hello World![87 chars]n\n"]
swiftformer | multi | test_inference_image_classification_head | output_mismatch | 7/7 | (line 263) AssertionError: Tensor-likes are not close!
swiftformer | single | test_inference_image_classification_head | output_mismatch | 7/7 | (line 263) AssertionError: Tensor-likes are not close!
swin2sr | multi | test_inference_fp16 | output_mismatch | 7/7 | (line 332) AssertionError: Tensor-likes are not close!
swin2sr | single | test_inference_fp16 | output_mismatch | 7/7 | (line 332) AssertionError: Tensor-likes are not close!
swinv2 | multi | test_inference_fp16 | output_mismatch | 7/7 | (line 492) AssertionError: Tensor-likes are not close!
swinv2 | single | test_inference_fp16 | output_mismatch | 7/7 | (line 492) AssertionError: Tensor-likes are not close!
t5gemma2 | multi | test_model_generation_batch_270m | output_mismatch | 7/7 | (line 1128) AssertionError: Lists differ: [' a [83 chars]e UK.\n\nThe bumblebee is a species of bee tha[15 chars]the'] != [' a [83 chars]e UK.']
t5gemma2 | single | test_model_generation_batch_270m | output_mismatch | 7/7 | (line 1128) AssertionError: Lists differ: [' a [83 chars]e UK.\n\nThe bumblebee is a species of bee tha[15 chars]the'] != [' a [83 chars]e UK.']
table_transformer | multi | test_table_detection | output_mismatch | 7/7 | (line 554) AssertionError: Tensor-likes are not close!
table_transformersingletest_table_detectionoutput_mismatch7/7(line 554) AssertionError: Tensor-likes are not close!
univnetmultitest_integrationoutput_mismatch7/7(line 330) AssertionError: Scalars are not close!
univnetsingletest_integrationoutput_mismatch7/7(line 330) AssertionError: Scalars are not close!
utilsmultitest_cache_copyoutput_mismatch7/7(line 436) AssertionError: Lists differ: ['You are a helpful assistant. Help me to [390 chars] is'] != ["You are a helpful assistant. Help me to [385 chars] is']
utilsmultitest_dynamic_cache_hardoutput_mismatch7/7(line 319) AssertionError: "Here[57 chars]ave fur, they have four legs, they have a tail[1045 chars]have" != "Here[57 chars]ave four legs, they have a tail, they have a f[1078 chars]They"
utilssingletest_cache_copyoutput_mismatch7/7(line 436) AssertionError: Lists differ: ['You are a helpful assistant. Help me to [390 chars] is'] != ["You are a helpful assistant. Help me to [385 chars] is']
utilssingletest_dynamic_cache_hardoutput_mismatch7/7(line 319) AssertionError: "Here[57 chars]ave fur, they have four legs, they have a tail[1045 chars]have" != "Here[57 chars]ave four legs, they have a tail, they have a f[1078 chars]They"
video_llavamultitest_small_model_integration_test_llamaoutput_mismatch7/7(line 491) AssertionError: 'USER: \nDescribe the video in details. A[572 chars]ion.' != "USER: \nDescribe the video in details. A[675 chars]ing."
video_llavamultitest_small_model_integration_test_mixed_inputsoutput_mismatch7/7(line 464) AssertionError: Lists differ: ['USE[183 chars]se it shows a baby sitting on a bed and reading a book. The'] != ['USE[183 chars]se it shows a baby sitting on a bed and reading a book, which']
video_llavasingletest_small_model_integration_test_llamaoutput_mismatch7/7(line 491) AssertionError: 'USER: \nDescribe the video in details. A[572 chars]ion.' != "USER: \nDescribe the video in details. A[675 chars]ing."
video_llavasingletest_small_model_integration_test_mixed_inputsoutput_mismatch7/7(line 464) AssertionError: Lists differ: ['USE[183 chars]se it shows a baby sitting on a bed and reading a book. The'] != ['USE[183 chars]se it shows a baby sitting on a bed and reading a book, which']
videomaemultitest_inference_for_pretrainingoutput_mismatch7/7(line 478) AssertionError: Tensor-likes are not close!
videomaemultitest_inference_for_video_classificationoutput_mismatch7/7(line 453) AssertionError: Tensor-likes are not close!
videomaesingletest_inference_for_pretrainingoutput_mismatch7/7(line 478) AssertionError: Tensor-likes are not close!
videomaesingletest_inference_for_video_classificationoutput_mismatch7/7(line 453) AssertionError: Tensor-likes are not close!
viltmultitest_inference_masked_lmoutput_mismatch7/7(line 575) AssertionError: Tensor-likes are not close!
viltsingletest_inference_masked_lmoutput_mismatch7/7(line 575) AssertionError: Tensor-likes are not close!
vision_encoder_decodermultitest_inference_cordv2output_mismatch7/7(line 1352) AssertionError: Tensor-likes are not close!
vision_encoder_decodermultitest_inference_docvqaoutput_mismatch7/7(line 1288) AssertionError: Tensor-likes are not close!
vision_encoder_decodermultitest_inference_rvlcdipoutput_mismatch7/7(line 1414) AssertionError: Tensor-likes are not close!
vision_encoder_decodersingletest_inference_cordv2output_mismatch7/7(line 1352) AssertionError: Tensor-likes are not close!
vision_encoder_decodersingletest_inference_docvqaoutput_mismatch7/7(line 1288) AssertionError: Tensor-likes are not close!
vision_encoder_decodersingletest_inference_rvlcdipoutput_mismatch7/7(line 1414) AssertionError: Tensor-likes are not close!
vitsmultitest_forward_fp16output_mismatch7/7(line 433) AssertionError: Tensor-likes are not close!
vitssingletest_forward_fp16output_mismatch7/7(line 433) AssertionError: Tensor-likes are not close!
vivitmultitest_inference_for_video_classificationoutput_mismatch7/7(line 361) AssertionError: Tensor-likes are not close!
vivitsingletest_inference_for_video_classificationoutput_mismatch7/7(line 361) AssertionError: Tensor-likes are not close!
voxtralmultitest_mini_multi_turn_text_and_audiooutput_mismatch7/7(line 381) AssertionError: Lists differ: ['Des[790 chars]as a farewell address by a president, reflecti[151 chars]xt.'] != ['Des[790 chars]as a political speech by a president, reflecti[151 chars]xt.']
voxtralmultitest_mini_single_turn_audio_onlyoutput_mismatch7/7(line 163) AssertionError: Lists differ: ['The[442 chars]king what A\'s tattoo says, and A always respo[777 chars]nt.'] != ['The[442 chars]king A what his tattoo says, and A always resp[884 chars]on.']
voxtralmultitest_mini_single_turn_text_and_audiooutput_mismatch7/7(line 203) AssertionError: Lists differ: ["Wha[241 chars]. He expresses gratitude for the conversations[429 chars]en."] != ["Wha[241 chars]. He acknowledges the diverse perspectives and[412 chars]es."]
voxtralmultitest_mini_single_turn_text_and_multiple_audios_batchedoutput_mismatch7/7(line 327) AssertionError: Lists differ: ["Who[609 chars]m is likely the Seattle Mariners, as the comme[446 chars]me.'] != ["Who[609 chars]m is the Mariners, and the commentator is exci[414 chars]nt.']
voxtralsingletest_mini_multi_turn_text_and_audiooutput_mismatch7/7(line 381) AssertionError: Lists differ: ['Des[790 chars]as a farewell address by a president, reflecti[151 chars]xt.'] != ['Des[790 chars]as a political speech by a president, reflecti[151 chars]xt.']
voxtralsingletest_mini_single_turn_audio_onlyoutput_mismatch7/7(line 163) AssertionError: Lists differ: ['The[442 chars]king what A\'s tattoo says, and A always respo[777 chars]nt.'] != ['The[442 chars]king A what his tattoo says, and A always resp[884 chars]on.']
voxtralsingletest_mini_single_turn_text_and_audiooutput_mismatch7/7(line 203) AssertionError: Lists differ: ["Wha[241 chars]. He expresses gratitude for the conversations[429 chars]en."] != ["Wha[241 chars]. He acknowledges the diverse perspectives and[412 chars]es."]
voxtralsingletest_mini_single_turn_text_and_multiple_audios_batchedoutput_mismatch7/7(line 327) AssertionError: Lists differ: ["Who[609 chars]m is likely the Seattle Mariners, as the comme[446 chars]me.'] != ["Who[609 chars]m is the Mariners, and the commentator is exci[414 chars]nt.']
voxtral_realtimemultitest_batched_longformoutput_mismatch7/7(line 349) AssertionError: Lists differ: [' Come on! Dude. You got a tattoo. So did you, dud[1097 chars]the"] != [' Come on. Dude. You got a tattoo. So did you, dud[1097 chars]the"]
voxtral_realtimesingletest_batched_longformoutput_mismatch7/7(line 349) AssertionError: Lists differ: [' Come on! Dude. You got a tattoo. So did you, dud[1097 chars]the"] != [' Come on. Dude. You got a tattoo. So did you, dud[1097 chars]the"]
whispermultitest_small_token_timestamp_generationoutput_mismatch7/7(line 2023) AssertionError: Tensor-likes are not close!
whispermultitest_speculative_decoding_non_distiloutput_mismatch7/7(line 2390) AssertionError: Lists differ: [' Mr[35 chars]dle classes and we are glad to welcome his gospel. Thank you.'] != [' Mr[35 chars]dle classes and we are glad to welcome his gospel.']
whispermultitest_tiny_en_batched_generationoutput_mismatch7/7(line 1541) AssertionError: The values for attribute 'shape' do not match: torch.Size([4, 18]) != torch.Size([4, 20]).
whispermultitest_tiny_en_generationoutput_mismatch7/7(line 1383) AssertionError: ' Mr.[15 chars] apostle of the middle classes, and we are glad to' != ' Mr.[15 chars] apostle of the middle classes, and we are glad to welcome his'
whispermultitest_tiny_generationoutput_mismatch7/7(line 1399) AssertionError: ' Mr.[21 chars]le of the middle classes and we are glad' != ' Mr.[21 chars]le of the middle classes and we are glad to welcome his gospel'
whispermultitest_tiny_specaugment_librispeechoutput_mismatch7/7(line 2137) AssertionError: Tensor-likes are not close!
whispermultitest_whisper_longform_multi_batch_hardoutput_mismatch7/7(line 2787) AssertionError: Lists differ: [" Fo[272 chars]ting of classics, Sicilian, nade door variatio[8147 chars]le!'] != [" Fo[272 chars]ting a classic Sicilian, nade door variation o[8150 chars]le!']
whispermultitest_whisper_longform_multi_batch_hard_prev_condoutput_mismatch7/7(line 2841) AssertionError: Lists differ: [" Fo[425 chars]a fischer shows in lip nitskey attack the fisc[5579 chars]y ."] != [" Fo[425 chars]a fisher shows in lip-nitsky attack that culmi[7900 chars]le!"]
whispermultitest_whisper_longform_no_speech_detectionoutput_mismatch7/7(line 2947) AssertionError: Lists differ: [" Fo[435 chars]sting And so so so so so so so so so so so so [7329 chars]our"] != [" Fo[435 chars]sting", ' Ladies and gentlemen, you know, I sp[1433 chars]es."]
whispermultitest_whisper_shortform_single_batch_prev_condoutput_mismatch7/7(line 2556) AssertionError: Lists differ: [" Fo[268 chars]ating, so soft, it would make JD power and her[196 chars]ke."] != [" Fo[268 chars]ating so soft, it would make JD power and her [195 chars]ke."]
whispersingletest_small_token_timestamp_generationoutput_mismatch7/7(line 2023) AssertionError: Tensor-likes are not close!
whispersingletest_speculative_decoding_non_distiloutput_mismatch7/7(line 2390) AssertionError: Lists differ: [' Mr[35 chars]dle classes and we are glad to welcome his gospel. Thank you.'] != [' Mr[35 chars]dle classes and we are glad to welcome his gospel.']
whispersingletest_tiny_en_batched_generationoutput_mismatch7/7(line 1541) AssertionError: The values for attribute 'shape' do not match: torch.Size([4, 18]) != torch.Size([4, 20]).
whispersingletest_tiny_en_generationoutput_mismatch7/7(line 1383) AssertionError: ' Mr.[15 chars] apostle of the middle classes, and we are glad to' != ' Mr.[15 chars] apostle of the middle classes, and we are glad to welcome his'
whispersingletest_tiny_generationoutput_mismatch7/7(line 1399) AssertionError: ' Mr.[21 chars]le of the middle classes and we are glad' != ' Mr.[21 chars]le of the middle classes and we are glad to welcome his gospel'
whispersingletest_tiny_specaugment_librispeechoutput_mismatch7/7(line 2137) AssertionError: Tensor-likes are not close!
whispersingletest_whisper_longform_multi_batch_hardoutput_mismatch7/7(line 2787) AssertionError: Lists differ: [" Fo[272 chars]ting of classics, Sicilian, nade door variatio[8147 chars]le!'] != [" Fo[272 chars]ting a classic Sicilian, nade door variation o[8150 chars]le!']
whispersingletest_whisper_longform_multi_batch_hard_prev_condoutput_mismatch7/7(line 2841) AssertionError: Lists differ: [" Fo[425 chars]a fischer shows in lip nitskey attack the fisc[5579 chars]y ."] != [" Fo[425 chars]a fisher shows in lip-nitsky attack that culmi[7900 chars]le!"]
whispersingletest_whisper_longform_no_speech_detectionoutput_mismatch7/7(line 2947) AssertionError: Lists differ: [" Fo[435 chars]sting And so so so so so so so so so so so so [7329 chars]our"] != [" Fo[435 chars]sting", ' Ladies and gentlemen, you know, I sp[1433 chars]es."]
whispersingletest_whisper_shortform_single_batch_prev_condoutput_mismatch7/7(line 2556) AssertionError: Lists differ: [" Fo[268 chars]ating, so soft, it would make JD power and her[196 chars]ke."] != [" Fo[268 chars]ating so soft, it would make JD power and her [195 chars]ke."]
zambamultitest_simple_batched_generate_with_paddingoutput_mismatch7/7(line 476) AssertionError: '<s> [20 chars]g on this lovely evening? I hope you are having a great day. I' != '<s> [20 chars]g on this lovely evening? I hope you are all doing well. I am'
zambamultitest_simple_generateoutput_mismatch7/7(line 463) AssertionError: The values for attribute 'dtype' do not match: torch.bfloat16 != torch.float32.
zambasingletest_simple_batched_generate_with_paddingoutput_mismatch7/7(line 476) AssertionError: '<s> [20 chars]g on this lovely evening? I hope you are having a great day. I' != '<s> [20 chars]g on this lovely evening? I hope you are all doing well. I am'
zambasingletest_simple_generateoutput_mismatch7/7(line 463) AssertionError: The values for attribute 'dtype' do not match: torch.bfloat16 != torch.float32.
zamba2multitest_simple_batched_generate_with_padding_0_cudaoutput_mismatch7/7(line 600) AssertionError: Tensor-likes are not close!
zamba2singletest_simple_batched_generate_with_padding_0_cudaoutput_mismatch7/7(line 600) AssertionError: Tensor-likes are not close!

Unpinned failure modes

| mode | count |
| --- | --- |
| output_mismatch | 487 |
| other | 153 |
| OOM | 57 |
| import_or_config | 22 |
| load_error | 10 |
| cuda_runtime | 2 |
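
The unpinned counts above sum to 731, not the 733 total given in the report header. The gap appears to be the two deepseek_v4 load_error tests listed under "Flaky (CI flagged)", which would also explain why the header shows load_error 12 while this table shows 10. A minimal Python sketch of that reconciliation, with counts copied from this report; the variable names are hypothetical and the grouping into unpinned, flaky and pinned is an assumption about how the dashboard partitions failures:

```python
from collections import Counter

# Counts copied from the "Unpinned failure modes" table in this report.
unpinned = Counter({
    "output_mismatch": 487,
    "other": 153,
    "OOM": 57,
    "import_or_config": 22,
    "load_error": 10,
    "cuda_runtime": 2,
})

# The two rows under "Flaky (CI flagged)" are both load_error.
flaky = Counter({"load_error": 2})

# "Pinned clusters (CI bisect)" is empty in this window.
pinned = Counter()

# Assumption: the header total is the sum of these three groups.
total = unpinned + flaky + pinned

print(sum(unpinned.values()))  # 731
print(sum(total.values()))     # 733
print(total["load_error"])     # 12
```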

Per-model breakdown (all failures)

| model | failures | gpu | mode mix |
| --- | --- | --- | --- |
| whisper | 42 | multi/single | other 22, output_mismatch 20 |
| musicgen_melody | 16 | multi/single | output_mismatch 16 |
| generation | 15 | multi/single | output_mismatch 8, other 3, import_or_config 2, cuda_runtime 2 |
| gemma | 14 | multi/single | output_mismatch 14 |
| dac | 12 | multi/single | output_mismatch 12 |
| edgetam | 12 | multi/single | import_or_config 12 |
| glm46v | 12 | multi/single | other 12 |
| glm_ocr | 12 | multi/single | output_mismatch 10, other 2 |
| gemma3 | 10 | multi/single | output_mismatch 10 |
| gemma3n | 10 | multi/single | output_mismatch 10 |
| vision_encoder_decoder | 10 | multi/single | output_mismatch 6, other 4 |
| cohere2_vision | 8 | multi/single | other 4, output_mismatch 2, OOM 2 |
| emu3 | 8 | multi/single | OOM 5, import_or_config 2, output_mismatch 1 |
| falcon_mamba | 8 | multi/single | output_mismatch 8 |
| gemma4 | 8 | multi/single | output_mismatch 6, OOM 2 |
| grounding_dino | 8 | multi/single | output_mismatch 8 |
| higgs_audio_v2 | 8 | multi/single | output_mismatch 8 |
| llava | 8 | multi/single | output_mismatch 4, OOM 4 |
| mamba2 | 8 | multi/single | OOM 8 |
| mllama | 8 | multi/single | output_mismatch 8 |
| moshi | 8 | multi/single | output_mismatch 7, OOM 1 |
| mpt | 8 | multi/single | other 8 |
| qwen3_vl_moe | 8 | multi/single | other 6, output_mismatch 1, OOM 1 |
| recurrent_gemma | 8 | multi/single | output_mismatch 8 |
| voxtral | 8 | multi/single | output_mismatch 8 |
| peft_integration | 8 | multi/single | other 8 |
| olmo | 7 | multi/single | output_mismatch 4, OOM 2, other 1 |
| bloom | 6 | multi/single | output_mismatch 6 |
| bridgetower | 6 | multi/single | other 6 |
| chameleon | 6 | multi/single | output_mismatch 6 |
| deepseek_v2 | 6 | multi/single | OOM 6 |
| exaone4 | 6 | multi/single | OOM 4, output_mismatch 2 |
| fsmt | 6 | multi/single | output_mismatch 6 |
| glm | 6 | multi/single | OOM 4, output_mismatch 2 |
| kosmos2 | 6 | multi/single | output_mismatch 4, import_or_config 2 |
| layoutlmv2 | 6 | multi/single | output_mismatch 6 |
| mistral3 | 6 | multi/single | output_mismatch 6 |
| mluke | 6 | multi/single | output_mismatch 6 |
| mm_grounding_dino | 6 | multi/single | output_mismatch 6 |
| owlvit | 6 | multi/single | output_mismatch 6 |
| phimoe | 6 | multi/single | OOM 3, other 3 |
| plbart | 6 | multi/single | output_mismatch 6 |
| qwen3_omni_moe | 6 | multi/single | other 4, output_mismatch 2 |
| seamless_m4t | 6 | multi/single | other 6 |
| seamless_m4t_v2 | 6 | multi/single | other 6 |
| starcoder2 | 6 | multi/single | output_mismatch 6 |
| blip_2 | 5 | multi/single | output_mismatch 5 |
| deepseek_vl_hybrid | 5 | multi/single | other 2, OOM 2, output_mismatch 1 |
| deepseek_v4 | 4 | multi/single | load_error 4 |
| audioflamingo3 | 4 | multi/single | other 4 |
| bamba | 4 | multi/single | OOM 4 |
| bitnet | 4 | multi/single | other 4 |
| clvp | 4 | multi/single | output_mismatch 2, other 2 |
| colqwen2 | 4 | multi/single | other 2, output_mismatch 2 |
| cwm | 4 | multi/single | import_or_config 2, OOM 1, output_mismatch 1 |
| deepseek_vl | 4 | multi/single | other 3, output_mismatch 1 |
| flava | 4 | multi/single | output_mismatch 4 |
| gemma2 | 4 | multi/single | output_mismatch 2, other 2 |
| internvl | 4 | multi/single | output_mismatch 4 |
| jais2 | 4 | multi/single | load_error 4 |
| janus | 4 | multi/single | other 4 |
| lfm2_vl | 4 | multi/single | output_mismatch 4 |
| llava_next_video | 4 | multi/single | output_mismatch 4 |
| longt5 | 4 | multi/single | output_mismatch 4 |
| luke | 4 | multi/single | output_mismatch 4 |
| lw_detr | 4 | multi/single | output_mismatch 4 |
| mimi | 4 | multi/single | output_mismatch 4 |
| ministral | 4 | multi/single | output_mismatch 4 |
| ministral3 | 4 | multi/single | output_mismatch 4 |
| mistral | 4 | multi/single | output_mismatch 4 |
| mistral4 | 4 | multi/single | other 4 |
| mixtral | 4 | multi/single | output_mismatch 4 |
| moonshine_streaming | 4 | multi/single | output_mismatch 4 |
| musicgen | 4 | multi/single | output_mismatch 4 |
| nemotron | 4 | multi/single | output_mismatch 2, import_or_config 2 |
| oneformer | 4 | multi/single | output_mismatch 4 |
| persimmon | 4 | multi/single | output_mismatch 3, other 1 |
| pvt | 4 | multi/single | output_mismatch 4 |
| pvt_v2 | 4 | multi/single | output_mismatch 2, other 2 |
| qwen2_5_omni | 4 | multi/single | output_mismatch 4 |
| qwen2_moe | 4 | multi/single | output_mismatch 2, other 2 |
| qwen3 | 4 | multi/single | output_mismatch 4 |
| qwen3_5 | 4 | multi/single | output_mismatch 4 |
| qwen3_moe | 4 | multi | load_error 4 |
| rag | 4 | single | output_mismatch 4 |
| seed_oss | 4 | multi/single | output_mismatch 4 |
| smollm3 | 4 | multi/single | output_mismatch 4 |
| stablelm | 4 | multi/single | output_mismatch 4 |
| video_llava | 4 | multi/single | output_mismatch 4 |
| videomae | 4 | multi/single | output_mismatch 4 |
| zamba | 4 | multi/single | output_mismatch 4 |
| utils | 4 | multi/single | output_mismatch 4 |
| musicflamingo | 4 | multi/single | other 4 |
| flex_olmo | 3 | multi/single | output_mismatch 2, other 1 |
| pegasus | 3 | multi/single | other 3 |
| aya_vision | 2 | multi/single | output_mismatch 2 |
| big_bird | 2 | multi/single | output_mismatch 2 |
| convnextv2 | 2 | multi/single | output_mismatch 2 |
| cvt | 2 | multi/single | output_mismatch 2 |
| dab_detr | 2 | multi/single | output_mismatch 2 |
| dbrx | 2 | multi/single | other 2 |
| deepseek_v3 | 2 | multi/single | output_mismatch 1, other 1 |
| depth_anything | 2 | multi/single | output_mismatch 2 |
| dia | 2 | multi/single | output_mismatch 2 |
| diffllama | 2 | multi/single | output_mismatch 2 |
| efficientnet | 2 | multi/single | output_mismatch 2 |
| eomt_dinov3 | 2 | multi/single | output_mismatch 2 |
| evolla | 2 | multi/single | output_mismatch 2 |
| exaone4_5 | 2 | multi | OOM 2 |
| exaone_moe | 2 | multi/single | output_mismatch 2 |
| falcon_h1 | 2 | multi/single | output_mismatch 2 |
| fastspeech2_conformer | 2 | multi/single | output_mismatch 2 |
| florence2 | 2 | multi/single | output_mismatch 2 |
| fuyu | 2 | multi/single | output_mismatch 2 |
| git | 2 | multi/single | other 2 |
| glm4_moe | 2 | multi/single | OOM 2 |
| glm4_moe_lite | 2 | multi/single | OOM 2 |
| glm4v_moe | 2 | multi | other 2 |
| glm_image | 2 | multi/single | output_mismatch 2 |
| got_ocr2 | 2 | multi/single | output_mismatch 2 |
| granite | 2 | multi/single | output_mismatch 2 |
| helium | 2 | multi/single | output_mismatch 2 |
| hiera | 2 | multi/single | output_mismatch 2 |
| hyperclovax | 2 | multi/single | other 2 |
| instructblip | 2 | multi/single | output_mismatch 2 |
| instructblipvideo | 2 | multi/single | output_mismatch 2 |
| jamba | 2 | multi/single | output_mismatch 2 |
| kosmos2_5 | 2 | multi/single | output_mismatch 2 |
| lfm2_moe | 2 | multi/single | output_mismatch 2 |
| llama | 2 | multi/single | output_mismatch 2 |
| llava_next | 2 | multi/single | output_mismatch 2 |
| m2m_100 | 2 | multi/single | output_mismatch 2 |
| minimax | 2 | multi/single | output_mismatch 2 |
| modernvbert | 2 | multi/single | other 2 |
| nllb_moe | 2 | multi/single | output_mismatch 2 |
| olmo2 | 2 | multi/single | output_mismatch 2 |
| olmo3 | 2 | multi/single | output_mismatch 2 |
| olmoe | 2 | multi/single | output_mismatch 2 |
| opt | 2 | multi/single | output_mismatch 2 |
| ovis2 | 2 | multi/single | output_mismatch 2 |
| phi3 | 2 | multi/single | other 2 |
| pi0 | 2 | multi/single | OOM 2 |
| pixio | 2 | multi/single | output_mismatch 2 |
| qwen2_5_vl | 2 | multi/single | output_mismatch 2 |
| reformer | 2 | multi/single | output_mismatch 2 |
| regnet | 2 | multi/single | output_mismatch 2 |
| resnet | 2 | multi/single | output_mismatch 2 |
| superpoint | 2 | multi/single | other 2 |
| swiftformer | 2 | multi/single | output_mismatch 2 |
| swin2sr | 2 | multi/single | output_mismatch 2 |
| swinv2 | 2 | multi/single | output_mismatch 2 |
| t5gemma2 | 2 | multi/single | output_mismatch 2 |
| table_transformer | 2 | multi/single | output_mismatch 2 |
| univnet | 2 | multi/single | output_mismatch 2 |
| vilt | 2 | multi/single | output_mismatch 2 |
| vits | 2 | multi/single | output_mismatch 2 |
| vivit | 2 | multi/single | output_mismatch 2 |
| voxtral_realtime | 2 | multi/single | output_mismatch 2 |
| zamba2 | 2 | multi/single | output_mismatch 2 |
| blt | 2 | multi | other 2 |
| hunyuan_v1_moe | 1 | multi | other 1 |

Pinned clusters (CI bisect)

(none)

Flaky (CI flagged)

| model | gpu | test | mode | days |
| --- | --- | --- | --- | --- |
| deepseek_v4 | single | test_v4_flash_dequantized_chat_seven_prompts | load_error | 6/7 |
| deepseek_v4 | single | test_v4_flash_dequantized_generation | load_error | 6/7 |

Unpinned — samples per mode

These failures persisted across the window, but CI bisect could not attribute them to a bad commit, so they most likely regressed before the 7-day bisect window began. The tables below show the most recently seen samples for each failure mode.
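
As a rough illustration of what "persisted across the window" means, the sketch below counts failing days per test over seven daily runs and keeps those at or above the 5/7 threshold stated in the report header. The record layout, the `persistent_failures` helper, and the toy data are all hypothetical; the report does not describe how the dashboard actually computes this.

```python
from collections import defaultdict

WINDOW_DAYS = 7    # seven daily runs, per the report header
MIN_FAIL_DAYS = 5  # ">=5/7 intersection", per the report header


def persistent_failures(daily_failures, min_days=MIN_FAIL_DAYS):
    """Return {test_key: days_failed} for tests failing on >= min_days days.

    daily_failures: list of sets, one per daily run. Each set holds the
    (model, gpu, test) tuples that failed that day (hypothetical layout).
    """
    days_failed = defaultdict(int)
    for failed_today in daily_failures:
        for key in failed_today:
            days_failed[key] += 1
    return {k: n for k, n in days_failed.items() if n >= min_days}


# Toy data shaped after rows in this report; the day patterns are invented.
zamba = ("zamba", "single", "test_simple_generate")
phimoe = ("phimoe", "single", "test_phimoe_instruct_generation")
rare = ("example_model", "single", "test_rarely_fails")  # made-up row

runs = [{zamba, phimoe}] * 5 + [{zamba, rare}] * 2
result = persistent_failures(runs)

for (model, gpu, test), n in sorted(result.items()):
    print(f"{model} {gpu} {test} {n}/{WINDOW_DAYS}")
```

With this toy input, the test that fails on only two days is dropped, while the 5/7 and 7/7 tests are kept.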

output_mismatch 487 unpinned failures — sample of 5

| model | gpu | test | days | trace excerpt |
| --- | --- | --- | --- | --- |
| zamba2 | single | test_simple_batched_generate_with_padding_0_cuda | 7/7 | (line 600) AssertionError: Tensor-likes are not close! |
| zamba2 | multi | test_simple_batched_generate_with_padding_0_cuda | 7/7 | (line 600) AssertionError: Tensor-likes are not close! |
| zamba | single | test_simple_batched_generate_with_padding | 7/7 | (line 476) AssertionError: '<s> [20 chars]g on this lovely evening? I hope you are having a great day. I' != '<s> [20 chars]g on this lovely evening? I hope you are all doing well. I am' |
| zamba | single | test_simple_generate | 7/7 | (line 463) AssertionError: The values for attribute 'dtype' do not match: torch.bfloat16 != torch.float32. |
| zamba | multi | test_simple_batched_generate_with_padding | 7/7 | (line 476) AssertionError: '<s> [20 chars]g on this lovely evening? I hope you are having a great day. I' != '<s> [20 chars]g on this lovely evening? I hope you are all doing well. I am' |

OOM 57 unpinned failures — sample of 5

| model | gpu | test | days | trace excerpt |
| --- | --- | --- | --- | --- |
| qwen3_vl_moe | multi | test_small_model_integration_test_expand | 7/7 | (line 991) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 768.00 MiB. GPU 1 has a total capacity of 22.30 GiB of which 240.69 MiB is free. Process 643790 has 22.06 GiB memory in use. Of the allocated mem… |
| pi0 | single | test_train_pi0_base_libero | 7/7 | (line 193) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 18.00 MiB. GPU 0 has a total capacity of 22.30 GiB of which 6.69 MiB is free. Process 777692 has 22.29 GiB memory in use. Of the allocated memory… |
| pi0 | multi | test_train_pi0_base_libero | 7/7 | (line 785) torch.OutOfMemoryError: Caught OutOfMemoryError in replica 0 on device 0. |
| phimoe | single | test_model_phimoe_instruct_logits | 6/7 | (line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.56 GiB. GPU 0 has a total capacity of 22.30 GiB of which 814.69 MiB is free. Process 329492 has 21.50 GiB memory in use. Of the allocated memor… |
| phimoe | single | test_phimoe_instruct_generation | 5/7 | (line 353) torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.56 GiB. GPU 0 has a total capacity of 22.30 GiB of which 812.69 MiB is free. Process 329492 has 21.50 GiB memory in use. Of the allocated memor… |

load_error 10 unpinned failures — sample of 5

| model | gpu | test | days | trace excerpt |
| --- | --- | --- | --- | --- |
| qwen3_moe | multi | test_model_15b_a2b_generation | 7/7 | (line 74) ValueError: Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modul… |
| qwen3_moe | multi | test_model_15b_a2b_logits | 7/7 | (line 74) ValueError: Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modul… |
| qwen3_moe | multi | test_model_15b_a2b_long_prompt_sdpa | 7/7 | (line 74) ValueError: Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modul… |
| qwen3_moe | multi | test_speculative_generation | 7/7 | (line 74) ValueError: Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modul… |
| jais2 | multi | test_model_generation | 7/7 | (line 503) OSError: You are trying to access a gated repo. |

cuda_runtime 2 unpinned failures — sample of 2

| model | gpu | test | days | trace excerpt |
| --- | --- | --- | --- | --- |
| generation | single | test_validate_assistant | 7/7 | (line 1909) torch.AcceleratorError: CUDA error: device-side assert triggered |
| generation | multi | test_validate_assistant | 7/7 | (line 1909) torch.AcceleratorError: CUDA error: device-side assert triggered |

import_or_config 22 unpinned failures — sample of 5

| model | gpu | test | days | trace excerpt |
| --- | --- | --- | --- | --- |
| nemotron | single | test_nemotron_8b_generation_fa2 | 7/7 | (line 1725) ImportError: FlashAttention2 has been toggled on, but it cannot be used due to the following error: the package for FlashAttention2 doesn't seem to be installed. |
| nemotron | multi | test_nemotron_8b_generation_fa2 | 6/7 | (line 1725) ImportError: FlashAttention2 has been toggled on, but it cannot be used due to the following error: the package for FlashAttention2 doesn't seem to be installed. |
| kosmos2 | multi | test_inference_interpolate_pos_encoding | 7/7 | (line 777) AttributeError: 'NoneType' object has no attribute 'last_hidden_state' |
| kosmos2 | single | test_inference_interpolate_pos_encoding | 7/7 | (line 777) AttributeError: 'NoneType' object has no attribute 'last_hidden_state' |
| generation | single | test_green_red_watermark_generation | 7/7 | (line 665) AttributeError: 'dict' object has no attribute 'validate' |

other 153 unpinned failures — sample of 5

| model | gpu | test | days | trace excerpt |
| --- | --- | --- | --- | --- |
| whisper | single | test_distil_token_timestamp_generation | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | single | test_large_batched_generation | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | single | test_large_generation | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | single | test_large_generation_multilingual | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |
| whisper | single | test_large_timestamp_generation | 7/7 | (line 370) RuntimeError: Input type (float) and bias type (c10::Half) should be the same |