Singh, S. , Sharma, A. and Tiwari, S. (2026) “Empirical Benchmarking of Vision-Language Transformer Combinations for Visual Question Answering Tasks”, DMPedia Lecture Notes in Computer Science & Engineering, (IMPACT26), pp. 199–209. Available at: https://digitalmanuscriptpedia.com/conferences/index.php/DMP-LNCSE/article/view/144 (Accessed: 29 March 2026).