(1)
Singh, S. .; Sharma, A. .; Tiwari, S. . Empirical Benchmarking of Vision-Language Transformer Combinations for Visual Question Answering Tasks. DMP-LNCSE 2026, No. IMPACT26, 199-209.