Selected publications

Conference paper: Leveraging Efficient Training and Feature Fusion in Transformers for Multimodal Classification (2023)