Dr Shreya Ghosh
Lecturer
School of Electrical Engineering and Computer Science

Journal Articles
Hasan, Md Rakibul, Hossain, Md Zakir, Ghosh, Shreya, Krishna, Aneesh and Gedeon, Tom (2025). Empathy detection from text, audiovisual, audio or physiological signals: a systematic review of task formulations and machine learning methods. IEEE Transactions on Affective Computing, 1-20. doi: 10.1109/taffc.2025.3590107
Ghosh, Shreya, Dhall, Abhinav, Hayat, Munawar, Knibbe, Jarrod and Ji, Qiang (2024). Automatic gaze analysis: a survey of deep learning based approaches. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46 (1), 61-84. doi: 10.1109/tpami.2023.3321337
Cai, Zhixi, Ghosh, Shreya, Dhall, Abhinav, Gedeon, Tom, Stefanov, Kalin and Hayat, Munawar (2023). Glitch in the matrix: A large scale benchmark for content driven audio-visual forgery detection and localization. Computer Vision and Image Understanding, 236 103818, 1-12. doi: 10.1016/j.cviu.2023.103818
Ghosh, Shreya, Dhall, Abhinav, Sebe, Nicu and Gedeon, Tom (2022). Automatic prediction of group cohesiveness in images. IEEE Transactions on Affective Computing, 13 (3), 1677-1690. doi: 10.1109/taffc.2020.3026095
Ghosh, Shreya and Anwar, Tarique (2021). Depression Intensity Estimation via Social Media: A Deep Learning Approach. IEEE Transactions On Computational Social Systems, 8 (6), 1465-1474. doi: 10.1109/tcss.2021.3084154
Li, Yi, Ghosh, Shreya and Joshi, Jyoti (2021). PLAAN: Pain Level Assessment with Anomaly-detection based Network. Journal On Multimodal User Interfaces, 15 (4), 359-372. doi: 10.1007/s12193-020-00362-8
Conference Papers
Kollias, Dimitrios, Zafeiriou, Stefanos, Kotsia, Irene, Dhall, Abhinav, Ghosh, Shreya, Shao, Chunchang and Hu, Guanyu (2025). 7th ABAW competition: multi-task learning and compound expression recognition. Computer Vision – ECCV 2024 Workshops, Milan, Italy, 29 September-4 October 2024. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-91581-9_3
Madan, S., Ghosh, S., Sookha, L. R., Ganaie, M. A., Subramanian, R., Dhall, A. and Gedeon, T. (2025). MIP-GAF: a MLLM-annotated benchmark for Most Important Person localization and group context understanding. 2025 Winter Conference on Applications of Computer Vision-WACV, Tucson, AZ USA, 28 February-4 March 2025. Los Alamitos, CA USA: IEEE Computer Society. doi: 10.1109/wacv61041.2025.00150
Tao, Jianhua, Ghosh, Shreya, Lian, Zheng, Cai, Zhixi, Schuller, Björn W., Dhall, Abhinav, Zhao, Guoying, Kollias, Dimitrios, Cambria, Erik, Goecke, Roland and Gedeon, Tom (2024). MRAC '24 Chairs' Welcome. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC Australia, 28 October - 1 November 2024. New York, NY United States: Association for Computing Machinery.
Ghosh, Shreya, Cai, Zhixi, Dhall, Abhinav, Kollias, Dimitrios, Goecke, Roland and Gedeon, Tom (2024). MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing. The 32nd ACM International Conference on Multimedia, Melbourne, VIC Australia, 28 October-1 November 2024. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3689092.3690042
Cai, Zhixi, Ghosh, Shreya, Adatia, Aman Pankaj, Hayat, Munawar, Dhall, Abhinav, Gedeon, Tom and Stefanov, Kalin (2024). AV-Deepfake1M: a large-scale LLM-driven audio-visual deepfake dataset. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC Australia, 28 October-1 November 2024. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3664647.3680795
Cai, Zhixi, Dhall, Abhinav, Ghosh, Shreya, Hayat, Munawar, Kollias, Dimitrios, Stefanov, Kalin and Tariq, Usman (2024). 1M-Deepfakes Detection Challenge. The 32nd ACM International Conference on Multimedia, Melbourne, VIC Australia, 28 October-1 November 2024. New York, NY USA: Association for Computing Machinery, Inc. doi: 10.1145/3664647.3689145
Ghosh, Shreya, Cai, Zhixi, Gupta, Parul, Sharma, Garima, Dhall, Abhinav, Hayat, Munawar and Gedeon, Tom (2024). Emolysis: a multimodal open-source group emotion analysis and visualization toolkit. 12th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos-ACIIW, Glasgow, Scotland, United Kingdom, 15 September 2024. Los Alamitos, CA USA: IEEE Computer Society. doi: 10.1109/aciiw63320.2024.00023
Shaiok, Lazib Sharar, Hoque, Ishtiaqul, Hasan, Md Rakibul, Ghosh, Shreya, Gedeon, Tom and Hossain, Md Zakir (2024). Attention-Based Multi-layer Perceptron to Categorize Affective Videos from Viewer's Physiological Signals. 16th Asian Conference on Intelligent Information and Database Systems (ACIIDS), Ras Al Khaimah, United Arab Emirates, 15-18 April 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-5934-7_3
Mondal, Chayan, Pham, Duc-Son, Gupta, Ashu, Ghosh, Shreya, Tan, Tele and Gedeon, Tom (2023). EfficienTransNet: an automated chest X-ray report generation paradigm. The 31st ACM International Conference on Multimedia, Ottawa, Canada, 29 October 2023. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3607865.3616174
Sharma, Garima, Ghosh, Shreya, Dhall, Abhinav, Hayat, Munawar, Cai, Jianfei and Gedeon, Tom (2023). GraphITTI: Attributed Graph-based Dominance Ranking in Social Interaction Videos. ICMI '23 Companion: Companion Publication of the 25th International Conference on Multimodal Interaction, Paris, France, 9 - 13 October 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3610661.3616184
Cai, Zhixi, Ghosh, Shreya, Stefanov, Kalin, Dhall, Abhinav, Cai, Jianfei, Rezatofighi, Hamid, Haffari, Reza and Hayat, Munawar (2023). MARLIN: Masked Autoencoder for facial video Representation LearnINg. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00150
Ghosh, Shreya, Dhall, Abhinav, Hayat, Munawar and Knibbe, Jarrod (2023). 'Labelling the gaps': a weakly supervised automatic eye gaze estimation. 16th Asian Conference on Computer Vision (ACCV), Macao, Peoples Republic of China, 4-8 December 2022. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-26316-3_44
Ghosh, Shreya, Dhall, Abhinav, Hayat, Munawar and Knibbe, Jarrod (2022). AV-GAZE: a study on the effectiveness of audio guided visual attention estimation for non-profilic faces. 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 October 2022. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/icip46576.2022.9897360
Ghosh, Shreya, Hayat, Munawar, Dhall, Abhinav and Knibbe, Jarrod (2022). MTGLS: Multi-Task Gaze Estimation with Limited Supervision. 22nd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 4-8 January 2022. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/WACV51458.2022.00123
Ghosh, Shreya, Dhall, Abhinav, Sharma, Garima, Gupta, Sarthak and Sebe, Nicu (2021). Speak2Label: using domain knowledge for creating a large scale driver gaze zone estimation dataset. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, Canada, 11-17 October 2021. Los Alamitos, CA USA: IEEE Computer Society. doi: 10.1109/iccvw54120.2021.00324
Li, Yi, Ghosh, Shreya, Joshi, Jyoti and Oviatt, Sharon (2020). LSTM-DNN based Approach for Pain Intensity and Protective Behaviour Prediction. 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG), Buenos Aires, Argentina, 16-20 November 2020. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/fg47880.2020.00061
Sharma, Garima, Ghosh, Shreya and Dhall, Abhinav (2019). Automatic group level affect and cohesion prediction in videos. 2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW), Cambridge, United Kingdom, 3-6 September 2019. Piscataway, NJ USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/aciiw.2019.8925231
Dubey, Neeru, Ghosh, Shreya and Dhall, Abhinav (2019). Unsupervised learning of eye gaze representation from the web. International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 14-19 July 2019. New York, NY USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/ijcnn.2019.8851961
Ghosh, Shreya, Dhall, Abhinav, Sebe, Nicu and Gedeon, Tom (2019). Predicting group cohesiveness in images. 2019 International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 14-19 July 2019. Piscataway, NJ USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/ijcnn.2019.8852184
Ghosh, Shreya and Dhall, Abhinav (2019). Role of group level affect to find the most influential person in images. 15th European Conference on Computer Vision (ECCV), Munich, Germany, 8-14 September 2018. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-030-11012-3_39
Dhall, Abhinav, Goecke, Roland, Ghosh, Shreya and Gedeon, Tom (2019). EmotiW 2019: Automatic Emotion, Engagement and Cohesion Prediction Tasks. 21st ACM International Conference on Multimodal Interaction (ICMI), Suzhou, China, 14-18 October 2019. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3340555.3355710
Ghosh, Shreya, Dhall, Abhinav and Sebe, Nicu (2018). Automatic Group Affect Analysis in Images via Visual Attribute and Feature Networks. 2018 25th IEEE International Conference on Image Processing (ICIP), Athens, Greece, 7-10 October 2018. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/icip.2018.8451242
Dhall, Abhinav, Goecke, Roland, Ghosh, Shreya, Joshi, Jyoti, Hoey, Jesse and Gedeon, Tom (2017). From individual to group-level emotion recognition: EmotiW 5.0. 19th International Conference on Multimodal Interaction-ICMI, Glasgow, Scotland, United Kingdom, 13-17 November 2017. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3136755.3143004