Abdelrahman Eldesokey is a Postdoctoral Fellow at KAUST, working on Generative AI and Computer Vision. His research focuses on diffusion models, multimodal large language models, and vision foundation models, aiming to make generative systems more controllable, interpretable, and reliable.

Biography

Abdelrahman Eldesokey is a Postdoctoral Fellow at King Abdullah University of Science and Technology (KAUST), specializing in Generative AI and Computer Vision. His research explores diffusion models, multimodal large language models, and vision foundation models, bridging perception and generation. He holds a Ph.D. in Computer Vision and Deep Learning from Linköping University, Sweden, and has over a decade of combined academic and industrial experience across Sweden, Egypt, and Saudi Arabia. His work has been published in leading venues including CVPR, ICCV, SIGGRAPH, NeurIPS, and ICLR, and focuses on advancing the controllability, interpretability, and reliability of modern generative systems.

Research Interests

My research focuses on Generative AI, particularly diffusion models, vision-language models, and agentic multimodal systems. I am interested in improving the controllability, interpretability, and reliability of generative models, bridging perception and generation. Additional interests include uncertainty-aware learning, 3D scene understanding, and AI evaluation for generative models in real-world settings.

Service Contributions

Service to the Discipline or Profession
  • Co-organized the Visual Object Tracking challenge (VOT) 2016, 2017, 2018, and 2019, 2016 - 2019

Education

Bachelor of Science (B.S.)
Computers and Systems Engineering, Mansoura University, Egypt, 2011
Master of Science (M.S.)
Communication and Information Technology, Nile University, Egypt, 2016
Doctor of Philosophy (Ph.D.)
Computer Vision and Deep Learning, Linköping University, Sweden, 2021