RT Conference Proceedings T1 Efficient Parallel Video Encoding on Heterogeneous Systems A1 Momcilovic, Svetislav A1 Ilic, Aleksandar A1 Roma, Nuno A1 Sousa, Leonel A2 Carretero Pérez, Jesús A2 García Blas, Javier A2 Barbosa, Jorge A2 Morla, Ricardo A2 Universidad Carlos III de Madrid. Computer Architecture, Communications and Systems Group (ARCOS) AB In this study we propose an efficient method for collaborative H.264/AVC inter-loop encoding in heterogeneous CPU+GPU systems. This method relies on specifically developed extensive library of highly optimized parallel algorithms for both CPU and GPU architectures, and all inter-loop modules. In order to minimize the overall encoding time, this method integrates adaptive load balancing for the most computationally intensive, inter-prediction modules, which is based on dynamically built functional performance models of heterogenous devices and inter-loop modules. The proposed method also introduces efficient communication-aware techniques, which maximize data reusing, and decrease the overhead of expensive data transfers in collaborative video encoding. The experimental results show that the proposed method is able of achieving real-time video encoding for very demanding video coding parameters, i.e., full HD video format, 64×64 pixels search area and the exhaustive motion estimation. SN 978-84-617-2251-8 YR 2014 FD 2014-11 LK https://hdl.handle.net/10016/21976 UL https://hdl.handle.net/10016/21976 LA eng NO Proceedings of: First International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2014). Porto (Portugal), August 27-28, 2014. NO This work was supported by national funds through FCT – Fundação para a Ciência e a Tecnologia, under projects PEst-OE/EEI/LA0021/2013, PTDC/EEI-ELC/3152/2012 and PTDC/EEA-ELC/117329/2010. DS e-Archivo RD 17 jul. 2024