{"id":823,"date":"2018-12-29T00:38:32","date_gmt":"2018-12-29T05:38:32","guid":{"rendered":"https:\/\/coefs.charlotte.edu\/htabkhiv\/?page_id=823"},"modified":"2026-03-12T17:04:54","modified_gmt":"2026-03-12T21:04:54","slug":"empowering-fpgas-for-cloud-computing","status":"publish","type":"page","link":"https:\/\/coefs.charlotte.edu\/htabkhiv\/research\/empowering-fpgas-for-cloud-computing\/","title":{"rendered":"Empowering FPGAs for Cloud Computing"},"content":{"rendered":"\n<p>This research is an intersection of three different areas: &nbsp;computer architecture, reconfigurable computing, and design automation. The aim is to empower the FPGAs to become a much more efficient platform for high-performance power efficient execution of many emerging massively parallel applications (from AI and computer vision to scientific and big data computing). This research suggests a shift from synthesizing the computation-path toward synthesizing the entire application-specific system architecture with kernel-specific memory prefetcher and runtime thread scheduler to fully utilize the benefits and potential of reconfigurable fabric. This study aims to create a systematic way to integrate the proposed architecture optimizations (runtime prefetching and thread scheduling) into the OpenCL High-Level Synthesis.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"328\" src=\"https:\/\/coefs.charlotte.edu\/htabkhiv\/files\/2019\/01\/toolchain-1024x328.jpg\" alt=\"flow chart\" class=\"wp-image-957\" srcset=\"https:\/\/coefs.charlotte.edu\/htabkhiv\/files\/2019\/01\/toolchain-1024x328.jpg 1024w, https:\/\/coefs.charlotte.edu\/htabkhiv\/files\/2019\/01\/toolchain-300x96.jpg 300w, https:\/\/coefs.charlotte.edu\/htabkhiv\/files\/2019\/01\/toolchain-768x246.jpg 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n<\/div>\n\n\n<p><\/p>\n\n\n\n<p>The advent of Open Source Programming Models like OpenCL for reconfigurable platforms like FPGAs has attracted many application developers and system architects for efficient execution of compute-intensive massively parallel applications. Our work is focused on identifying the major performance bottlenecks when executing OpenCL kernels on FPGAs and proposing both architectural and design level optimizations to remove those bottlenecks. Our philosophy is to provide easy-to-use, sustainable generic solutions that are scalable in nature so as to integrate with the ongoing FPGA research in the industry as well as academia.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"767\" src=\"https:\/\/coefs.charlotte.edu\/htabkhiv\/files\/2018\/12\/fpga-1024x767.jpg\" alt=\"flow chart\" class=\"wp-image-811\" srcset=\"https:\/\/coefs.charlotte.edu\/htabkhiv\/files\/2018\/12\/fpga-1024x767.jpg 1024w, https:\/\/coefs.charlotte.edu\/htabkhiv\/files\/2018\/12\/fpga-300x225.jpg 300w, https:\/\/coefs.charlotte.edu\/htabkhiv\/files\/2018\/12\/fpga-768x575.jpg 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n<\/div>\n\n\n<p><\/p>\n\n\n\n<p>Current projects underway include;<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>A novel <strong><em>LLVM based automation<\/em><\/strong> approach to enable memory decoupling in order to hide memory access latency thereby improving the overall throughput of massively parallel applications running on FPGAs. We use Intel Altera Stratix V FPGA device for this work.<\/li>\n\n\n\n<li><strong><em>Machine learning applications<\/em><\/strong> that are both compute and memory intensive are employed in a diverse category of toolchains supported by the cutting edge Amazon Web Services [AWS] F1 EC2 cloud instance with state-of-art Xilinx VU9P FPGAs. Xilinx and AWS provides support for developing and running such applications on high-performance custom hardware acceleration.<\/li>\n<\/ol>\n\n\n\n<p>Through the combination of such powerful computing resources and novel architecture for FPGAs, we foresee to achieve fruitful results in many domains such as machine learning training, inference and computer vision.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This research is an intersection of three different areas: &nbsp;computer architecture, reconfigurable computing, and design automation. The aim is to empower the FPGAs to become a much more efficient platform for high-performance power efficient execution of many emerging massively parallel &hellip; <a href=\"https:\/\/coefs.charlotte.edu\/htabkhiv\/research\/empowering-fpgas-for-cloud-computing\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":231,"featured_media":0,"parent":47,"menu_order":3,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-823","page","type-page","status-publish","hentry"],"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/coefs.charlotte.edu\/htabkhiv\/wp-json\/wp\/v2\/pages\/823","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/coefs.charlotte.edu\/htabkhiv\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/coefs.charlotte.edu\/htabkhiv\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/coefs.charlotte.edu\/htabkhiv\/wp-json\/wp\/v2\/users\/231"}],"replies":[{"embeddable":true,"href":"https:\/\/coefs.charlotte.edu\/htabkhiv\/wp-json\/wp\/v2\/comments?post=823"}],"version-history":[{"count":5,"href":"https:\/\/coefs.charlotte.edu\/htabkhiv\/wp-json\/wp\/v2\/pages\/823\/revisions"}],"predecessor-version":[{"id":1646,"href":"https:\/\/coefs.charlotte.edu\/htabkhiv\/wp-json\/wp\/v2\/pages\/823\/revisions\/1646"}],"up":[{"embeddable":true,"href":"https:\/\/coefs.charlotte.edu\/htabkhiv\/wp-json\/wp\/v2\/pages\/47"}],"wp:attachment":[{"href":"https:\/\/coefs.charlotte.edu\/htabkhiv\/wp-json\/wp\/v2\/media?parent=823"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}