This paper corresponds to the following publisher's version.
Bibliographic information: IEICE Technical Report, SC2018-32, November 2018
DOI: https://ken.ieice.org/ken/paper/20181110B16w/
Preprint / Version 1

Parallel Processing Part Extraction and Data Transfer Count Reduction Method for GPU Offloading of IoT Applications

Authors

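  • Yoji Yamato (山登 庸次), NTT, Inc., Network Service Systems Laboratories
  • Hirofumi Noguchi (野口 博史), NTT, Inc., Network Service Systems Laboratories
  • Misao Kataoka (片岡 操), NTT, Inc., Network Service Systems Laboratories
  • Takuma Isoda (磯田 卓万), NTT, Inc., Network Service Systems Laboratories
  • Tatsuya Demizu (出水 達也), NTT, Inc., Network Service Systems Laboratories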

DOI:

https://doi.org/10.51094/jxiv.3000

Keywords:

Open IoT, GPGPU, data transfer optimization, genetic algorithm, automatic offloading

Abstract

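For Open IoT, we have proposed Tacit Computing, a technology that dynamically discovers and uses the devices holding the data a user needs, together with automatic GPU offloading as one of its elemental technologies. However, the existing technique only optimizes the parallel processing parts, which limits the applications whose performance it can improve. In this paper, to improve more applications, we propose a method that reduces the number of data transfers between the CPU and the GPU. We applied the method to Darknet, a large-scale application, and confirmed a threefold speedup compared with execution on the CPU.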
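To illustrate why the number of CPU-GPU transfers matters, the following is a minimal sketch and not the authors' implementation. It models an inner loop that is offloaded to the GPU and executed once per outer-loop iteration, then counts the copies when the data is transferred at every invocation versus once outside the outer loop. The function name `count_transfers`, the `hoist` flag, and the cost model (one host-to-device and one device-to-host copy per array) are all assumptions made for illustration. Hoisting is only valid when the CPU does not read or write the arrays between iterations.

```python
def count_transfers(outer_iters: int, num_arrays: int, hoist: bool) -> int:
    """Count CPU<->GPU copies for an offloaded inner loop run outer_iters times.

    Assumed cost model (hypothetical): each array is copied host-to-device
    before the kernel and device-to-host after it, i.e. 2 copies per array
    at every transfer point.
    """
    copies_per_point = 2 * num_arrays
    if hoist:
        # Transfers are placed outside the outer loop, so they happen once.
        return copies_per_point
    # Transfers are placed at the offloaded loop, so they repeat every iteration.
    return copies_per_point * outer_iters


if __name__ == "__main__":
    naive = count_transfers(outer_iters=1000, num_arrays=3, hoist=False)
    hoisted = count_transfers(outer_iters=1000, num_arrays=3, hoist=True)
    print(f"per-iteration transfers: {naive}, hoisted transfers: {hoisted}")
```

In this toy model the transfer count grows with the number of outer iterations unless the copies are hoisted, which is the kind of overhead a transfer-reduction method targets.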

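Yoji Yamato, IEICE Technical Report, SC2018-32, November 2018

Conflict of Interest Disclosure

The authors declare that they have no conflicts of interest related to this manuscript.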

Published


Submitted: 2026-02-08 22:41:07 UTC

Published: 2026-02-18 05:57:33 UTC
Research field
Information science