Search Results - PARALLEL COMPUTER ARCHITECTURE

Refine Results
  1. 1
  2. 2

    Multilayer perceptron architecture optimization using parallel computing techniques. by Wilson Castro, Jimy Oblitas, Roberto Santa-Cruz, Himer Avila-George

    Published in PLoS ONE (2017-01-01)
    “…Using MATLAB 2015a, a logical sequence was designed and implemented for constructing, training, and evaluating multilayer-perceptron-type neural networks using parallel computing techniques. The results show that HL and NHL have a statistically relevant effect on SSE, and from two hidden layers, AF also has a significant effect; thus, both AF and NHL can be evaluated to determine the optimal combination per group. …”
    Get full text
    Article
  3. 3
  4. 4

    Performance Analysis of Homogeneous On-Chip Large-Scale Parallel Computing Architectures for Data-Parallel Applications by Xiaowen Chen, Zhonghai Lu, Axel Jantsch, Shuming Chen, Yang Guo, Shenggang Chen, Hu Chen

    “…On-chip computing platforms are evolving from single-core bus-based systems to many-core network-based systems, which are referred to as On-chip Large-scale Parallel Computing Architectures (OLPCs) in the paper. …”
    Get full text
    Article
  5. 5
  6. 6
  7. 7

    An FPGA-Based Quantum Computing Emulation Framework Based on Serial-Parallel Architecture by Y. H. Lee, M. Khalil-Hani, M. N. Marsono

    “…Hardware emulation of quantum systems can mimic more efficiently the parallel behaviour of quantum computations, thus allowing higher processing speed-up than software simulations. …”
    Get full text
    Article
  8. 8
  9. 9

    Parallel Sort Implementation and Evaluation in a Dataflow-Based Polymorphic Computing Architecture by David Hentrich, Erdal Oruklu, Jafar Saniie

    Published in Computers (2025-05-01)
    “…Finally, a comparative study of the odd–even sort algorithm in several configurations of the polymorphic computing architecture is presented. The results show that scaling up the number of processing cores in the polymorphic architecture to the maximum amount of instantaneously exploitable parallelism improves the speed of the sort algorithms. …”
    Get full text
    Article
  10. 10
  11. 11

    A Cloud Computing Service Architecture of a Parallel Algorithm Oriented to Scientific Computing with CUDA and Monte Carlo by Yimu Ji, Zizhuo Kuang, Yu Pan Qiao, Yanpeng Sun, Jiangbang Kang, Wei Huang

    Published in Cybernetics and Information Technologies (2013-12-01)
    “…We propose one solution of the parallel computation in global optimization of the physical characteristics in organ electronics with Monte Carlo, and one cloud service architecture for parallel computation of organ electronics was designed. …”
    Get full text
    Article
  12. 12

    A Parallel Computing Method for the Computation of the Moore–Penrose Generalized Inverse for Shared-Memory Architectures by Elkin Gelvez-Almeida, Ricardo J. Barrientos, Karina Vilches-Ponce, Marco Mora

    Published in IEEE Access (2023-01-01)
    “…In this paper, we propose a parallel computing method for the computation of the Moore–Penrose generalized inverse of large-size full-rank rectangular matrices. …”
    Get full text
    Article
  13. 13

    Parallel Transitive Closure Algorithm for Heterogeneous Architecture by XIAO Han, GUO Baoyun, LI Cailin, ZHOU Qinglei

    Published in Jisuanji gongcheng (2021-08-01)
    “…The algorithm combines the characteristics of algorithm-intensive computation and OpenCL architecture, and uses the parallel submatrix multiplication and block matrix multiplication optimized by local memory for parallel computing. …”
    Get full text
    Article
  14. 14

    PMBA: A Parallel MCMC Bayesian Computing Accelerator by Yufei Ni, Yangdong Deng, Songlin Li

    Published in IEEE Access (2021-01-01)
    “…In this work, we propose a parallel MCMC Bayesian computing accelerator (PMBA) architecture. …”
    Get full text
    Article
  15. 15

    Multitask Learning-Based Pipeline-Parallel Computation Offloading Architecture for Deep Face Analysis by Faris S. Alghareb, Balqees Talal Hasan

    Published in Computers (2025-01-01)
    “…Therefore, this manuscript explores levels of parallelism techniques along with edge computation offloading to develop an innovative hardware platform that improves the efficacy of deep learning computing architectures. …”
    Get full text
    Article
  16. 16
  17. 17

    Architecture and key technologies of coalmine underground vision computing by Jian CHENG, Hao LI, Kun MA, Bin LIU, Dazhi SUN, Yongzhuang MA, Gang YIN, Guangfu WANG, Heping LI

    Published in Meitan kexue jishu (2023-09-01)
    “…To effectively develop this theory and make it better for intelligent development of coalmine, this paper first analyzed the similarities and differences about computer vision and visual computing in coalmine, and proposed its composition architecture. …”
    Get full text
    Article
  18. 18

    Implementation and Optimization of Parallel KNN Algorithm for Sunway Architecture by WANG Qihan, PANG Jianmin, YUE Feng, ZHU Di, SHEN Li, XIAO Qian

    Published in Jisuanji gongcheng (2023-05-01)
    “…Exploiting the structural characteristics of the new-generation Sunway heterogeneous many-core processors allows an efficient KNN algorithm to be achieved for massive data analysis and collation.In this study,based on the structural characteristics of the SW26010pro processor,the master-slave acceleration programming model is used to implement the basic version of the KNN parallel algorithm,which transfers the computing core to the slave core for thread-level parallelism.Subsequently,the key factors affecting the performance of the basic parallel algorithm are analyzed,and the SWKNN algorithm is proposed,which is different from the task-division method of the basic parallel KNN algorithm. …”
    Get full text
    Article
  19. 19

    Parallel Optimization Method of Unstructured-grid Computing in CFD for DomesticHeterogeneous Many-core Architecture by CHEN Xin, LI Fang, DING Hai-xin, SUN Wei-ze, LIU Xin, CHEN De-xun, YE Yue-jin, HE Xiang

    Published in Jisuanji kexue (2022-06-01)
    “…Sunway TaihuLight ranked first in the global supercomputer top 500 list 2016-2018 with a peak performance of 125.4 PFlops.Its computing power is mainly attributed to the domestic SW26010 many-core RISC processor.CFD unstructured-grid computing has always been a challenge for porting and optimizing in domestic many-core supercomputer,because of its complex topology,serious discrete memory access problems,and strongly correlated linear equation solution.In order to give fully play to the computing efficiency of domestic heterogeneous multi-core architecture,firstly,a data reconstruction model is proposed to improve the locality and parallelism of data,and the data structure is more suitable for the characteristics of multi-core architecture.Secondly,aiming at the discrete memory access problem caused by the disorder of unstructured-grid data storage,a discrete memory access optimization method based on prestorage of information relation is proposed,which transforms discrete memory access into continuous memory access.Finally,the pipeline parallelism mechanism in core array is introduced to realize many-core parallelism for solving linear equations with strong correlation.Experiments show that the overall performance of unstructured-grid computing in CFD is improved by more than 4 times,and is 1.2x faster than the general CPU.The computing cores scale to 624 000,and the parallelism efficiency is maintained at 64.5%.…”
    Get full text
    Article
  20. 20

    HybriD-GM: A Framework for Quantum Computing Simulation Targeted to Hybrid Parallel Architectures by Anderson Avila, Helida Santos, Anderson Cruz, Samuel Xavier-de-Souza, Giancarlo Lucca, Bruno Moura, Adenauer Yamin, Renata Reiser

    Published in Entropy (2023-03-01)
    “…The D-GM environment is also extended, providing efficient parallel executions for quantum computing simulations, targeted to hybrid architectures considering the CPU and GPU integration. …”
    Get full text
    Article

Search Tools: