CodePudding
 Tags > neon
  • 09-10 [Back-end] how to properly do multiply accumulate with NEON intrinsics
  • 08-24 [OS] Memory copying: ARM STM vs. ARM NEON
  • 06-21 [Mobile] What kind of assembly instruction is this ld1 {v4.16b - v7.16b}, [x10]?
  • 04-28 [Enterprise] Efficiently calculate hamming weight
  • 04-19 [OS] How do I interpret the instruction `mov v2.2d[0],x14` in aarch64 assembly?
  • 03-22 [OS] fast bit-matrix (64x64) transpose algorithm using SIMD (ARM)
  • 03-21 [Mobile] fast bitwise 64x64 bit-matrix transpose algorithm using SIMD (ARM)
  • 12-16 [Enterprise] aarch64 xtn2 clearing lower half
  • 11-30 [Enterprise] mm_shuffle_epi8 equivalent on ARM machines
  • 11-10 [Software design] Loop takes more cycles to execute than expected in an ARM Cortex-A72 CPU
  • 11-02 [other] ARM Neon intrinsics, addition of two vectors
  • 10-30 [other] Are there ARM Neon instructions for round function?
  • 10-30 [Blockchain] NEON assembly code requires more cycles on Cortex-A72 vs Cortex-A53

Copyright © 2010-2023, Powered by CodePudding