Full Code of alivanz/go-simd for AI

main f3b5f7d73797 cached
76 files
1.9 MB
634.7k tokens
7261 symbols
1 requests
Download .txt
Showing preview only (1,994K chars total). Download the full file or copy to clipboard to get everything.
Repository: alivanz/go-simd
Branch: main
Commit: f3b5f7d73797
Files: 76
Total size: 1.9 MB

Directory structure:
gitextract_6cse9yuu/

├── .gitignore
├── LICENSE
├── README.md
├── arm/
│   ├── generate.go
│   ├── neon/
│   │   ├── functions.c
│   │   ├── functions.go
│   │   ├── functions_bypass.go
│   │   ├── functions_cgo.go
│   │   ├── functions_test.go
│   │   ├── loops.c
│   │   ├── loops.go
│   │   └── loops_test.go
│   └── types.go
├── example/
│   ├── neon/
│   │   └── main.go
│   └── sse2/
│       └── main.go
├── generator/
│   ├── arm/
│   │   ├── arm.go
│   │   ├── main.go
│   │   └── sort.go
│   ├── scanner/
│   │   ├── scan.go
│   │   ├── scan_test.go
│   │   └── util.go
│   ├── types/
│   │   ├── function.go
│   │   └── type.go
│   ├── utils/
│   │   ├── download.go
│   │   ├── filter.go
│   │   └── slice.go
│   ├── writer/
│   │   ├── cgo.go
│   │   ├── function.go
│   │   ├── package.go
│   │   ├── package_test.go
│   │   ├── type.go
│   │   └── writer.go
│   └── x86/
│       ├── info.go
│       └── main.go
├── go.mod
├── go.sum
└── x86/
    ├── aes/
    │   ├── functions.c
    │   └── functions.go
    ├── avx/
    │   ├── functions.c
    │   └── functions.go
    ├── avx2/
    │   ├── functions.c
    │   └── functions.go
    ├── bmi/
    │   ├── functions.c
    │   └── functions.go
    ├── bmi2/
    │   ├── functions.c
    │   └── functions.go
    ├── crc32/
    │   ├── functions.c
    │   └── functions.go
    ├── f16c/
    │   ├── functions.c
    │   └── functions.go
    ├── fma/
    │   ├── functions.c
    │   └── functions.go
    ├── fsgsbase/
    │   ├── functions.c
    │   └── functions.go
    ├── generate.go
    ├── lzcnt/
    │   ├── functions.c
    │   └── functions.go
    ├── mmx/
    │   ├── functions.c
    │   └── functions.go
    ├── mmx_sse/
    │   ├── functions.c
    │   └── functions.go
    ├── mmx_sse2/
    │   ├── functions.c
    │   └── functions.go
    ├── mmx_ssse3/
    │   ├── functions.c
    │   └── functions.go
    ├── popcnt/
    │   ├── functions.c
    │   └── functions.go
    ├── sse/
    │   ├── functions.c
    │   └── functions.go
    ├── sse2/
    │   ├── functions.c
    │   └── functions.go
    ├── sse3/
    │   ├── functions.c
    │   └── functions.go
    ├── ssse3/
    │   ├── functions.c
    │   └── functions.go
    └── types.go

================================================
FILE CONTENTS
================================================

================================================
FILE: .gitignore
================================================
.vscode
raw.h
intrinsics.json
data.xml

================================================
FILE: LICENSE
================================================
MIT License

Copyright (c) 2023 Alivan Akbar

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.


================================================
FILE: README.md
================================================
# SIMD Implementation in Golang

This repository contains an implementation of SIMD (Single Instruction, Multiple Data) operations in Go, specifically targeting ARM NEON architecture. The goal is to provide optimized parallel processing capabilities for certain computational tasks.

## Future Plans

We are actively working on expanding the SIMD implementation to support x86 architecture as well. The upcoming x86 implementation will provide similar SIMD functionalities for parallel processing on x86-based systems.

## Hacks

When we call a C function through CGO, there are some overheads due to Go design.
In general, avoiding CGO would be a good idea.
But I found a hack, instead of relying on CGO, we can utilize `linkname` directive to call C code, bypass CGO, and get better performance.

```
goos: darwin
goarch: arm64
pkg: github.com/alivanz/go-simd/arm/neon
BenchmarkMultRef-8                131395              9168 ns/op
BenchmarkMultSimd-8               598742              1954 ns/op
BenchmarkMultSimdBypass-8         605554              1959 ns/op
BenchmarkMultSimdFull-8          1816879               661.3 ns/op
BenchmarkMultSimdCgo-8             13020             92213 ns/op
PASS
```

```
goos: darwin
goarch: arm64
pkg: github.com/alivanz/go-simd/arm/neon
cpu: Apple M2
BenchmarkVmulqF32N-8                8848            124616 ns/op        33657.86 MB/s       1422 B/op          0 allocs/op
BenchmarkVmulqF32C-8                2256            528683 ns/op        7933.49 MB/s        5577 B/op          0 allocs/op
BenchmarkVmulqF32Ref-8              3630            327995 ns/op        12787.69 MB/s       3466 B/op          0 allocs/op
PASS
ok      github.com/alivanz/go-simd/arm/neon     5.793s
```

The floating-point multiplication benchmarks demonstrate significant performance differences between implementations:

- `VmulqF32N` (Native): Achieves the highest throughput at 33.6 GB/s with minimal memory allocation (1422 B/op). This implementation leverages direct SIMD instructions for optimal performance.
- `VmulqF32C` (C): Shows the lowest performance at 7.9 GB/s with higher memory allocation (5577 B/op), likely due to the overhead of CGO calls and memory management.
- `VmulqF32Ref` (Reference): Performs at 12.8 GB/s with moderate memory usage (3466 B/op), serving as a baseline for comparison.

These results highlight the importance of using native SIMD implementations over CGO-based solutions for performance-critical applications. The native implementation is approximately 2.6x faster than the reference implementation, while the C implementation is about 1.6x slower than the reference.

## Features

- SIMD operations for ARM NEON architecture.
- High-performance parallel processing for specific tasks.
- Utilizes the power of SIMD instructions to process multiple data elements simultaneously.
- Supports a range of data types, including integers and floating-point numbers.
- Modular design for easy integration into existing projects.
- Well-documented code for understanding and extending the implementation.

## Roadmap

- [x] Implement SIMD operations for ARM NEON architecture.
- [ ] Add support for x86 architecture.
- [ ] Expand SIMD operations for additional data types.
- [ ] Optimize performance for specific use cases.
- [ ] Develop comprehensive test suite for validation.

## Usage

To use the SIMD implementations in your project, follow these steps:

1. Import the required package in your Go code:

```go
import "github.com/alivanz/go-simd"
```

2. Use the SIMD functions in your code as needed. Example:

```go
package main

import (
	"log"

	"github.com/alivanz/go-simd/arm"
	"github.com/alivanz/go-simd/arm/neon"
)

func main() {
	var a, b arm.Int8X8
	var add, mul arm.Int16X8
	for i := 0; i < 8; i++ {
		a[i] = arm.Int8(i)
		b[i] = arm.Int8(i * i)
	}
	log.Printf("a = %+v", a)
	log.Printf("b = %+v", b)
	neon.VaddlS8(&add, &a, &b)
	neon.VmullS8(&mul, &a, &b)
	log.Printf("add = %+v", add)
	log.Printf("mul = %+v", mul)
}

```

## Supported Operations

Only ARM Neon supported, for now.

Refer to the documentation in each respective file for more details on how to use each operation.

## Contributing

Contributions to this project are welcome. To contribute, please follow these steps:

1. Fork the repository.
2. Create a new branch for your feature or bug fix.
3. Make your changes and commit them with descriptive messages.
4. Push your changes to your forked repository.
5. Submit a pull request to the main repository.

Please ensure that your code follows the existing code style and includes appropriate tests.

## Acknowledgments

- The ARM NEON architecture documentation for providing valuable insights into SIMD programming techniques.
- The open-source community for their contributions and inspiration.

## Contact

For any questions or feedback regarding this repository, please feel free to contact me at [alivan1627@gmail.com](mailto:alivan1627@gmail.com)

================================================
FILE: arm/generate.go
================================================
package arm

//go:generate go run ../generator/arm


================================================
FILE: arm/neon/functions.c
================================================
#include <arm_neon.h>

void VabaS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vaba_s8(*v0, *v1, *v2); }
void VabaS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vaba_s16(*v0, *v1, *v2); }
void VabaS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vaba_s32(*v0, *v1, *v2); }
void VabaU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) { *r = vaba_u8(*v0, *v1, *v2); }
void VabaU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16x4_t* v2) { *r = vaba_u16(*v0, *v1, *v2); }
void VabaU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32x2_t* v2) { *r = vaba_u32(*v0, *v1, *v2); }
void VabalS8(int16x8_t* r, int16x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vabal_s8(*v0, *v1, *v2); }
void VabalS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vabal_s16(*v0, *v1, *v2); }
void VabalS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vabal_s32(*v0, *v1, *v2); }
void VabalU8(uint16x8_t* r, uint16x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) { *r = vabal_u8(*v0, *v1, *v2); }
void VabalU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16x4_t* v2) { *r = vabal_u16(*v0, *v1, *v2); }
void VabalU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32x2_t* v2) { *r = vabal_u32(*v0, *v1, *v2); }
void VabalHighS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vabal_high_s8(*v0, *v1, *v2); }
void VabalHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vabal_high_s16(*v0, *v1, *v2); }
void VabalHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vabal_high_s32(*v0, *v1, *v2); }
void VabalHighU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vabal_high_u8(*v0, *v1, *v2); }
void VabalHighU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vabal_high_u16(*v0, *v1, *v2); }
void VabalHighU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vabal_high_u32(*v0, *v1, *v2); }
void VabaqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vabaq_s8(*v0, *v1, *v2); }
void VabaqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vabaq_s16(*v0, *v1, *v2); }
void VabaqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vabaq_s32(*v0, *v1, *v2); }
void VabaqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vabaq_u8(*v0, *v1, *v2); }
void VabaqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vabaq_u16(*v0, *v1, *v2); }
void VabaqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vabaq_u32(*v0, *v1, *v2); }
void VabdS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vabd_s8(*v0, *v1); }
void VabdS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vabd_s16(*v0, *v1); }
void VabdS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vabd_s32(*v0, *v1); }
void VabdU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vabd_u8(*v0, *v1); }
void VabdU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vabd_u16(*v0, *v1); }
void VabdU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vabd_u32(*v0, *v1); }
void VabdF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vabd_f32(*v0, *v1); }
void VabdF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vabd_f64(*v0, *v1); }
void VabddF64(float64_t* r, float64_t* v0, float64_t* v1) { *r = vabdd_f64(*v0, *v1); }
void VabdlS8(int16x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vabdl_s8(*v0, *v1); }
void VabdlS16(int32x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vabdl_s16(*v0, *v1); }
void VabdlS32(int64x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vabdl_s32(*v0, *v1); }
void VabdlU8(uint16x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vabdl_u8(*v0, *v1); }
void VabdlU16(uint32x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vabdl_u16(*v0, *v1); }
void VabdlU32(uint64x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vabdl_u32(*v0, *v1); }
void VabdlHighS8(int16x8_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vabdl_high_s8(*v0, *v1); }
void VabdlHighS16(int32x4_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vabdl_high_s16(*v0, *v1); }
void VabdlHighS32(int64x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vabdl_high_s32(*v0, *v1); }
void VabdlHighU8(uint16x8_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vabdl_high_u8(*v0, *v1); }
void VabdlHighU16(uint32x4_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vabdl_high_u16(*v0, *v1); }
void VabdlHighU32(uint64x2_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vabdl_high_u32(*v0, *v1); }
void VabdqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vabdq_s8(*v0, *v1); }
void VabdqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vabdq_s16(*v0, *v1); }
void VabdqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vabdq_s32(*v0, *v1); }
void VabdqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vabdq_u8(*v0, *v1); }
void VabdqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vabdq_u16(*v0, *v1); }
void VabdqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vabdq_u32(*v0, *v1); }
void VabdqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vabdq_f32(*v0, *v1); }
void VabdqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vabdq_f64(*v0, *v1); }
void VabdsF32(float32_t* r, float32_t* v0, float32_t* v1) { *r = vabds_f32(*v0, *v1); }
void VabsS8(int8x8_t* r, int8x8_t* v0) { *r = vabs_s8(*v0); }
void VabsS16(int16x4_t* r, int16x4_t* v0) { *r = vabs_s16(*v0); }
void VabsS32(int32x2_t* r, int32x2_t* v0) { *r = vabs_s32(*v0); }
void VabsS64(int64x1_t* r, int64x1_t* v0) { *r = vabs_s64(*v0); }
void VabsF32(float32x2_t* r, float32x2_t* v0) { *r = vabs_f32(*v0); }
void VabsF64(float64x1_t* r, float64x1_t* v0) { *r = vabs_f64(*v0); }
void VabsdS64(int64_t* r, int64_t* v0) { *r = vabsd_s64(*v0); }
void VabsqS8(int8x16_t* r, int8x16_t* v0) { *r = vabsq_s8(*v0); }
void VabsqS16(int16x8_t* r, int16x8_t* v0) { *r = vabsq_s16(*v0); }
void VabsqS32(int32x4_t* r, int32x4_t* v0) { *r = vabsq_s32(*v0); }
void VabsqS64(int64x2_t* r, int64x2_t* v0) { *r = vabsq_s64(*v0); }
void VabsqF32(float32x4_t* r, float32x4_t* v0) { *r = vabsq_f32(*v0); }
void VabsqF64(float64x2_t* r, float64x2_t* v0) { *r = vabsq_f64(*v0); }
void VaddS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vadd_s8(*v0, *v1); }
void VaddS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vadd_s16(*v0, *v1); }
void VaddS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vadd_s32(*v0, *v1); }
void VaddS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vadd_s64(*v0, *v1); }
void VaddU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vadd_u8(*v0, *v1); }
void VaddU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vadd_u16(*v0, *v1); }
void VaddU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vadd_u32(*v0, *v1); }
void VaddU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vadd_u64(*v0, *v1); }
void VaddF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vadd_f32(*v0, *v1); }
void VaddF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vadd_f64(*v0, *v1); }
void VaddP16(poly16x4_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = vadd_p16(*v0, *v1); }
void VaddP64(poly64x1_t* r, poly64x1_t* v0, poly64x1_t* v1) { *r = vadd_p64(*v0, *v1); }
void VaddP8(poly8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vadd_p8(*v0, *v1); }
void VadddS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vaddd_s64(*v0, *v1); }
void VadddU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vaddd_u64(*v0, *v1); }
void VaddhnS16(int8x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vaddhn_s16(*v0, *v1); }
void VaddhnS32(int16x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vaddhn_s32(*v0, *v1); }
void VaddhnS64(int32x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vaddhn_s64(*v0, *v1); }
void VaddhnU16(uint8x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vaddhn_u16(*v0, *v1); }
void VaddhnU32(uint16x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vaddhn_u32(*v0, *v1); }
void VaddhnU64(uint32x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vaddhn_u64(*v0, *v1); }
void VaddhnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vaddhn_high_s16(*v0, *v1, *v2); }
void VaddhnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vaddhn_high_s32(*v0, *v1, *v2); }
void VaddhnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1, int64x2_t* v2) { *r = vaddhn_high_s64(*v0, *v1, *v2); }
void VaddhnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vaddhn_high_u16(*v0, *v1, *v2); }
void VaddhnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vaddhn_high_u32(*v0, *v1, *v2); }
void VaddhnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = vaddhn_high_u64(*v0, *v1, *v2); }
void VaddlS8(int16x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vaddl_s8(*v0, *v1); }
void VaddlS16(int32x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vaddl_s16(*v0, *v1); }
void VaddlS32(int64x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vaddl_s32(*v0, *v1); }
void VaddlU8(uint16x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vaddl_u8(*v0, *v1); }
void VaddlU16(uint32x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vaddl_u16(*v0, *v1); }
void VaddlU32(uint64x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vaddl_u32(*v0, *v1); }
void VaddlHighS8(int16x8_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vaddl_high_s8(*v0, *v1); }
void VaddlHighS16(int32x4_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vaddl_high_s16(*v0, *v1); }
void VaddlHighS32(int64x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vaddl_high_s32(*v0, *v1); }
void VaddlHighU8(uint16x8_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vaddl_high_u8(*v0, *v1); }
void VaddlHighU16(uint32x4_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vaddl_high_u16(*v0, *v1); }
void VaddlHighU32(uint64x2_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vaddl_high_u32(*v0, *v1); }
void VaddlvS8(int16_t* r, int8x8_t* v0) { *r = vaddlv_s8(*v0); }
void VaddlvS16(int32_t* r, int16x4_t* v0) { *r = vaddlv_s16(*v0); }
void VaddlvS32(int64_t* r, int32x2_t* v0) { *r = vaddlv_s32(*v0); }
void VaddlvU8(uint16_t* r, uint8x8_t* v0) { *r = vaddlv_u8(*v0); }
void VaddlvU16(uint32_t* r, uint16x4_t* v0) { *r = vaddlv_u16(*v0); }
void VaddlvU32(uint64_t* r, uint32x2_t* v0) { *r = vaddlv_u32(*v0); }
void VaddlvqS8(int16_t* r, int8x16_t* v0) { *r = vaddlvq_s8(*v0); }
void VaddlvqS16(int32_t* r, int16x8_t* v0) { *r = vaddlvq_s16(*v0); }
void VaddlvqS32(int64_t* r, int32x4_t* v0) { *r = vaddlvq_s32(*v0); }
void VaddlvqU8(uint16_t* r, uint8x16_t* v0) { *r = vaddlvq_u8(*v0); }
void VaddlvqU16(uint32_t* r, uint16x8_t* v0) { *r = vaddlvq_u16(*v0); }
void VaddlvqU32(uint64_t* r, uint32x4_t* v0) { *r = vaddlvq_u32(*v0); }
void VaddqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vaddq_s8(*v0, *v1); }
void VaddqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vaddq_s16(*v0, *v1); }
void VaddqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vaddq_s32(*v0, *v1); }
void VaddqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vaddq_s64(*v0, *v1); }
void VaddqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vaddq_u8(*v0, *v1); }
void VaddqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vaddq_u16(*v0, *v1); }
void VaddqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vaddq_u32(*v0, *v1); }
void VaddqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vaddq_u64(*v0, *v1); }
void VaddqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vaddq_f32(*v0, *v1); }
void VaddqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vaddq_f64(*v0, *v1); }
void VaddqP128(poly128_t* r, poly128_t* v0, poly128_t* v1) { *r = vaddq_p128(*v0, *v1); }
void VaddqP16(poly16x8_t* r, poly16x8_t* v0, poly16x8_t* v1) { *r = vaddq_p16(*v0, *v1); }
void VaddqP64(poly64x2_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = vaddq_p64(*v0, *v1); }
void VaddqP8(poly8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vaddq_p8(*v0, *v1); }
void VaddvS8(int8_t* r, int8x8_t* v0) { *r = vaddv_s8(*v0); }
void VaddvS16(int16_t* r, int16x4_t* v0) { *r = vaddv_s16(*v0); }
void VaddvS32(int32_t* r, int32x2_t* v0) { *r = vaddv_s32(*v0); }
void VaddvU8(uint8_t* r, uint8x8_t* v0) { *r = vaddv_u8(*v0); }
void VaddvU16(uint16_t* r, uint16x4_t* v0) { *r = vaddv_u16(*v0); }
void VaddvU32(uint32_t* r, uint32x2_t* v0) { *r = vaddv_u32(*v0); }
void VaddvF32(float32_t* r, float32x2_t* v0) { *r = vaddv_f32(*v0); }
void VaddvqS8(int8_t* r, int8x16_t* v0) { *r = vaddvq_s8(*v0); }
void VaddvqS16(int16_t* r, int16x8_t* v0) { *r = vaddvq_s16(*v0); }
void VaddvqS32(int32_t* r, int32x4_t* v0) { *r = vaddvq_s32(*v0); }
void VaddvqS64(int64_t* r, int64x2_t* v0) { *r = vaddvq_s64(*v0); }
void VaddvqU8(uint8_t* r, uint8x16_t* v0) { *r = vaddvq_u8(*v0); }
void VaddvqU16(uint16_t* r, uint16x8_t* v0) { *r = vaddvq_u16(*v0); }
void VaddvqU32(uint32_t* r, uint32x4_t* v0) { *r = vaddvq_u32(*v0); }
void VaddvqU64(uint64_t* r, uint64x2_t* v0) { *r = vaddvq_u64(*v0); }
void VaddvqF32(float32_t* r, float32x4_t* v0) { *r = vaddvq_f32(*v0); }
void VaddvqF64(float64_t* r, float64x2_t* v0) { *r = vaddvq_f64(*v0); }
void VaddwS8(int16x8_t* r, int16x8_t* v0, int8x8_t* v1) { *r = vaddw_s8(*v0, *v1); }
void VaddwS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1) { *r = vaddw_s16(*v0, *v1); }
void VaddwS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1) { *r = vaddw_s32(*v0, *v1); }
void VaddwU8(uint16x8_t* r, uint16x8_t* v0, uint8x8_t* v1) { *r = vaddw_u8(*v0, *v1); }
void VaddwU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1) { *r = vaddw_u16(*v0, *v1); }
void VaddwU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1) { *r = vaddw_u32(*v0, *v1); }
void VaddwHighS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1) { *r = vaddw_high_s8(*v0, *v1); }
void VaddwHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1) { *r = vaddw_high_s16(*v0, *v1); }
void VaddwHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1) { *r = vaddw_high_s32(*v0, *v1); }
void VaddwHighU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1) { *r = vaddw_high_u8(*v0, *v1); }
void VaddwHighU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1) { *r = vaddw_high_u16(*v0, *v1); }
void VaddwHighU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1) { *r = vaddw_high_u32(*v0, *v1); }
void VaesdqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vaesdq_u8(*v0, *v1); }
void VaeseqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vaeseq_u8(*v0, *v1); }
void VaesimcqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vaesimcq_u8(*v0); }
void VaesmcqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vaesmcq_u8(*v0); }
void VandS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vand_s8(*v0, *v1); }
void VandS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vand_s16(*v0, *v1); }
void VandS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vand_s32(*v0, *v1); }
void VandS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vand_s64(*v0, *v1); }
void VandU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vand_u8(*v0, *v1); }
void VandU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vand_u16(*v0, *v1); }
void VandU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vand_u32(*v0, *v1); }
void VandU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vand_u64(*v0, *v1); }
void VandqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vandq_s8(*v0, *v1); }
void VandqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vandq_s16(*v0, *v1); }
void VandqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vandq_s32(*v0, *v1); }
void VandqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vandq_s64(*v0, *v1); }
void VandqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vandq_u8(*v0, *v1); }
void VandqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vandq_u16(*v0, *v1); }
void VandqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vandq_u32(*v0, *v1); }
void VandqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vandq_u64(*v0, *v1); }
void VbcaxqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vbcaxq_s8(*v0, *v1, *v2); }
void VbcaxqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vbcaxq_s16(*v0, *v1, *v2); }
void VbcaxqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vbcaxq_s32(*v0, *v1, *v2); }
void VbcaxqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1, int64x2_t* v2) { *r = vbcaxq_s64(*v0, *v1, *v2); }
void VbcaxqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vbcaxq_u8(*v0, *v1, *v2); }
void VbcaxqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vbcaxq_u16(*v0, *v1, *v2); }
void VbcaxqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vbcaxq_u32(*v0, *v1, *v2); }
void VbcaxqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = vbcaxq_u64(*v0, *v1, *v2); }
void VbicS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vbic_s8(*v0, *v1); }
void VbicS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vbic_s16(*v0, *v1); }
void VbicS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vbic_s32(*v0, *v1); }
void VbicS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vbic_s64(*v0, *v1); }
void VbicU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vbic_u8(*v0, *v1); }
void VbicU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vbic_u16(*v0, *v1); }
void VbicU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vbic_u32(*v0, *v1); }
void VbicU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vbic_u64(*v0, *v1); }
void VbicqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vbicq_s8(*v0, *v1); }
void VbicqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vbicq_s16(*v0, *v1); }
void VbicqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vbicq_s32(*v0, *v1); }
void VbicqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vbicq_s64(*v0, *v1); }
void VbicqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vbicq_u8(*v0, *v1); }
void VbicqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vbicq_u16(*v0, *v1); }
void VbicqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vbicq_u32(*v0, *v1); }
void VbicqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vbicq_u64(*v0, *v1); }
void VbslS8(int8x8_t* r, uint8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vbsl_s8(*v0, *v1, *v2); }
void VbslS16(int16x4_t* r, uint16x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vbsl_s16(*v0, *v1, *v2); }
void VbslS32(int32x2_t* r, uint32x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vbsl_s32(*v0, *v1, *v2); }
void VbslS64(int64x1_t* r, uint64x1_t* v0, int64x1_t* v1, int64x1_t* v2) { *r = vbsl_s64(*v0, *v1, *v2); }
void VbslU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) { *r = vbsl_u8(*v0, *v1, *v2); }
void VbslU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16x4_t* v2) { *r = vbsl_u16(*v0, *v1, *v2); }
void VbslU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32x2_t* v2) { *r = vbsl_u32(*v0, *v1, *v2); }
void VbslU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1, uint64x1_t* v2) { *r = vbsl_u64(*v0, *v1, *v2); }
void VbslF32(float32x2_t* r, uint32x2_t* v0, float32x2_t* v1, float32x2_t* v2) { *r = vbsl_f32(*v0, *v1, *v2); }
void VbslF64(float64x1_t* r, uint64x1_t* v0, float64x1_t* v1, float64x1_t* v2) { *r = vbsl_f64(*v0, *v1, *v2); }
void VbslP16(poly16x4_t* r, uint16x4_t* v0, poly16x4_t* v1, poly16x4_t* v2) { *r = vbsl_p16(*v0, *v1, *v2); }
void VbslP64(poly64x1_t* r, uint64x1_t* v0, poly64x1_t* v1, poly64x1_t* v2) { *r = vbsl_p64(*v0, *v1, *v2); }
void VbslP8(poly8x8_t* r, uint8x8_t* v0, poly8x8_t* v1, poly8x8_t* v2) { *r = vbsl_p8(*v0, *v1, *v2); }
void VbslqS8(int8x16_t* r, uint8x16_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vbslq_s8(*v0, *v1, *v2); }
void VbslqS16(int16x8_t* r, uint16x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vbslq_s16(*v0, *v1, *v2); }
void VbslqS32(int32x4_t* r, uint32x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vbslq_s32(*v0, *v1, *v2); }
void VbslqS64(int64x2_t* r, uint64x2_t* v0, int64x2_t* v1, int64x2_t* v2) { *r = vbslq_s64(*v0, *v1, *v2); }
void VbslqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vbslq_u8(*v0, *v1, *v2); }
void VbslqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vbslq_u16(*v0, *v1, *v2); }
void VbslqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vbslq_u32(*v0, *v1, *v2); }
void VbslqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = vbslq_u64(*v0, *v1, *v2); }
void VbslqF32(float32x4_t* r, uint32x4_t* v0, float32x4_t* v1, float32x4_t* v2) { *r = vbslq_f32(*v0, *v1, *v2); }
void VbslqF64(float64x2_t* r, uint64x2_t* v0, float64x2_t* v1, float64x2_t* v2) { *r = vbslq_f64(*v0, *v1, *v2); }
void VbslqP16(poly16x8_t* r, uint16x8_t* v0, poly16x8_t* v1, poly16x8_t* v2) { *r = vbslq_p16(*v0, *v1, *v2); }
void VbslqP64(poly64x2_t* r, uint64x2_t* v0, poly64x2_t* v1, poly64x2_t* v2) { *r = vbslq_p64(*v0, *v1, *v2); }
void VbslqP8(poly8x16_t* r, uint8x16_t* v0, poly8x16_t* v1, poly8x16_t* v2) { *r = vbslq_p8(*v0, *v1, *v2); }
void VcaddRot270F32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcadd_rot270_f32(*v0, *v1); }
void VcaddRot90F32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcadd_rot90_f32(*v0, *v1); }
void VcaddqRot270F32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcaddq_rot270_f32(*v0, *v1); }
void VcaddqRot270F64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcaddq_rot270_f64(*v0, *v1); }
void VcaddqRot90F32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcaddq_rot90_f32(*v0, *v1); }
void VcaddqRot90F64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcaddq_rot90_f64(*v0, *v1); }
void VcageF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcage_f32(*v0, *v1); }
void VcageF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcage_f64(*v0, *v1); }
void VcagedF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcaged_f64(*v0, *v1); }
void VcageqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcageq_f32(*v0, *v1); }
void VcageqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcageq_f64(*v0, *v1); }
void VcagesF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcages_f32(*v0, *v1); }
void VcagtF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcagt_f32(*v0, *v1); }
void VcagtF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcagt_f64(*v0, *v1); }
void VcagtdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcagtd_f64(*v0, *v1); }
void VcagtqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcagtq_f32(*v0, *v1); }
void VcagtqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcagtq_f64(*v0, *v1); }
void VcagtsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcagts_f32(*v0, *v1); }
void VcaleF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcale_f32(*v0, *v1); }
void VcaleF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcale_f64(*v0, *v1); }
void VcaledF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcaled_f64(*v0, *v1); }
void VcaleqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcaleq_f32(*v0, *v1); }
void VcaleqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcaleq_f64(*v0, *v1); }
void VcalesF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcales_f32(*v0, *v1); }
void VcaltF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcalt_f32(*v0, *v1); }
void VcaltF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcalt_f64(*v0, *v1); }
void VcaltdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcaltd_f64(*v0, *v1); }
void VcaltqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcaltq_f32(*v0, *v1); }
void VcaltqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcaltq_f64(*v0, *v1); }
void VcaltsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcalts_f32(*v0, *v1); }
void VceqS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vceq_s8(*v0, *v1); }
void VceqS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vceq_s16(*v0, *v1); }
void VceqS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vceq_s32(*v0, *v1); }
void VceqS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vceq_s64(*v0, *v1); }
void VceqU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vceq_u8(*v0, *v1); }
void VceqU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vceq_u16(*v0, *v1); }
void VceqU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vceq_u32(*v0, *v1); }
void VceqU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vceq_u64(*v0, *v1); }
void VceqF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vceq_f32(*v0, *v1); }
void VceqF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vceq_f64(*v0, *v1); }
void VceqP64(uint64x1_t* r, poly64x1_t* v0, poly64x1_t* v1) { *r = vceq_p64(*v0, *v1); }
void VceqP8(uint8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vceq_p8(*v0, *v1); }
void VceqdS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vceqd_s64(*v0, *v1); }
void VceqdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vceqd_u64(*v0, *v1); }
void VceqdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vceqd_f64(*v0, *v1); }
void VceqqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vceqq_s8(*v0, *v1); }
void VceqqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vceqq_s16(*v0, *v1); }
void VceqqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vceqq_s32(*v0, *v1); }
void VceqqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vceqq_s64(*v0, *v1); }
void VceqqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vceqq_u8(*v0, *v1); }
void VceqqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vceqq_u16(*v0, *v1); }
void VceqqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vceqq_u32(*v0, *v1); }
void VceqqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vceqq_u64(*v0, *v1); }
void VceqqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vceqq_f32(*v0, *v1); }
void VceqqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vceqq_f64(*v0, *v1); }
void VceqqP64(uint64x2_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = vceqq_p64(*v0, *v1); }
void VceqqP8(uint8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vceqq_p8(*v0, *v1); }
void VceqsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vceqs_f32(*v0, *v1); }
void VceqzS8(uint8x8_t* r, int8x8_t* v0) { *r = vceqz_s8(*v0); }
void VceqzS16(uint16x4_t* r, int16x4_t* v0) { *r = vceqz_s16(*v0); }
void VceqzS32(uint32x2_t* r, int32x2_t* v0) { *r = vceqz_s32(*v0); }
void VceqzS64(uint64x1_t* r, int64x1_t* v0) { *r = vceqz_s64(*v0); }
void VceqzU8(uint8x8_t* r, uint8x8_t* v0) { *r = vceqz_u8(*v0); }
void VceqzU16(uint16x4_t* r, uint16x4_t* v0) { *r = vceqz_u16(*v0); }
void VceqzU32(uint32x2_t* r, uint32x2_t* v0) { *r = vceqz_u32(*v0); }
void VceqzU64(uint64x1_t* r, uint64x1_t* v0) { *r = vceqz_u64(*v0); }
void VceqzF32(uint32x2_t* r, float32x2_t* v0) { *r = vceqz_f32(*v0); }
void VceqzF64(uint64x1_t* r, float64x1_t* v0) { *r = vceqz_f64(*v0); }
void VceqzP64(uint64x1_t* r, poly64x1_t* v0) { *r = vceqz_p64(*v0); }
void VceqzP8(uint8x8_t* r, poly8x8_t* v0) { *r = vceqz_p8(*v0); }
void VceqzdS64(uint64_t* r, int64_t* v0) { *r = vceqzd_s64(*v0); }
void VceqzdU64(uint64_t* r, uint64_t* v0) { *r = vceqzd_u64(*v0); }
void VceqzdF64(uint64_t* r, float64_t* v0) { *r = vceqzd_f64(*v0); }
void VceqzqS8(uint8x16_t* r, int8x16_t* v0) { *r = vceqzq_s8(*v0); }
void VceqzqS16(uint16x8_t* r, int16x8_t* v0) { *r = vceqzq_s16(*v0); }
void VceqzqS32(uint32x4_t* r, int32x4_t* v0) { *r = vceqzq_s32(*v0); }
void VceqzqS64(uint64x2_t* r, int64x2_t* v0) { *r = vceqzq_s64(*v0); }
void VceqzqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vceqzq_u8(*v0); }
void VceqzqU16(uint16x8_t* r, uint16x8_t* v0) { *r = vceqzq_u16(*v0); }
void VceqzqU32(uint32x4_t* r, uint32x4_t* v0) { *r = vceqzq_u32(*v0); }
void VceqzqU64(uint64x2_t* r, uint64x2_t* v0) { *r = vceqzq_u64(*v0); }
void VceqzqF32(uint32x4_t* r, float32x4_t* v0) { *r = vceqzq_f32(*v0); }
void VceqzqF64(uint64x2_t* r, float64x2_t* v0) { *r = vceqzq_f64(*v0); }
void VceqzqP64(uint64x2_t* r, poly64x2_t* v0) { *r = vceqzq_p64(*v0); }
void VceqzqP8(uint8x16_t* r, poly8x16_t* v0) { *r = vceqzq_p8(*v0); }
void VceqzsF32(uint32_t* r, float32_t* v0) { *r = vceqzs_f32(*v0); }
void VcgeS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vcge_s8(*v0, *v1); }
void VcgeS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vcge_s16(*v0, *v1); }
void VcgeS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vcge_s32(*v0, *v1); }
void VcgeS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vcge_s64(*v0, *v1); }
void VcgeU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vcge_u8(*v0, *v1); }
void VcgeU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vcge_u16(*v0, *v1); }
void VcgeU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vcge_u32(*v0, *v1); }
void VcgeU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vcge_u64(*v0, *v1); }
void VcgeF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcge_f32(*v0, *v1); }
void VcgeF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcge_f64(*v0, *v1); }
void VcgedS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vcged_s64(*v0, *v1); }
void VcgedU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vcged_u64(*v0, *v1); }
void VcgedF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcged_f64(*v0, *v1); }
void VcgeqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vcgeq_s8(*v0, *v1); }
void VcgeqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vcgeq_s16(*v0, *v1); }
void VcgeqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vcgeq_s32(*v0, *v1); }
void VcgeqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vcgeq_s64(*v0, *v1); }
void VcgeqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vcgeq_u8(*v0, *v1); }
void VcgeqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vcgeq_u16(*v0, *v1); }
void VcgeqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vcgeq_u32(*v0, *v1); }
void VcgeqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vcgeq_u64(*v0, *v1); }
void VcgeqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcgeq_f32(*v0, *v1); }
void VcgeqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcgeq_f64(*v0, *v1); }
void VcgesF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcges_f32(*v0, *v1); }
void VcgezS8(uint8x8_t* r, int8x8_t* v0) { *r = vcgez_s8(*v0); }
void VcgezS16(uint16x4_t* r, int16x4_t* v0) { *r = vcgez_s16(*v0); }
void VcgezS32(uint32x2_t* r, int32x2_t* v0) { *r = vcgez_s32(*v0); }
void VcgezS64(uint64x1_t* r, int64x1_t* v0) { *r = vcgez_s64(*v0); }
void VcgezF32(uint32x2_t* r, float32x2_t* v0) { *r = vcgez_f32(*v0); }
void VcgezF64(uint64x1_t* r, float64x1_t* v0) { *r = vcgez_f64(*v0); }
void VcgezdS64(uint64_t* r, int64_t* v0) { *r = vcgezd_s64(*v0); }
void VcgezdF64(uint64_t* r, float64_t* v0) { *r = vcgezd_f64(*v0); }
void VcgezqS8(uint8x16_t* r, int8x16_t* v0) { *r = vcgezq_s8(*v0); }
void VcgezqS16(uint16x8_t* r, int16x8_t* v0) { *r = vcgezq_s16(*v0); }
void VcgezqS32(uint32x4_t* r, int32x4_t* v0) { *r = vcgezq_s32(*v0); }
void VcgezqS64(uint64x2_t* r, int64x2_t* v0) { *r = vcgezq_s64(*v0); }
void VcgezqF32(uint32x4_t* r, float32x4_t* v0) { *r = vcgezq_f32(*v0); }
void VcgezqF64(uint64x2_t* r, float64x2_t* v0) { *r = vcgezq_f64(*v0); }
void VcgezsF32(uint32_t* r, float32_t* v0) { *r = vcgezs_f32(*v0); }
void VcgtS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vcgt_s8(*v0, *v1); }
void VcgtS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vcgt_s16(*v0, *v1); }
void VcgtS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vcgt_s32(*v0, *v1); }
void VcgtS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vcgt_s64(*v0, *v1); }
void VcgtU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vcgt_u8(*v0, *v1); }
void VcgtU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vcgt_u16(*v0, *v1); }
void VcgtU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vcgt_u32(*v0, *v1); }
void VcgtU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vcgt_u64(*v0, *v1); }
void VcgtF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcgt_f32(*v0, *v1); }
void VcgtF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcgt_f64(*v0, *v1); }
void VcgtdS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vcgtd_s64(*v0, *v1); }
void VcgtdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vcgtd_u64(*v0, *v1); }
void VcgtdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcgtd_f64(*v0, *v1); }
void VcgtqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vcgtq_s8(*v0, *v1); }
void VcgtqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vcgtq_s16(*v0, *v1); }
void VcgtqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vcgtq_s32(*v0, *v1); }
void VcgtqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vcgtq_s64(*v0, *v1); }
void VcgtqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vcgtq_u8(*v0, *v1); }
void VcgtqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vcgtq_u16(*v0, *v1); }
void VcgtqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vcgtq_u32(*v0, *v1); }
void VcgtqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vcgtq_u64(*v0, *v1); }
void VcgtqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcgtq_f32(*v0, *v1); }
void VcgtqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcgtq_f64(*v0, *v1); }
void VcgtsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcgts_f32(*v0, *v1); }
void VcgtzS8(uint8x8_t* r, int8x8_t* v0) { *r = vcgtz_s8(*v0); }
void VcgtzS16(uint16x4_t* r, int16x4_t* v0) { *r = vcgtz_s16(*v0); }
void VcgtzS32(uint32x2_t* r, int32x2_t* v0) { *r = vcgtz_s32(*v0); }
void VcgtzS64(uint64x1_t* r, int64x1_t* v0) { *r = vcgtz_s64(*v0); }
void VcgtzF32(uint32x2_t* r, float32x2_t* v0) { *r = vcgtz_f32(*v0); }
void VcgtzF64(uint64x1_t* r, float64x1_t* v0) { *r = vcgtz_f64(*v0); }
void VcgtzdS64(uint64_t* r, int64_t* v0) { *r = vcgtzd_s64(*v0); }
void VcgtzdF64(uint64_t* r, float64_t* v0) { *r = vcgtzd_f64(*v0); }
void VcgtzqS8(uint8x16_t* r, int8x16_t* v0) { *r = vcgtzq_s8(*v0); }
void VcgtzqS16(uint16x8_t* r, int16x8_t* v0) { *r = vcgtzq_s16(*v0); }
void VcgtzqS32(uint32x4_t* r, int32x4_t* v0) { *r = vcgtzq_s32(*v0); }
void VcgtzqS64(uint64x2_t* r, int64x2_t* v0) { *r = vcgtzq_s64(*v0); }
void VcgtzqF32(uint32x4_t* r, float32x4_t* v0) { *r = vcgtzq_f32(*v0); }
void VcgtzqF64(uint64x2_t* r, float64x2_t* v0) { *r = vcgtzq_f64(*v0); }
void VcgtzsF32(uint32_t* r, float32_t* v0) { *r = vcgtzs_f32(*v0); }
void VcleS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vcle_s8(*v0, *v1); }
void VcleS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vcle_s16(*v0, *v1); }
void VcleS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vcle_s32(*v0, *v1); }
void VcleS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vcle_s64(*v0, *v1); }
void VcleU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vcle_u8(*v0, *v1); }
void VcleU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vcle_u16(*v0, *v1); }
void VcleU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vcle_u32(*v0, *v1); }
void VcleU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vcle_u64(*v0, *v1); }
void VcleF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcle_f32(*v0, *v1); }
void VcleF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcle_f64(*v0, *v1); }
void VcledS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vcled_s64(*v0, *v1); }
void VcledU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vcled_u64(*v0, *v1); }
void VcledF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcled_f64(*v0, *v1); }
void VcleqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vcleq_s8(*v0, *v1); }
void VcleqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vcleq_s16(*v0, *v1); }
void VcleqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vcleq_s32(*v0, *v1); }
void VcleqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vcleq_s64(*v0, *v1); }
void VcleqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vcleq_u8(*v0, *v1); }
void VcleqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vcleq_u16(*v0, *v1); }
void VcleqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vcleq_u32(*v0, *v1); }
void VcleqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vcleq_u64(*v0, *v1); }
void VcleqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcleq_f32(*v0, *v1); }
void VcleqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcleq_f64(*v0, *v1); }
void VclesF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcles_f32(*v0, *v1); }
void VclezS8(uint8x8_t* r, int8x8_t* v0) { *r = vclez_s8(*v0); }
void VclezS16(uint16x4_t* r, int16x4_t* v0) { *r = vclez_s16(*v0); }
void VclezS32(uint32x2_t* r, int32x2_t* v0) { *r = vclez_s32(*v0); }
void VclezS64(uint64x1_t* r, int64x1_t* v0) { *r = vclez_s64(*v0); }
void VclezF32(uint32x2_t* r, float32x2_t* v0) { *r = vclez_f32(*v0); }
void VclezF64(uint64x1_t* r, float64x1_t* v0) { *r = vclez_f64(*v0); }
void VclezdS64(uint64_t* r, int64_t* v0) { *r = vclezd_s64(*v0); }
void VclezdF64(uint64_t* r, float64_t* v0) { *r = vclezd_f64(*v0); }
void VclezqS8(uint8x16_t* r, int8x16_t* v0) { *r = vclezq_s8(*v0); }
void VclezqS16(uint16x8_t* r, int16x8_t* v0) { *r = vclezq_s16(*v0); }
void VclezqS32(uint32x4_t* r, int32x4_t* v0) { *r = vclezq_s32(*v0); }
void VclezqS64(uint64x2_t* r, int64x2_t* v0) { *r = vclezq_s64(*v0); }
void VclezqF32(uint32x4_t* r, float32x4_t* v0) { *r = vclezq_f32(*v0); }
void VclezqF64(uint64x2_t* r, float64x2_t* v0) { *r = vclezq_f64(*v0); }
void VclezsF32(uint32_t* r, float32_t* v0) { *r = vclezs_f32(*v0); }
void VclsS8(int8x8_t* r, int8x8_t* v0) { *r = vcls_s8(*v0); }
void VclsS16(int16x4_t* r, int16x4_t* v0) { *r = vcls_s16(*v0); }
void VclsS32(int32x2_t* r, int32x2_t* v0) { *r = vcls_s32(*v0); }
void VclsU8(int8x8_t* r, uint8x8_t* v0) { *r = vcls_u8(*v0); }
void VclsU16(int16x4_t* r, uint16x4_t* v0) { *r = vcls_u16(*v0); }
void VclsU32(int32x2_t* r, uint32x2_t* v0) { *r = vcls_u32(*v0); }
void VclsqS8(int8x16_t* r, int8x16_t* v0) { *r = vclsq_s8(*v0); }
void VclsqS16(int16x8_t* r, int16x8_t* v0) { *r = vclsq_s16(*v0); }
void VclsqS32(int32x4_t* r, int32x4_t* v0) { *r = vclsq_s32(*v0); }
void VclsqU8(int8x16_t* r, uint8x16_t* v0) { *r = vclsq_u8(*v0); }
void VclsqU16(int16x8_t* r, uint16x8_t* v0) { *r = vclsq_u16(*v0); }
void VclsqU32(int32x4_t* r, uint32x4_t* v0) { *r = vclsq_u32(*v0); }
void VcltS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vclt_s8(*v0, *v1); }
void VcltS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vclt_s16(*v0, *v1); }
void VcltS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vclt_s32(*v0, *v1); }
void VcltS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vclt_s64(*v0, *v1); }
void VcltU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vclt_u8(*v0, *v1); }
void VcltU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vclt_u16(*v0, *v1); }
void VcltU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vclt_u32(*v0, *v1); }
void VcltU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vclt_u64(*v0, *v1); }
void VcltF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vclt_f32(*v0, *v1); }
void VcltF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vclt_f64(*v0, *v1); }
void VcltdS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vcltd_s64(*v0, *v1); }
void VcltdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vcltd_u64(*v0, *v1); }
void VcltdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcltd_f64(*v0, *v1); }
void VcltqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vcltq_s8(*v0, *v1); }
void VcltqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vcltq_s16(*v0, *v1); }
void VcltqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vcltq_s32(*v0, *v1); }
void VcltqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vcltq_s64(*v0, *v1); }
void VcltqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vcltq_u8(*v0, *v1); }
void VcltqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vcltq_u16(*v0, *v1); }
void VcltqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vcltq_u32(*v0, *v1); }
void VcltqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vcltq_u64(*v0, *v1); }
void VcltqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vcltq_f32(*v0, *v1); }
void VcltqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vcltq_f64(*v0, *v1); }
void VcltsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vclts_f32(*v0, *v1); }
void VcltzS8(uint8x8_t* r, int8x8_t* v0) { *r = vcltz_s8(*v0); }
void VcltzS16(uint16x4_t* r, int16x4_t* v0) { *r = vcltz_s16(*v0); }
void VcltzS32(uint32x2_t* r, int32x2_t* v0) { *r = vcltz_s32(*v0); }
void VcltzS64(uint64x1_t* r, int64x1_t* v0) { *r = vcltz_s64(*v0); }
void VcltzF32(uint32x2_t* r, float32x2_t* v0) { *r = vcltz_f32(*v0); }
void VcltzF64(uint64x1_t* r, float64x1_t* v0) { *r = vcltz_f64(*v0); }
void VcltzdS64(uint64_t* r, int64_t* v0) { *r = vcltzd_s64(*v0); }
void VcltzdF64(uint64_t* r, float64_t* v0) { *r = vcltzd_f64(*v0); }
void VcltzqS8(uint8x16_t* r, int8x16_t* v0) { *r = vcltzq_s8(*v0); }
void VcltzqS16(uint16x8_t* r, int16x8_t* v0) { *r = vcltzq_s16(*v0); }
void VcltzqS32(uint32x4_t* r, int32x4_t* v0) { *r = vcltzq_s32(*v0); }
void VcltzqS64(uint64x2_t* r, int64x2_t* v0) { *r = vcltzq_s64(*v0); }
void VcltzqF32(uint32x4_t* r, float32x4_t* v0) { *r = vcltzq_f32(*v0); }
void VcltzqF64(uint64x2_t* r, float64x2_t* v0) { *r = vcltzq_f64(*v0); }
void VcltzsF32(uint32_t* r, float32_t* v0) { *r = vcltzs_f32(*v0); }
void VclzS8(int8x8_t* r, int8x8_t* v0) { *r = vclz_s8(*v0); }
void VclzS16(int16x4_t* r, int16x4_t* v0) { *r = vclz_s16(*v0); }
void VclzS32(int32x2_t* r, int32x2_t* v0) { *r = vclz_s32(*v0); }
void VclzU8(uint8x8_t* r, uint8x8_t* v0) { *r = vclz_u8(*v0); }
void VclzU16(uint16x4_t* r, uint16x4_t* v0) { *r = vclz_u16(*v0); }
void VclzU32(uint32x2_t* r, uint32x2_t* v0) { *r = vclz_u32(*v0); }
void VclzqS8(int8x16_t* r, int8x16_t* v0) { *r = vclzq_s8(*v0); }
void VclzqS16(int16x8_t* r, int16x8_t* v0) { *r = vclzq_s16(*v0); }
void VclzqS32(int32x4_t* r, int32x4_t* v0) { *r = vclzq_s32(*v0); }
void VclzqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vclzq_u8(*v0); }
void VclzqU16(uint16x8_t* r, uint16x8_t* v0) { *r = vclzq_u16(*v0); }
void VclzqU32(uint32x4_t* r, uint32x4_t* v0) { *r = vclzq_u32(*v0); }
void VcntS8(int8x8_t* r, int8x8_t* v0) { *r = vcnt_s8(*v0); }
void VcntU8(uint8x8_t* r, uint8x8_t* v0) { *r = vcnt_u8(*v0); }
void VcntP8(poly8x8_t* r, poly8x8_t* v0) { *r = vcnt_p8(*v0); }
void VcntqS8(int8x16_t* r, int8x16_t* v0) { *r = vcntq_s8(*v0); }
void VcntqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vcntq_u8(*v0); }
void VcntqP8(poly8x16_t* r, poly8x16_t* v0) { *r = vcntq_p8(*v0); }
void VcombineS8(int8x16_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vcombine_s8(*v0, *v1); }
void VcombineS16(int16x8_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vcombine_s16(*v0, *v1); }
void VcombineS32(int32x4_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vcombine_s32(*v0, *v1); }
void VcombineS64(int64x2_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vcombine_s64(*v0, *v1); }
void VcombineU8(uint8x16_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vcombine_u8(*v0, *v1); }
void VcombineU16(uint16x8_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vcombine_u16(*v0, *v1); }
void VcombineU32(uint32x4_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vcombine_u32(*v0, *v1); }
void VcombineU64(uint64x2_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vcombine_u64(*v0, *v1); }
void VcombineF32(float32x4_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcombine_f32(*v0, *v1); }
void VcombineF64(float64x2_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcombine_f64(*v0, *v1); }
void VcombineP16(poly16x8_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = vcombine_p16(*v0, *v1); }
void VcombineP64(poly64x2_t* r, poly64x1_t* v0, poly64x1_t* v1) { *r = vcombine_p64(*v0, *v1); }
void VcombineP8(poly8x16_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vcombine_p8(*v0, *v1); }
void VcvtF32S32(float32x2_t* r, int32x2_t* v0) { *r = vcvt_f32_s32(*v0); }
void VcvtF32U32(float32x2_t* r, uint32x2_t* v0) { *r = vcvt_f32_u32(*v0); }
void VcvtF32F64(float32x2_t* r, float64x2_t* v0) { *r = vcvt_f32_f64(*v0); }
void VcvtF64S64(float64x1_t* r, int64x1_t* v0) { *r = vcvt_f64_s64(*v0); }
void VcvtF64U64(float64x1_t* r, uint64x1_t* v0) { *r = vcvt_f64_u64(*v0); }
void VcvtF64F32(float64x2_t* r, float32x2_t* v0) { *r = vcvt_f64_f32(*v0); }
void VcvtHighF32F64(float32x4_t* r, float32x2_t* v0, float64x2_t* v1) { *r = vcvt_high_f32_f64(*v0, *v1); }
void VcvtHighF64F32(float64x2_t* r, float32x4_t* v0) { *r = vcvt_high_f64_f32(*v0); }
void VcvtS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvt_s32_f32(*v0); }
void VcvtS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvt_s64_f64(*v0); }
void VcvtU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvt_u32_f32(*v0); }
void VcvtU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvt_u64_f64(*v0); }
void VcvtaS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvta_s32_f32(*v0); }
void VcvtaS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvta_s64_f64(*v0); }
void VcvtaU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvta_u32_f32(*v0); }
void VcvtaU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvta_u64_f64(*v0); }
void VcvtadS64F64(int64_t* r, float64_t* v0) { *r = vcvtad_s64_f64(*v0); }
void VcvtadU64F64(uint64_t* r, float64_t* v0) { *r = vcvtad_u64_f64(*v0); }
void VcvtaqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtaq_s32_f32(*v0); }
void VcvtaqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtaq_s64_f64(*v0); }
void VcvtaqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtaq_u32_f32(*v0); }
void VcvtaqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtaq_u64_f64(*v0); }
void VcvtasS32F32(int32_t* r, float32_t* v0) { *r = vcvtas_s32_f32(*v0); }
void VcvtasU32F32(uint32_t* r, float32_t* v0) { *r = vcvtas_u32_f32(*v0); }
void VcvtdF64S64(float64_t* r, int64_t* v0) { *r = vcvtd_f64_s64(*v0); }
void VcvtdF64U64(float64_t* r, uint64_t* v0) { *r = vcvtd_f64_u64(*v0); }
void VcvtdS64F64(int64_t* r, float64_t* v0) { *r = vcvtd_s64_f64(*v0); }
void VcvtdU64F64(uint64_t* r, float64_t* v0) { *r = vcvtd_u64_f64(*v0); }
void VcvtmS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvtm_s32_f32(*v0); }
void VcvtmS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvtm_s64_f64(*v0); }
void VcvtmU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvtm_u32_f32(*v0); }
void VcvtmU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvtm_u64_f64(*v0); }
void VcvtmdS64F64(int64_t* r, float64_t* v0) { *r = vcvtmd_s64_f64(*v0); }
void VcvtmdU64F64(uint64_t* r, float64_t* v0) { *r = vcvtmd_u64_f64(*v0); }
void VcvtmqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtmq_s32_f32(*v0); }
void VcvtmqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtmq_s64_f64(*v0); }
void VcvtmqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtmq_u32_f32(*v0); }
void VcvtmqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtmq_u64_f64(*v0); }
void VcvtmsS32F32(int32_t* r, float32_t* v0) { *r = vcvtms_s32_f32(*v0); }
void VcvtmsU32F32(uint32_t* r, float32_t* v0) { *r = vcvtms_u32_f32(*v0); }
void VcvtnS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvtn_s32_f32(*v0); }
void VcvtnS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvtn_s64_f64(*v0); }
void VcvtnU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvtn_u32_f32(*v0); }
void VcvtnU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvtn_u64_f64(*v0); }
void VcvtndS64F64(int64_t* r, float64_t* v0) { *r = vcvtnd_s64_f64(*v0); }
void VcvtndU64F64(uint64_t* r, float64_t* v0) { *r = vcvtnd_u64_f64(*v0); }
void VcvtnqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtnq_s32_f32(*v0); }
void VcvtnqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtnq_s64_f64(*v0); }
void VcvtnqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtnq_u32_f32(*v0); }
void VcvtnqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtnq_u64_f64(*v0); }
void VcvtnsS32F32(int32_t* r, float32_t* v0) { *r = vcvtns_s32_f32(*v0); }
void VcvtnsU32F32(uint32_t* r, float32_t* v0) { *r = vcvtns_u32_f32(*v0); }
void VcvtpS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvtp_s32_f32(*v0); }
void VcvtpS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvtp_s64_f64(*v0); }
void VcvtpU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvtp_u32_f32(*v0); }
void VcvtpU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvtp_u64_f64(*v0); }
void VcvtpdS64F64(int64_t* r, float64_t* v0) { *r = vcvtpd_s64_f64(*v0); }
void VcvtpdU64F64(uint64_t* r, float64_t* v0) { *r = vcvtpd_u64_f64(*v0); }
void VcvtpqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtpq_s32_f32(*v0); }
void VcvtpqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtpq_s64_f64(*v0); }
void VcvtpqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtpq_u32_f32(*v0); }
void VcvtpqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtpq_u64_f64(*v0); }
void VcvtpsS32F32(int32_t* r, float32_t* v0) { *r = vcvtps_s32_f32(*v0); }
void VcvtpsU32F32(uint32_t* r, float32_t* v0) { *r = vcvtps_u32_f32(*v0); }
void VcvtqF32S32(float32x4_t* r, int32x4_t* v0) { *r = vcvtq_f32_s32(*v0); }
void VcvtqF32U32(float32x4_t* r, uint32x4_t* v0) { *r = vcvtq_f32_u32(*v0); }
void VcvtqF64S64(float64x2_t* r, int64x2_t* v0) { *r = vcvtq_f64_s64(*v0); }
void VcvtqF64U64(float64x2_t* r, uint64x2_t* v0) { *r = vcvtq_f64_u64(*v0); }
void VcvtqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtq_s32_f32(*v0); }
void VcvtqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtq_s64_f64(*v0); }
void VcvtqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtq_u32_f32(*v0); }
void VcvtqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtq_u64_f64(*v0); }
void VcvtsF32S32(float32_t* r, int32_t* v0) { *r = vcvts_f32_s32(*v0); }
void VcvtsF32U32(float32_t* r, uint32_t* v0) { *r = vcvts_f32_u32(*v0); }
void VcvtsS32F32(int32_t* r, float32_t* v0) { *r = vcvts_s32_f32(*v0); }
void VcvtsU32F32(uint32_t* r, float32_t* v0) { *r = vcvts_u32_f32(*v0); }
void VcvtxF32F64(float32x2_t* r, float64x2_t* v0) { *r = vcvtx_f32_f64(*v0); }
void VcvtxHighF32F64(float32x4_t* r, float32x2_t* v0, float64x2_t* v1) { *r = vcvtx_high_f32_f64(*v0, *v1); }
void VcvtxdF32F64(float32_t* r, float64_t* v0) { *r = vcvtxd_f32_f64(*v0); }
void VdivF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vdiv_f32(*v0, *v1); }
void VdivF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vdiv_f64(*v0, *v1); }
void VdivqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vdivq_f32(*v0, *v1); }
void VdivqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vdivq_f64(*v0, *v1); }
void VdotS32(int32x2_t* r, int32x2_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vdot_s32(*v0, *v1, *v2); }
void VdotU32(uint32x2_t* r, uint32x2_t* v0, uint8x8_t* v1, uint8x8_t* v2) { *r = vdot_u32(*v0, *v1, *v2); }
void VdotqS32(int32x4_t* r, int32x4_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vdotq_s32(*v0, *v1, *v2); }
void VdotqU32(uint32x4_t* r, uint32x4_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vdotq_u32(*v0, *v1, *v2); }
void VdupNS8(int8x8_t* r, int8_t* v0) { *r = vdup_n_s8(*v0); }
void VdupNS16(int16x4_t* r, int16_t* v0) { *r = vdup_n_s16(*v0); }
void VdupNS32(int32x2_t* r, int32_t* v0) { *r = vdup_n_s32(*v0); }
void VdupNS64(int64x1_t* r, int64_t* v0) { *r = vdup_n_s64(*v0); }
void VdupNU8(uint8x8_t* r, uint8_t* v0) { *r = vdup_n_u8(*v0); }
void VdupNU16(uint16x4_t* r, uint16_t* v0) { *r = vdup_n_u16(*v0); }
void VdupNU32(uint32x2_t* r, uint32_t* v0) { *r = vdup_n_u32(*v0); }
void VdupNU64(uint64x1_t* r, uint64_t* v0) { *r = vdup_n_u64(*v0); }
void VdupNF32(float32x2_t* r, float32_t* v0) { *r = vdup_n_f32(*v0); }
void VdupNF64(float64x1_t* r, float64_t* v0) { *r = vdup_n_f64(*v0); }
void VdupNP16(poly16x4_t* r, poly16_t* v0) { *r = vdup_n_p16(*v0); }
void VdupNP64(poly64x1_t* r, poly64_t* v0) { *r = vdup_n_p64(*v0); }
void VdupNP8(poly8x8_t* r, poly8_t* v0) { *r = vdup_n_p8(*v0); }
void VdupqNS8(int8x16_t* r, int8_t* v0) { *r = vdupq_n_s8(*v0); }
void VdupqNS16(int16x8_t* r, int16_t* v0) { *r = vdupq_n_s16(*v0); }
void VdupqNS32(int32x4_t* r, int32_t* v0) { *r = vdupq_n_s32(*v0); }
void VdupqNS64(int64x2_t* r, int64_t* v0) { *r = vdupq_n_s64(*v0); }
void VdupqNU8(uint8x16_t* r, uint8_t* v0) { *r = vdupq_n_u8(*v0); }
void VdupqNU16(uint16x8_t* r, uint16_t* v0) { *r = vdupq_n_u16(*v0); }
void VdupqNU32(uint32x4_t* r, uint32_t* v0) { *r = vdupq_n_u32(*v0); }
void VdupqNU64(uint64x2_t* r, uint64_t* v0) { *r = vdupq_n_u64(*v0); }
void VdupqNF32(float32x4_t* r, float32_t* v0) { *r = vdupq_n_f32(*v0); }
void VdupqNF64(float64x2_t* r, float64_t* v0) { *r = vdupq_n_f64(*v0); }
void VdupqNP16(poly16x8_t* r, poly16_t* v0) { *r = vdupq_n_p16(*v0); }
void VdupqNP64(poly64x2_t* r, poly64_t* v0) { *r = vdupq_n_p64(*v0); }
void VdupqNP8(poly8x16_t* r, poly8_t* v0) { *r = vdupq_n_p8(*v0); }
void VeorS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = veor_s8(*v0, *v1); }
void VeorS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = veor_s16(*v0, *v1); }
void VeorS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = veor_s32(*v0, *v1); }
void VeorS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = veor_s64(*v0, *v1); }
void VeorU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = veor_u8(*v0, *v1); }
void VeorU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = veor_u16(*v0, *v1); }
void VeorU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = veor_u32(*v0, *v1); }
void VeorU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = veor_u64(*v0, *v1); }
void Veor3QS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = veor3q_s8(*v0, *v1, *v2); }
void Veor3QS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = veor3q_s16(*v0, *v1, *v2); }
void Veor3QS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = veor3q_s32(*v0, *v1, *v2); }
void Veor3QS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1, int64x2_t* v2) { *r = veor3q_s64(*v0, *v1, *v2); }
void Veor3QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = veor3q_u8(*v0, *v1, *v2); }
void Veor3QU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = veor3q_u16(*v0, *v1, *v2); }
void Veor3QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = veor3q_u32(*v0, *v1, *v2); }
void Veor3QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = veor3q_u64(*v0, *v1, *v2); }
void VeorqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = veorq_s8(*v0, *v1); }
void VeorqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = veorq_s16(*v0, *v1); }
void VeorqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = veorq_s32(*v0, *v1); }
void VeorqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = veorq_s64(*v0, *v1); }
void VeorqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = veorq_u8(*v0, *v1); }
void VeorqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = veorq_u16(*v0, *v1); }
void VeorqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = veorq_u32(*v0, *v1); }
void VeorqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = veorq_u64(*v0, *v1); }
void VfmaF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32x2_t* v2) { *r = vfma_f32(*v0, *v1, *v2); }
void VfmaF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64x1_t* v2) { *r = vfma_f64(*v0, *v1, *v2); }
void VfmaNF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32_t* v2) { *r = vfma_n_f32(*v0, *v1, *v2); }
void VfmaNF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64_t* v2) { *r = vfma_n_f64(*v0, *v1, *v2); }
void VfmaqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32x4_t* v2) { *r = vfmaq_f32(*v0, *v1, *v2); }
void VfmaqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64x2_t* v2) { *r = vfmaq_f64(*v0, *v1, *v2); }
void VfmaqNF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32_t* v2) { *r = vfmaq_n_f32(*v0, *v1, *v2); }
void VfmaqNF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64_t* v2) { *r = vfmaq_n_f64(*v0, *v1, *v2); }
void VfmsF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32x2_t* v2) { *r = vfms_f32(*v0, *v1, *v2); }
void VfmsF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64x1_t* v2) { *r = vfms_f64(*v0, *v1, *v2); }
void VfmsNF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32_t* v2) { *r = vfms_n_f32(*v0, *v1, *v2); }
void VfmsNF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64_t* v2) { *r = vfms_n_f64(*v0, *v1, *v2); }
void VfmsqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32x4_t* v2) { *r = vfmsq_f32(*v0, *v1, *v2); }
void VfmsqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64x2_t* v2) { *r = vfmsq_f64(*v0, *v1, *v2); }
void VfmsqNF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32_t* v2) { *r = vfmsq_n_f32(*v0, *v1, *v2); }
void VfmsqNF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64_t* v2) { *r = vfmsq_n_f64(*v0, *v1, *v2); }
void VgetHighS8(int8x8_t* r, int8x16_t* v0) { *r = vget_high_s8(*v0); }
void VgetHighS16(int16x4_t* r, int16x8_t* v0) { *r = vget_high_s16(*v0); }
void VgetHighS32(int32x2_t* r, int32x4_t* v0) { *r = vget_high_s32(*v0); }
void VgetHighS64(int64x1_t* r, int64x2_t* v0) { *r = vget_high_s64(*v0); }
void VgetHighU8(uint8x8_t* r, uint8x16_t* v0) { *r = vget_high_u8(*v0); }
void VgetHighU16(uint16x4_t* r, uint16x8_t* v0) { *r = vget_high_u16(*v0); }
void VgetHighU32(uint32x2_t* r, uint32x4_t* v0) { *r = vget_high_u32(*v0); }
void VgetHighU64(uint64x1_t* r, uint64x2_t* v0) { *r = vget_high_u64(*v0); }
void VgetHighF32(float32x2_t* r, float32x4_t* v0) { *r = vget_high_f32(*v0); }
void VgetHighF64(float64x1_t* r, float64x2_t* v0) { *r = vget_high_f64(*v0); }
void VgetHighP16(poly16x4_t* r, poly16x8_t* v0) { *r = vget_high_p16(*v0); }
void VgetHighP64(poly64x1_t* r, poly64x2_t* v0) { *r = vget_high_p64(*v0); }
void VgetHighP8(poly8x8_t* r, poly8x16_t* v0) { *r = vget_high_p8(*v0); }
void VgetLowS8(int8x8_t* r, int8x16_t* v0) { *r = vget_low_s8(*v0); }
void VgetLowS16(int16x4_t* r, int16x8_t* v0) { *r = vget_low_s16(*v0); }
void VgetLowS32(int32x2_t* r, int32x4_t* v0) { *r = vget_low_s32(*v0); }
void VgetLowS64(int64x1_t* r, int64x2_t* v0) { *r = vget_low_s64(*v0); }
void VgetLowU8(uint8x8_t* r, uint8x16_t* v0) { *r = vget_low_u8(*v0); }
void VgetLowU16(uint16x4_t* r, uint16x8_t* v0) { *r = vget_low_u16(*v0); }
void VgetLowU32(uint32x2_t* r, uint32x4_t* v0) { *r = vget_low_u32(*v0); }
void VgetLowU64(uint64x1_t* r, uint64x2_t* v0) { *r = vget_low_u64(*v0); }
void VgetLowF32(float32x2_t* r, float32x4_t* v0) { *r = vget_low_f32(*v0); }
void VgetLowF64(float64x1_t* r, float64x2_t* v0) { *r = vget_low_f64(*v0); }
void VgetLowP16(poly16x4_t* r, poly16x8_t* v0) { *r = vget_low_p16(*v0); }
void VgetLowP64(poly64x1_t* r, poly64x2_t* v0) { *r = vget_low_p64(*v0); }
void VgetLowP8(poly8x8_t* r, poly8x16_t* v0) { *r = vget_low_p8(*v0); }
void VhaddS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vhadd_s8(*v0, *v1); }
void VhaddS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vhadd_s16(*v0, *v1); }
void VhaddS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vhadd_s32(*v0, *v1); }
void VhaddU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vhadd_u8(*v0, *v1); }
void VhaddU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vhadd_u16(*v0, *v1); }
void VhaddU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vhadd_u32(*v0, *v1); }
void VhaddqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vhaddq_s8(*v0, *v1); }
void VhaddqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vhaddq_s16(*v0, *v1); }
void VhaddqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vhaddq_s32(*v0, *v1); }
void VhaddqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vhaddq_u8(*v0, *v1); }
void VhaddqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vhaddq_u16(*v0, *v1); }
void VhaddqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vhaddq_u32(*v0, *v1); }
void VhsubS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vhsub_s8(*v0, *v1); }
void VhsubS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vhsub_s16(*v0, *v1); }
void VhsubS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vhsub_s32(*v0, *v1); }
void VhsubU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vhsub_u8(*v0, *v1); }
void VhsubU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vhsub_u16(*v0, *v1); }
void VhsubU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vhsub_u32(*v0, *v1); }
void VhsubqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vhsubq_s8(*v0, *v1); }
void VhsubqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vhsubq_s16(*v0, *v1); }
void VhsubqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vhsubq_s32(*v0, *v1); }
void VhsubqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vhsubq_u8(*v0, *v1); }
void VhsubqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vhsubq_u16(*v0, *v1); }
void VhsubqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vhsubq_u32(*v0, *v1); }
void VmaxS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vmax_s8(*v0, *v1); }
void VmaxS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vmax_s16(*v0, *v1); }
void VmaxS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vmax_s32(*v0, *v1); }
void VmaxU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vmax_u8(*v0, *v1); }
void VmaxU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vmax_u16(*v0, *v1); }
void VmaxU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vmax_u32(*v0, *v1); }
void VmaxF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vmax_f32(*v0, *v1); }
void VmaxF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vmax_f64(*v0, *v1); }
void VmaxnmF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vmaxnm_f32(*v0, *v1); }
void VmaxnmF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vmaxnm_f64(*v0, *v1); }
void VmaxnmqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vmaxnmq_f32(*v0, *v1); }
void VmaxnmqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vmaxnmq_f64(*v0, *v1); }
void VmaxnmvF32(float32_t* r, float32x2_t* v0) { *r = vmaxnmv_f32(*v0); }
void VmaxnmvqF32(float32_t* r, float32x4_t* v0) { *r = vmaxnmvq_f32(*v0); }
void VmaxnmvqF64(float64_t* r, float64x2_t* v0) { *r = vmaxnmvq_f64(*v0); }
void VmaxqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vmaxq_s8(*v0, *v1); }
void VmaxqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vmaxq_s16(*v0, *v1); }
void VmaxqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vmaxq_s32(*v0, *v1); }
void VmaxqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vmaxq_u8(*v0, *v1); }
void VmaxqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vmaxq_u16(*v0, *v1); }
void VmaxqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vmaxq_u32(*v0, *v1); }
void VmaxqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vmaxq_f32(*v0, *v1); }
void VmaxqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vmaxq_f64(*v0, *v1); }
void VmaxvS8(int8_t* r, int8x8_t* v0) { *r = vmaxv_s8(*v0); }
void VmaxvS16(int16_t* r, int16x4_t* v0) { *r = vmaxv_s16(*v0); }
void VmaxvS32(int32_t* r, int32x2_t* v0) { *r = vmaxv_s32(*v0); }
void VmaxvU8(uint8_t* r, uint8x8_t* v0) { *r = vmaxv_u8(*v0); }
void VmaxvU16(uint16_t* r, uint16x4_t* v0) { *r = vmaxv_u16(*v0); }
void VmaxvU32(uint32_t* r, uint32x2_t* v0) { *r = vmaxv_u32(*v0); }
void VmaxvF32(float32_t* r, float32x2_t* v0) { *r = vmaxv_f32(*v0); }
void VmaxvqS8(int8_t* r, int8x16_t* v0) { *r = vmaxvq_s8(*v0); }
void VmaxvqS16(int16_t* r, int16x8_t* v0) { *r = vmaxvq_s16(*v0); }
void VmaxvqS32(int32_t* r, int32x4_t* v0) { *r = vmaxvq_s32(*v0); }
void VmaxvqU8(uint8_t* r, uint8x16_t* v0) { *r = vmaxvq_u8(*v0); }
void VmaxvqU16(uint16_t* r, uint16x8_t* v0) { *r = vmaxvq_u16(*v0); }
void VmaxvqU32(uint32_t* r, uint32x4_t* v0) { *r = vmaxvq_u32(*v0); }
void VmaxvqF32(float32_t* r, float32x4_t* v0) { *r = vmaxvq_f32(*v0); }
void VmaxvqF64(float64_t* r, float64x2_t* v0) { *r = vmaxvq_f64(*v0); }
void VminS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vmin_s8(*v0, *v1); }
void VminS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vmin_s16(*v0, *v1); }
void VminS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vmin_s32(*v0, *v1); }
void VminU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vmin_u8(*v0, *v1); }
void VminU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vmin_u16(*v0, *v1); }
void VminU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vmin_u32(*v0, *v1); }
void VminF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vmin_f32(*v0, *v1); }
void VminF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vmin_f64(*v0, *v1); }
void VminnmF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vminnm_f32(*v0, *v1); }
void VminnmF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vminnm_f64(*v0, *v1); }
void VminnmqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vminnmq_f32(*v0, *v1); }
void VminnmqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vminnmq_f64(*v0, *v1); }
void VminnmvF32(float32_t* r, float32x2_t* v0) { *r = vminnmv_f32(*v0); }
void VminnmvqF32(float32_t* r, float32x4_t* v0) { *r = vminnmvq_f32(*v0); }
void VminnmvqF64(float64_t* r, float64x2_t* v0) { *r = vminnmvq_f64(*v0); }
void VminqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vminq_s8(*v0, *v1); }
void VminqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vminq_s16(*v0, *v1); }
void VminqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vminq_s32(*v0, *v1); }
void VminqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vminq_u8(*v0, *v1); }
void VminqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vminq_u16(*v0, *v1); }
void VminqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vminq_u32(*v0, *v1); }
void VminqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vminq_f32(*v0, *v1); }
void VminqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vminq_f64(*v0, *v1); }
void VminvS8(int8_t* r, int8x8_t* v0) { *r = vminv_s8(*v0); }
void VminvS16(int16_t* r, int16x4_t* v0) { *r = vminv_s16(*v0); }
void VminvS32(int32_t* r, int32x2_t* v0) { *r = vminv_s32(*v0); }
void VminvU8(uint8_t* r, uint8x8_t* v0) { *r = vminv_u8(*v0); }
void VminvU16(uint16_t* r, uint16x4_t* v0) { *r = vminv_u16(*v0); }
void VminvU32(uint32_t* r, uint32x2_t* v0) { *r = vminv_u32(*v0); }
void VminvF32(float32_t* r, float32x2_t* v0) { *r = vminv_f32(*v0); }
void VminvqS8(int8_t* r, int8x16_t* v0) { *r = vminvq_s8(*v0); }
void VminvqS16(int16_t* r, int16x8_t* v0) { *r = vminvq_s16(*v0); }
void VminvqS32(int32_t* r, int32x4_t* v0) { *r = vminvq_s32(*v0); }
void VminvqU8(uint8_t* r, uint8x16_t* v0) { *r = vminvq_u8(*v0); }
void VminvqU16(uint16_t* r, uint16x8_t* v0) { *r = vminvq_u16(*v0); }
void VminvqU32(uint32_t* r, uint32x4_t* v0) { *r = vminvq_u32(*v0); }
void VminvqF32(float32_t* r, float32x4_t* v0) { *r = vminvq_f32(*v0); }
void VminvqF64(float64_t* r, float64x2_t* v0) { *r = vminvq_f64(*v0); }
void VmlaS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vmla_s8(*v0, *v1, *v2); }
void VmlaS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vmla_s16(*v0, *v1, *v2); }
void VmlaS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vmla_s32(*v0, *v1, *v2); }
void VmlaU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) { *r = vmla_u8(*v0, *v1, *v2); }
void VmlaU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16x4_t* v2) { *r = vmla_u16(*v0, *v1, *v2); }
void VmlaU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32x2_t* v2) { *r = vmla_u32(*v0, *v1, *v2); }
void VmlaF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32x2_t* v2) { *r = vmla_f32(*v0, *v1, *v2); }
void VmlaF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64x1_t* v2) { *r = vmla_f64(*v0, *v1, *v2); }
void VmlaNS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16_t* v2) { *r = vmla_n_s16(*v0, *v1, *v2); }
void VmlaNS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32_t* v2) { *r = vmla_n_s32(*v0, *v1, *v2); }
void VmlaNU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16_t* v2) { *r = vmla_n_u16(*v0, *v1, *v2); }
void VmlaNU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32_t* v2) { *r = vmla_n_u32(*v0, *v1, *v2); }
void VmlaNF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32_t* v2) { *r = vmla_n_f32(*v0, *v1, *v2); }
void VmlalS8(int16x8_t* r, int16x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vmlal_s8(*v0, *v1, *v2); }
void VmlalS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vmlal_s16(*v0, *v1, *v2); }
void VmlalS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vmlal_s32(*v0, *v1, *v2); }
void VmlalU8(uint16x8_t* r, uint16x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) { *r = vmlal_u8(*v0, *v1, *v2); }
void VmlalU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16x4_t* v2) { *r = vmlal_u16(*v0, *v1, *v2); }
void VmlalU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32x2_t* v2) { *r = vmlal_u32(*v0, *v1, *v2); }
void VmlalHighS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vmlal_high_s8(*v0, *v1, *v2); }
void VmlalHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vmlal_high_s16(*v0, *v1, *v2); }
void VmlalHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vmlal_high_s32(*v0, *v1, *v2); }
void VmlalHighU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vmlal_high_u8(*v0, *v1, *v2); }
void VmlalHighU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vmlal_high_u16(*v0, *v1, *v2); }
void VmlalHighU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vmlal_high_u32(*v0, *v1, *v2); }
void VmlalHighNS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16_t* v2) { *r = vmlal_high_n_s16(*v0, *v1, *v2); }
void VmlalHighNS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32_t* v2) { *r = vmlal_high_n_s32(*v0, *v1, *v2); }
void VmlalHighNU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16_t* v2) { *r = vmlal_high_n_u16(*v0, *v1, *v2); }
void VmlalHighNU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32_t* v2) { *r = vmlal_high_n_u32(*v0, *v1, *v2); }
void VmlalNS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16_t* v2) { *r = vmlal_n_s16(*v0, *v1, *v2); }
void VmlalNS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32_t* v2) { *r = vmlal_n_s32(*v0, *v1, *v2); }
void VmlalNU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16_t* v2) { *r = vmlal_n_u16(*v0, *v1, *v2); }
void VmlalNU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32_t* v2) { *r = vmlal_n_u32(*v0, *v1, *v2); }
void VmlaqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vmlaq_s8(*v0, *v1, *v2); }
void VmlaqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vmlaq_s16(*v0, *v1, *v2); }
void VmlaqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vmlaq_s32(*v0, *v1, *v2); }
void VmlaqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vmlaq_u8(*v0, *v1, *v2); }
void VmlaqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vmlaq_u16(*v0, *v1, *v2); }
void VmlaqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vmlaq_u32(*v0, *v1, *v2); }
void VmlaqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32x4_t* v2) { *r = vmlaq_f32(*v0, *v1, *v2); }
void VmlaqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64x2_t* v2) { *r = vmlaq_f64(*v0, *v1, *v2); }
void VmlaqNS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16_t* v2) { *r = vmlaq_n_s16(*v0, *v1, *v2); }
void VmlaqNS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32_t* v2) { *r = vmlaq_n_s32(*v0, *v1, *v2); }
void VmlaqNU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16_t* v2) { *r = vmlaq_n_u16(*v0, *v1, *v2); }
void VmlaqNU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32_t* v2) { *r = vmlaq_n_u32(*v0, *v1, *v2); }
void VmlaqNF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32_t* v2) { *r = vmlaq_n_f32(*v0, *v1, *v2); }
void VmlsS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vmls_s8(*v0, *v1, *v2); }
void VmlsS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vmls_s16(*v0, *v1, *v2); }
void VmlsS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vmls_s32(*v0, *v1, *v2); }
void VmlsU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) { *r = vmls_u8(*v0, *v1, *v2); }
void VmlsU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16x4_t* v2) { *r = vmls_u16(*v0, *v1, *v2); }
void VmlsU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32x2_t* v2) { *r = vmls_u32(*v0, *v1, *v2); }
void VmlsF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32x2_t* v2) { *r = vmls_f32(*v0, *v1, *v2); }
void VmlsF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64x1_t* v2) { *r = vmls_f64(*v0, *v1, *v2); }
void VmlsNS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16_t* v2) { *r = vmls_n_s16(*v0, *v1, *v2); }
void VmlsNS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32_t* v2) { *r = vmls_n_s32(*v0, *v1, *v2); }
void VmlsNU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16_t* v2) { *r = vmls_n_u16(*v0, *v1, *v2); }
void VmlsNU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32_t* v2) { *r = vmls_n_u32(*v0, *v1, *v2); }
void VmlsNF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32_t* v2) { *r = vmls_n_f32(*v0, *v1, *v2); }
void VmlslS8(int16x8_t* r, int16x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vmlsl_s8(*v0, *v1, *v2); }
void VmlslS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vmlsl_s16(*v0, *v1, *v2); }
void VmlslS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vmlsl_s32(*v0, *v1, *v2); }
void VmlslU8(uint16x8_t* r, uint16x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) { *r = vmlsl_u8(*v0, *v1, *v2); }
void VmlslU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16x4_t* v2) { *r = vmlsl_u16(*v0, *v1, *v2); }
void VmlslU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32x2_t* v2) { *r = vmlsl_u32(*v0, *v1, *v2); }
void VmlslHighS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vmlsl_high_s8(*v0, *v1, *v2); }
void VmlslHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vmlsl_high_s16(*v0, *v1, *v2); }
void VmlslHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vmlsl_high_s32(*v0, *v1, *v2); }
void VmlslHighU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vmlsl_high_u8(*v0, *v1, *v2); }
void VmlslHighU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vmlsl_high_u16(*v0, *v1, *v2); }
void VmlslHighU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vmlsl_high_u32(*v0, *v1, *v2); }
void VmlslHighNS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16_t* v2) { *r = vmlsl_high_n_s16(*v0, *v1, *v2); }
void VmlslHighNS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32_t* v2) { *r = vmlsl_high_n_s32(*v0, *v1, *v2); }
void VmlslHighNU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16_t* v2) { *r = vmlsl_high_n_u16(*v0, *v1, *v2); }
void VmlslHighNU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32_t* v2) { *r = vmlsl_high_n_u32(*v0, *v1, *v2); }
void VmlslNS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16_t* v2) { *r = vmlsl_n_s16(*v0, *v1, *v2); }
void VmlslNS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32_t* v2) { *r = vmlsl_n_s32(*v0, *v1, *v2); }
void VmlslNU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16_t* v2) { *r = vmlsl_n_u16(*v0, *v1, *v2); }
void VmlslNU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32_t* v2) { *r = vmlsl_n_u32(*v0, *v1, *v2); }
void VmlsqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vmlsq_s8(*v0, *v1, *v2); }
void VmlsqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vmlsq_s16(*v0, *v1, *v2); }
void VmlsqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vmlsq_s32(*v0, *v1, *v2); }
void VmlsqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vmlsq_u8(*v0, *v1, *v2); }
void VmlsqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vmlsq_u16(*v0, *v1, *v2); }
void VmlsqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vmlsq_u32(*v0, *v1, *v2); }
void VmlsqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32x4_t* v2) { *r = vmlsq_f32(*v0, *v1, *v2); }
void VmlsqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64x2_t* v2) { *r = vmlsq_f64(*v0, *v1, *v2); }
void VmlsqNS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16_t* v2) { *r = vmlsq_n_s16(*v0, *v1, *v2); }
void VmlsqNS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32_t* v2) { *r = vmlsq_n_s32(*v0, *v1, *v2); }
void VmlsqNU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16_t* v2) { *r = vmlsq_n_u16(*v0, *v1, *v2); }
void VmlsqNU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32_t* v2) { *r = vmlsq_n_u32(*v0, *v1, *v2); }
void VmlsqNF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32_t* v2) { *r = vmlsq_n_f32(*v0, *v1, *v2); }
void VmmlaqS32(int32x4_t* r, int32x4_t* v0, int8x16_t* v1, int8x16_t* v2) { *r = vmmlaq_s32(*v0, *v1, *v2); }
void VmmlaqU32(uint32x4_t* r, uint32x4_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vmmlaq_u32(*v0, *v1, *v2); }
void VmovNS8(int8x8_t* r, int8_t* v0) { *r = vmov_n_s8(*v0); }
void VmovNS16(int16x4_t* r, int16_t* v0) { *r = vmov_n_s16(*v0); }
void VmovNS32(int32x2_t* r, int32_t* v0) { *r = vmov_n_s32(*v0); }
void VmovNS64(int64x1_t* r, int64_t* v0) { *r = vmov_n_s64(*v0); }
void VmovNU8(uint8x8_t* r, uint8_t* v0) { *r = vmov_n_u8(*v0); }
void VmovNU16(uint16x4_t* r, uint16_t* v0) { *r = vmov_n_u16(*v0); }
void VmovNU32(uint32x2_t* r, uint32_t* v0) { *r = vmov_n_u32(*v0); }
void VmovNU64(uint64x1_t* r, uint64_t* v0) { *r = vmov_n_u64(*v0); }
void VmovNF32(float32x2_t* r, float32_t* v0) { *r = vmov_n_f32(*v0); }
void VmovNF64(float64x1_t* r, float64_t* v0) { *r = vmov_n_f64(*v0); }
void VmovNP16(poly16x4_t* r, poly16_t* v0) { *r = vmov_n_p16(*v0); }
void VmovNP64(poly64x1_t* r, poly64_t* v0) { *r = vmov_n_p64(*v0); }
void VmovNP8(poly8x8_t* r, poly8_t* v0) { *r = vmov_n_p8(*v0); }
void VmovlS8(int16x8_t* r, int8x8_t* v0) { *r = vmovl_s8(*v0); }
void VmovlS16(int32x4_t* r, int16x4_t* v0) { *r = vmovl_s16(*v0); }
void VmovlS32(int64x2_t* r, int32x2_t* v0) { *r = vmovl_s32(*v0); }
void VmovlU8(uint16x8_t* r, uint8x8_t* v0) { *r = vmovl_u8(*v0); }
void VmovlU16(uint32x4_t* r, uint16x4_t* v0) { *r = vmovl_u16(*v0); }
void VmovlU32(uint64x2_t* r, uint32x2_t* v0) { *r = vmovl_u32(*v0); }
void VmovlHighS8(int16x8_t* r, int8x16_t* v0) { *r = vmovl_high_s8(*v0); }
void VmovlHighS16(int32x4_t* r, int16x8_t* v0) { *r = vmovl_high_s16(*v0); }
void VmovlHighS32(int64x2_t* r, int32x4_t* v0) { *r = vmovl_high_s32(*v0); }
void VmovlHighU8(uint16x8_t* r, uint8x16_t* v0) { *r = vmovl_high_u8(*v0); }
void VmovlHighU16(uint32x4_t* r, uint16x8_t* v0) { *r = vmovl_high_u16(*v0); }
void VmovlHighU32(uint64x2_t* r, uint32x4_t* v0) { *r = vmovl_high_u32(*v0); }
void VmovnS16(int8x8_t* r, int16x8_t* v0) { *r = vmovn_s16(*v0); }
void VmovnS32(int16x4_t* r, int32x4_t* v0) { *r = vmovn_s32(*v0); }
void VmovnS64(int32x2_t* r, int64x2_t* v0) { *r = vmovn_s64(*v0); }
void VmovnU16(uint8x8_t* r, uint16x8_t* v0) { *r = vmovn_u16(*v0); }
void VmovnU32(uint16x4_t* r, uint32x4_t* v0) { *r = vmovn_u32(*v0); }
void VmovnU64(uint32x2_t* r, uint64x2_t* v0) { *r = vmovn_u64(*v0); }
void VmovnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1) { *r = vmovn_high_s16(*v0, *v1); }
void VmovnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1) { *r = vmovn_high_s32(*v0, *v1); }
void VmovnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1) { *r = vmovn_high_s64(*v0, *v1); }
void VmovnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1) { *r = vmovn_high_u16(*v0, *v1); }
void VmovnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1) { *r = vmovn_high_u32(*v0, *v1); }
void VmovnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1) { *r = vmovn_high_u64(*v0, *v1); }
void VmovqNS8(int8x16_t* r, int8_t* v0) { *r = vmovq_n_s8(*v0); }
void VmovqNS16(int16x8_t* r, int16_t* v0) { *r = vmovq_n_s16(*v0); }
void VmovqNS32(int32x4_t* r, int32_t* v0) { *r = vmovq_n_s32(*v0); }
void VmovqNS64(int64x2_t* r, int64_t* v0) { *r = vmovq_n_s64(*v0); }
void VmovqNU8(uint8x16_t* r, uint8_t* v0) { *r = vmovq_n_u8(*v0); }
void VmovqNU16(uint16x8_t* r, uint16_t* v0) { *r = vmovq_n_u16(*v0); }
void VmovqNU32(uint32x4_t* r, uint32_t* v0) { *r = vmovq_n_u32(*v0); }
void VmovqNU64(uint64x2_t* r, uint64_t* v0) { *r = vmovq_n_u64(*v0); }
void VmovqNF32(float32x4_t* r, float32_t* v0) { *r = vmovq_n_f32(*v0); }
void VmovqNF64(float64x2_t* r, float64_t* v0) { *r = vmovq_n_f64(*v0); }
void VmovqNP16(poly16x8_t* r, poly16_t* v0) { *r = vmovq_n_p16(*v0); }
void VmovqNP64(poly64x2_t* r, poly64_t* v0) { *r = vmovq_n_p64(*v0); }
void VmovqNP8(poly8x16_t* r, poly8_t* v0) { *r = vmovq_n_p8(*v0); }
void VmulS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vmul_s8(*v0, *v1); }
void VmulS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vmul_s16(*v0, *v1); }
void VmulS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vmul_s32(*v0, *v1); }
void VmulU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vmul_u8(*v0, *v1); }
void VmulU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vmul_u16(*v0, *v1); }
void VmulU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vmul_u32(*v0, *v1); }
void VmulF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vmul_f32(*v0, *v1); }
void VmulF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vmul_f64(*v0, *v1); }
void VmulNS16(int16x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vmul_n_s16(*v0, *v1); }
void VmulNS32(int32x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vmul_n_s32(*v0, *v1); }
void VmulNU16(uint16x4_t* r, uint16x4_t* v0, uint16_t* v1) { *r = vmul_n_u16(*v0, *v1); }
void VmulNU32(uint32x2_t* r, uint32x2_t* v0, uint32_t* v1) { *r = vmul_n_u32(*v0, *v1); }
void VmulNF32(float32x2_t* r, float32x2_t* v0, float32_t* v1) { *r = vmul_n_f32(*v0, *v1); }
void VmulNF64(float64x1_t* r, float64x1_t* v0, float64_t* v1) { *r = vmul_n_f64(*v0, *v1); }
void VmulP8(poly8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vmul_p8(*v0, *v1); }
void VmullS8(int16x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vmull_s8(*v0, *v1); }
void VmullS16(int32x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vmull_s16(*v0, *v1); }
void VmullS32(int64x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vmull_s32(*v0, *v1); }
void VmullU8(uint16x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vmull_u8(*v0, *v1); }
void VmullU16(uint32x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vmull_u16(*v0, *v1); }
void VmullU32(uint64x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vmull_u32(*v0, *v1); }
void VmullHighS8(int16x8_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vmull_high_s8(*v0, *v1); }
void VmullHighS16(int32x4_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vmull_high_s16(*v0, *v1); }
void VmullHighS32(int64x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vmull_high_s32(*v0, *v1); }
void VmullHighU8(uint16x8_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vmull_high_u8(*v0, *v1); }
void VmullHighU16(uint32x4_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vmull_high_u16(*v0, *v1); }
void VmullHighU32(uint64x2_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vmull_high_u32(*v0, *v1); }
void VmullHighNS16(int32x4_t* r, int16x8_t* v0, int16_t* v1) { *r = vmull_high_n_s16(*v0, *v1); }
void VmullHighNS32(int64x2_t* r, int32x4_t* v0, int32_t* v1) { *r = vmull_high_n_s32(*v0, *v1); }
void VmullHighNU16(uint32x4_t* r, uint16x8_t* v0, uint16_t* v1) { *r = vmull_high_n_u16(*v0, *v1); }
void VmullHighNU32(uint64x2_t* r, uint32x4_t* v0, uint32_t* v1) { *r = vmull_high_n_u32(*v0, *v1); }
void VmullHighP64(poly128_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = vmull_high_p64(*v0, *v1); }
void VmullHighP8(poly16x8_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vmull_high_p8(*v0, *v1); }
void VmullNS16(int32x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vmull_n_s16(*v0, *v1); }
void VmullNS32(int64x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vmull_n_s32(*v0, *v1); }
void VmullNU16(uint32x4_t* r, uint16x4_t* v0, uint16_t* v1) { *r = vmull_n_u16(*v0, *v1); }
void VmullNU32(uint64x2_t* r, uint32x2_t* v0, uint32_t* v1) { *r = vmull_n_u32(*v0, *v1); }
void VmullP64(poly128_t* r, poly64_t* v0, poly64_t* v1) { *r = vmull_p64(*v0, *v1); }
void VmullP8(poly16x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vmull_p8(*v0, *v1); }
void VmulqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vmulq_s8(*v0, *v1); }
void VmulqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vmulq_s16(*v0, *v1); }
void VmulqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vmulq_s32(*v0, *v1); }
void VmulqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vmulq_u8(*v0, *v1); }
void VmulqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vmulq_u16(*v0, *v1); }
void VmulqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vmulq_u32(*v0, *v1); }
void VmulqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vmulq_f32(*v0, *v1); }
void VmulqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vmulq_f64(*v0, *v1); }
void VmulqNS16(int16x8_t* r, int16x8_t* v0, int16_t* v1) { *r = vmulq_n_s16(*v0, *v1); }
void VmulqNS32(int32x4_t* r, int32x4_t* v0, int32_t* v1) { *r = vmulq_n_s32(*v0, *v1); }
void VmulqNU16(uint16x8_t* r, uint16x8_t* v0, uint16_t* v1) { *r = vmulq_n_u16(*v0, *v1); }
void VmulqNU32(uint32x4_t* r, uint32x4_t* v0, uint32_t* v1) { *r = vmulq_n_u32(*v0, *v1); }
void VmulqNF32(float32x4_t* r, float32x4_t* v0, float32_t* v1) { *r = vmulq_n_f32(*v0, *v1); }
void VmulqNF64(float64x2_t* r, float64x2_t* v0, float64_t* v1) { *r = vmulq_n_f64(*v0, *v1); }
void VmulqP8(poly8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vmulq_p8(*v0, *v1); }
void VmulxF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vmulx_f32(*v0, *v1); }
void VmulxF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vmulx_f64(*v0, *v1); }
void VmulxdF64(float64_t* r, float64_t* v0, float64_t* v1) { *r = vmulxd_f64(*v0, *v1); }
void VmulxqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vmulxq_f32(*v0, *v1); }
void VmulxqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vmulxq_f64(*v0, *v1); }
void VmulxsF32(float32_t* r, float32_t* v0, float32_t* v1) { *r = vmulxs_f32(*v0, *v1); }
void VmvnS8(int8x8_t* r, int8x8_t* v0) { *r = vmvn_s8(*v0); }
void VmvnS16(int16x4_t* r, int16x4_t* v0) { *r = vmvn_s16(*v0); }
void VmvnS32(int32x2_t* r, int32x2_t* v0) { *r = vmvn_s32(*v0); }
void VmvnU8(uint8x8_t* r, uint8x8_t* v0) { *r = vmvn_u8(*v0); }
void VmvnU16(uint16x4_t* r, uint16x4_t* v0) { *r = vmvn_u16(*v0); }
void VmvnU32(uint32x2_t* r, uint32x2_t* v0) { *r = vmvn_u32(*v0); }
void VmvnP8(poly8x8_t* r, poly8x8_t* v0) { *r = vmvn_p8(*v0); }
void VmvnqS8(int8x16_t* r, int8x16_t* v0) { *r = vmvnq_s8(*v0); }
void VmvnqS16(int16x8_t* r, int16x8_t* v0) { *r = vmvnq_s16(*v0); }
void VmvnqS32(int32x4_t* r, int32x4_t* v0) { *r = vmvnq_s32(*v0); }
void VmvnqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vmvnq_u8(*v0); }
void VmvnqU16(uint16x8_t* r, uint16x8_t* v0) { *r = vmvnq_u16(*v0); }
void VmvnqU32(uint32x4_t* r, uint32x4_t* v0) { *r = vmvnq_u32(*v0); }
void VmvnqP8(poly8x16_t* r, poly8x16_t* v0) { *r = vmvnq_p8(*v0); }
void VnegS8(int8x8_t* r, int8x8_t* v0) { *r = vneg_s8(*v0); }
void VnegS16(int16x4_t* r, int16x4_t* v0) { *r = vneg_s16(*v0); }
void VnegS32(int32x2_t* r, int32x2_t* v0) { *r = vneg_s32(*v0); }
void VnegS64(int64x1_t* r, int64x1_t* v0) { *r = vneg_s64(*v0); }
void VnegF32(float32x2_t* r, float32x2_t* v0) { *r = vneg_f32(*v0); }
void VnegF64(float64x1_t* r, float64x1_t* v0) { *r = vneg_f64(*v0); }
void VnegdS64(int64_t* r, int64_t* v0) { *r = vnegd_s64(*v0); }
void VnegqS8(int8x16_t* r, int8x16_t* v0) { *r = vnegq_s8(*v0); }
void VnegqS16(int16x8_t* r, int16x8_t* v0) { *r = vnegq_s16(*v0); }
void VnegqS32(int32x4_t* r, int32x4_t* v0) { *r = vnegq_s32(*v0); }
void VnegqS64(int64x2_t* r, int64x2_t* v0) { *r = vnegq_s64(*v0); }
void VnegqF32(float32x4_t* r, float32x4_t* v0) { *r = vnegq_f32(*v0); }
void VnegqF64(float64x2_t* r, float64x2_t* v0) { *r = vnegq_f64(*v0); }
void VornS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vorn_s8(*v0, *v1); }
void VornS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vorn_s16(*v0, *v1); }
void VornS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vorn_s32(*v0, *v1); }
void VornS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vorn_s64(*v0, *v1); }
void VornU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vorn_u8(*v0, *v1); }
void VornU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vorn_u16(*v0, *v1); }
void VornU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vorn_u32(*v0, *v1); }
void VornU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vorn_u64(*v0, *v1); }
void VornqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vornq_s8(*v0, *v1); }
void VornqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vornq_s16(*v0, *v1); }
void VornqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vornq_s32(*v0, *v1); }
void VornqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vornq_s64(*v0, *v1); }
void VornqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vornq_u8(*v0, *v1); }
void VornqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vornq_u16(*v0, *v1); }
void VornqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vornq_u32(*v0, *v1); }
void VornqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vornq_u64(*v0, *v1); }
void VorrS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vorr_s8(*v0, *v1); }
void VorrS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vorr_s16(*v0, *v1); }
void VorrS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vorr_s32(*v0, *v1); }
void VorrS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vorr_s64(*v0, *v1); }
void VorrU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vorr_u8(*v0, *v1); }
void VorrU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vorr_u16(*v0, *v1); }
void VorrU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vorr_u32(*v0, *v1); }
void VorrU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vorr_u64(*v0, *v1); }
void VorrqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vorrq_s8(*v0, *v1); }
void VorrqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vorrq_s16(*v0, *v1); }
void VorrqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vorrq_s32(*v0, *v1); }
void VorrqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vorrq_s64(*v0, *v1); }
void VorrqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vorrq_u8(*v0, *v1); }
void VorrqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vorrq_u16(*v0, *v1); }
void VorrqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vorrq_u32(*v0, *v1); }
void VorrqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vorrq_u64(*v0, *v1); }
void VpadalS8(int16x4_t* r, int16x4_t* v0, int8x8_t* v1) { *r = vpadal_s8(*v0, *v1); }
void VpadalS16(int32x2_t* r, int32x2_t* v0, int16x4_t* v1) { *r = vpadal_s16(*v0, *v1); }
void VpadalS32(int64x1_t* r, int64x1_t* v0, int32x2_t* v1) { *r = vpadal_s32(*v0, *v1); }
void VpadalU8(uint16x4_t* r, uint16x4_t* v0, uint8x8_t* v1) { *r = vpadal_u8(*v0, *v1); }
void VpadalU16(uint32x2_t* r, uint32x2_t* v0, uint16x4_t* v1) { *r = vpadal_u16(*v0, *v1); }
void VpadalU32(uint64x1_t* r, uint64x1_t* v0, uint32x2_t* v1) { *r = vpadal_u32(*v0, *v1); }
void VpadalqS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1) { *r = vpadalq_s8(*v0, *v1); }
void VpadalqS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1) { *r = vpadalq_s16(*v0, *v1); }
void VpadalqS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1) { *r = vpadalq_s32(*v0, *v1); }
void VpadalqU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1) { *r = vpadalq_u8(*v0, *v1); }
void VpadalqU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1) { *r = vpadalq_u16(*v0, *v1); }
void VpadalqU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1) { *r = vpadalq_u32(*v0, *v1); }
void VpaddS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vpadd_s8(*v0, *v1); }
void VpaddS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vpadd_s16(*v0, *v1); }
void VpaddS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vpadd_s32(*v0, *v1); }
void VpaddU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vpadd_u8(*v0, *v1); }
void VpaddU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vpadd_u16(*v0, *v1); }
void VpaddU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vpadd_u32(*v0, *v1); }
void VpaddF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vpadd_f32(*v0, *v1); }
void VpadddS64(int64_t* r, int64x2_t* v0) { *r = vpaddd_s64(*v0); }
void VpadddU64(uint64_t* r, uint64x2_t* v0) { *r = vpaddd_u64(*v0); }
void VpadddF64(float64_t* r, float64x2_t* v0) { *r = vpaddd_f64(*v0); }
void VpaddlS8(int16x4_t* r, int8x8_t* v0) { *r = vpaddl_s8(*v0); }
void VpaddlS16(int32x2_t* r, int16x4_t* v0) { *r = vpaddl_s16(*v0); }
void VpaddlS32(int64x1_t* r, int32x2_t* v0) { *r = vpaddl_s32(*v0); }
void VpaddlU8(uint16x4_t* r, uint8x8_t* v0) { *r = vpaddl_u8(*v0); }
void VpaddlU16(uint32x2_t* r, uint16x4_t* v0) { *r = vpaddl_u16(*v0); }
void VpaddlU32(uint64x1_t* r, uint32x2_t* v0) { *r = vpaddl_u32(*v0); }
void VpaddlqS8(int16x8_t* r, int8x16_t* v0) { *r = vpaddlq_s8(*v0); }
void VpaddlqS16(int32x4_t* r, int16x8_t* v0) { *r = vpaddlq_s16(*v0); }
void VpaddlqS32(int64x2_t* r, int32x4_t* v0) { *r = vpaddlq_s32(*v0); }
void VpaddlqU8(uint16x8_t* r, uint8x16_t* v0) { *r = vpaddlq_u8(*v0); }
void VpaddlqU16(uint32x4_t* r, uint16x8_t* v0) { *r = vpaddlq_u16(*v0); }
void VpaddlqU32(uint64x2_t* r, uint32x4_t* v0) { *r = vpaddlq_u32(*v0); }
void VpaddqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vpaddq_s8(*v0, *v1); }
void VpaddqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vpaddq_s16(*v0, *v1); }
void VpaddqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vpaddq_s32(*v0, *v1); }
void VpaddqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vpaddq_s64(*v0, *v1); }
void VpaddqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vpaddq_u8(*v0, *v1); }
void VpaddqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vpaddq_u16(*v0, *v1); }
void VpaddqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vpaddq_u32(*v0, *v1); }
void VpaddqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vpaddq_u64(*v0, *v1); }
void VpaddqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vpaddq_f32(*v0, *v1); }
void VpaddqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vpaddq_f64(*v0, *v1); }
void VpaddsF32(float32_t* r, float32x2_t* v0) { *r = vpadds_f32(*v0); }
void VpmaxS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vpmax_s8(*v0, *v1); }
void VpmaxS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vpmax_s16(*v0, *v1); }
void VpmaxS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vpmax_s32(*v0, *v1); }
void VpmaxU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vpmax_u8(*v0, *v1); }
void VpmaxU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vpmax_u16(*v0, *v1); }
void VpmaxU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vpmax_u32(*v0, *v1); }
void VpmaxF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vpmax_f32(*v0, *v1); }
void VpmaxnmF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vpmaxnm_f32(*v0, *v1); }
void VpmaxnmqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vpmaxnmq_f32(*v0, *v1); }
void VpmaxnmqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vpmaxnmq_f64(*v0, *v1); }
void VpmaxnmqdF64(float64_t* r, float64x2_t* v0) { *r = vpmaxnmqd_f64(*v0); }
void VpmaxnmsF32(float32_t* r, float32x2_t* v0) { *r = vpmaxnms_f32(*v0); }
void VpmaxqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vpmaxq_s8(*v0, *v1); }
void VpmaxqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vpmaxq_s16(*v0, *v1); }
void VpmaxqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vpmaxq_s32(*v0, *v1); }
void VpmaxqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vpmaxq_u8(*v0, *v1); }
void VpmaxqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vpmaxq_u16(*v0, *v1); }
void VpmaxqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vpmaxq_u32(*v0, *v1); }
void VpmaxqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vpmaxq_f32(*v0, *v1); }
void VpmaxqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vpmaxq_f64(*v0, *v1); }
void VpmaxqdF64(float64_t* r, float64x2_t* v0) { *r = vpmaxqd_f64(*v0); }
void VpmaxsF32(float32_t* r, float32x2_t* v0) { *r = vpmaxs_f32(*v0); }
void VpminS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vpmin_s8(*v0, *v1); }
void VpminS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vpmin_s16(*v0, *v1); }
void VpminS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vpmin_s32(*v0, *v1); }
void VpminU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vpmin_u8(*v0, *v1); }
void VpminU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vpmin_u16(*v0, *v1); }
void VpminU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vpmin_u32(*v0, *v1); }
void VpminF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vpmin_f32(*v0, *v1); }
void VpminnmF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vpminnm_f32(*v0, *v1); }
void VpminnmqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vpminnmq_f32(*v0, *v1); }
void VpminnmqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vpminnmq_f64(*v0, *v1); }
void VpminnmqdF64(float64_t* r, float64x2_t* v0) { *r = vpminnmqd_f64(*v0); }
void VpminnmsF32(float32_t* r, float32x2_t* v0) { *r = vpminnms_f32(*v0); }
void VpminqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vpminq_s8(*v0, *v1); }
void VpminqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vpminq_s16(*v0, *v1); }
void VpminqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vpminq_s32(*v0, *v1); }
void VpminqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vpminq_u8(*v0, *v1); }
void VpminqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vpminq_u16(*v0, *v1); }
void VpminqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vpminq_u32(*v0, *v1); }
void VpminqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vpminq_f32(*v0, *v1); }
void VpminqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vpminq_f64(*v0, *v1); }
void VpminqdF64(float64_t* r, float64x2_t* v0) { *r = vpminqd_f64(*v0); }
void VpminsF32(float32_t* r, float32x2_t* v0) { *r = vpmins_f32(*v0); }
void VqabsS8(int8x8_t* r, int8x8_t* v0) { *r = vqabs_s8(*v0); }
void VqabsS16(int16x4_t* r, int16x4_t* v0) { *r = vqabs_s16(*v0); }
void VqabsS32(int32x2_t* r, int32x2_t* v0) { *r = vqabs_s32(*v0); }
void VqabsS64(int64x1_t* r, int64x1_t* v0) { *r = vqabs_s64(*v0); }
void VqabsbS8(int8_t* r, int8_t* v0) { *r = vqabsb_s8(*v0); }
void VqabsdS64(int64_t* r, int64_t* v0) { *r = vqabsd_s64(*v0); }
void VqabshS16(int16_t* r, int16_t* v0) { *r = vqabsh_s16(*v0); }
void VqabsqS8(int8x16_t* r, int8x16_t* v0) { *r = vqabsq_s8(*v0); }
void VqabsqS16(int16x8_t* r, int16x8_t* v0) { *r = vqabsq_s16(*v0); }
void VqabsqS32(int32x4_t* r, int32x4_t* v0) { *r = vqabsq_s32(*v0); }
void VqabsqS64(int64x2_t* r, int64x2_t* v0) { *r = vqabsq_s64(*v0); }
void VqabssS32(int32_t* r, int32_t* v0) { *r = vqabss_s32(*v0); }
void VqaddS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vqadd_s8(*v0, *v1); }
void VqaddS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqadd_s16(*v0, *v1); }
void VqaddS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqadd_s32(*v0, *v1); }
void VqaddS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vqadd_s64(*v0, *v1); }
void VqaddU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vqadd_u8(*v0, *v1); }
void VqaddU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vqadd_u16(*v0, *v1); }
void VqaddU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vqadd_u32(*v0, *v1); }
void VqaddU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vqadd_u64(*v0, *v1); }
void VqaddbS8(int8_t* r, int8_t* v0, int8_t* v1) { *r = vqaddb_s8(*v0, *v1); }
void VqaddbU8(uint8_t* r, uint8_t* v0, uint8_t* v1) { *r = vqaddb_u8(*v0, *v1); }
void VqadddS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vqaddd_s64(*v0, *v1); }
void VqadddU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vqaddd_u64(*v0, *v1); }
void VqaddhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqaddh_s16(*v0, *v1); }
void VqaddhU16(uint16_t* r, uint16_t* v0, uint16_t* v1) { *r = vqaddh_u16(*v0, *v1); }
void VqaddqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vqaddq_s8(*v0, *v1); }
void VqaddqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqaddq_s16(*v0, *v1); }
void VqaddqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqaddq_s32(*v0, *v1); }
void VqaddqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vqaddq_s64(*v0, *v1); }
void VqaddqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vqaddq_u8(*v0, *v1); }
void VqaddqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vqaddq_u16(*v0, *v1); }
void VqaddqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vqaddq_u32(*v0, *v1); }
void VqaddqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vqaddq_u64(*v0, *v1); }
void VqaddsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqadds_s32(*v0, *v1); }
void VqaddsU32(uint32_t* r, uint32_t* v0, uint32_t* v1) { *r = vqadds_u32(*v0, *v1); }
void VqdmlalS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vqdmlal_s16(*v0, *v1, *v2); }
void VqdmlalS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vqdmlal_s32(*v0, *v1, *v2); }
void VqdmlalHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vqdmlal_high_s16(*v0, *v1, *v2); }
void VqdmlalHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vqdmlal_high_s32(*v0, *v1, *v2); }
void VqdmlalHighNS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16_t* v2) { *r = vqdmlal_high_n_s16(*v0, *v1, *v2); }
void VqdmlalHighNS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32_t* v2) { *r = vqdmlal_high_n_s32(*v0, *v1, *v2); }
void VqdmlalNS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16_t* v2) { *r = vqdmlal_n_s16(*v0, *v1, *v2); }
void VqdmlalNS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32_t* v2) { *r = vqdmlal_n_s32(*v0, *v1, *v2); }
void VqdmlalhS16(int32_t* r, int32_t* v0, int16_t* v1, int16_t* v2) { *r = vqdmlalh_s16(*v0, *v1, *v2); }
void VqdmlalsS32(int64_t* r, int64_t* v0, int32_t* v1, int32_t* v2) { *r = vqdmlals_s32(*v0, *v1, *v2); }
void VqdmlslS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vqdmlsl_s16(*v0, *v1, *v2); }
void VqdmlslS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vqdmlsl_s32(*v0, *v1, *v2); }
void VqdmlslHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vqdmlsl_high_s16(*v0, *v1, *v2); }
void VqdmlslHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vqdmlsl_high_s32(*v0, *v1, *v2); }
void VqdmlslHighNS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16_t* v2) { *r = vqdmlsl_high_n_s16(*v0, *v1, *v2); }
void VqdmlslHighNS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32_t* v2) { *r = vqdmlsl_high_n_s32(*v0, *v1, *v2); }
void VqdmlslNS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16_t* v2) { *r = vqdmlsl_n_s16(*v0, *v1, *v2); }
void VqdmlslNS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32_t* v2) { *r = vqdmlsl_n_s32(*v0, *v1, *v2); }
void VqdmlslhS16(int32_t* r, int32_t* v0, int16_t* v1, int16_t* v2) { *r = vqdmlslh_s16(*v0, *v1, *v2); }
void VqdmlslsS32(int64_t* r, int64_t* v0, int32_t* v1, int32_t* v2) { *r = vqdmlsls_s32(*v0, *v1, *v2); }
void VqdmulhS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqdmulh_s16(*v0, *v1); }
void VqdmulhS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqdmulh_s32(*v0, *v1); }
void VqdmulhNS16(int16x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vqdmulh_n_s16(*v0, *v1); }
void VqdmulhNS32(int32x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vqdmulh_n_s32(*v0, *v1); }
void VqdmulhhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqdmulhh_s16(*v0, *v1); }
void VqdmulhqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqdmulhq_s16(*v0, *v1); }
void VqdmulhqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqdmulhq_s32(*v0, *v1); }
void VqdmulhqNS16(int16x8_t* r, int16x8_t* v0, int16_t* v1) { *r = vqdmulhq_n_s16(*v0, *v1); }
void VqdmulhqNS32(int32x4_t* r, int32x4_t* v0, int32_t* v1) { *r = vqdmulhq_n_s32(*v0, *v1); }
void VqdmulhsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqdmulhs_s32(*v0, *v1); }
void VqdmullS16(int32x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqdmull_s16(*v0, *v1); }
void VqdmullS32(int64x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqdmull_s32(*v0, *v1); }
void VqdmullHighS16(int32x4_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqdmull_high_s16(*v0, *v1); }
void VqdmullHighS32(int64x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqdmull_high_s32(*v0, *v1); }
void VqdmullHighNS16(int32x4_t* r, int16x8_t* v0, int16_t* v1) { *r = vqdmull_high_n_s16(*v0, *v1); }
void VqdmullHighNS32(int64x2_t* r, int32x4_t* v0, int32_t* v1) { *r = vqdmull_high_n_s32(*v0, *v1); }
void VqdmullNS16(int32x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vqdmull_n_s16(*v0, *v1); }
void VqdmullNS32(int64x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vqdmull_n_s32(*v0, *v1); }
void VqdmullhS16(int32_t* r, int16_t* v0, int16_t* v1) { *r = vqdmullh_s16(*v0, *v1); }
void VqdmullsS32(int64_t* r, int32_t* v0, int32_t* v1) { *r = vqdmulls_s32(*v0, *v1); }
void VqmovnS16(int8x8_t* r, int16x8_t* v0) { *r = vqmovn_s16(*v0); }
void VqmovnS32(int16x4_t* r, int32x4_t* v0) { *r = vqmovn_s32(*v0); }
void VqmovnS64(int32x2_t* r, int64x2_t* v0) { *r = vqmovn_s64(*v0); }
void VqmovnU16(uint8x8_t* r, uint16x8_t* v0) { *r = vqmovn_u16(*v0); }
void VqmovnU32(uint16x4_t* r, uint32x4_t* v0) { *r = vqmovn_u32(*v0); }
void VqmovnU64(uint32x2_t* r, uint64x2_t* v0) { *r = vqmovn_u64(*v0); }
void VqmovnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1) { *r = vqmovn_high_s16(*v0, *v1); }
void VqmovnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1) { *r = vqmovn_high_s32(*v0, *v1); }
void VqmovnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1) { *r = vqmovn_high_s64(*v0, *v1); }
void VqmovnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1) { *r = vqmovn_high_u16(*v0, *v1); }
void VqmovnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1) { *r = vqmovn_high_u32(*v0, *v1); }
void VqmovnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1) { *r = vqmovn_high_u64(*v0, *v1); }
void VqmovndS64(int32_t* r, int64_t* v0) { *r = vqmovnd_s64(*v0); }
void VqmovndU64(uint32_t* r, uint64_t* v0) { *r = vqmovnd_u64(*v0); }
void VqmovnhS16(int8_t* r, int16_t* v0) { *r = vqmovnh_s16(*v0); }
void VqmovnhU16(uint8_t* r, uint16_t* v0) { *r = vqmovnh_u16(*v0); }
void VqmovnsS32(int16_t* r, int32_t* v0) { *r = vqmovns_s32(*v0); }
void VqmovnsU32(uint16_t* r, uint32_t* v0) { *r = vqmovns_u32(*v0); }
void VqmovunS16(uint8x8_t* r, int16x8_t* v0) { *r = vqmovun_s16(*v0); }
void VqmovunS32(uint16x4_t* r, int32x4_t* v0) { *r = vqmovun_s32(*v0); }
void VqmovunS64(uint32x2_t* r, int64x2_t* v0) { *r = vqmovun_s64(*v0); }
void VqmovunHighS16(uint8x16_t* r, uint8x8_t* v0, int16x8_t* v1) { *r = vqmovun_high_s16(*v0, *v1); }
void VqmovunHighS32(uint16x8_t* r, uint16x4_t* v0, int32x4_t* v1) { *r = vqmovun_high_s32(*v0, *v1); }
void VqmovunHighS64(uint32x4_t* r, uint32x2_t* v0, int64x2_t* v1) { *r = vqmovun_high_s64(*v0, *v1); }
void VqmovundS64(uint32_t* r, int64_t* v0) { *r = vqmovund_s64(*v0); }
void VqmovunhS16(uint8_t* r, int16_t* v0) { *r = vqmovunh_s16(*v0); }
void VqmovunsS32(uint16_t* r, int32_t* v0) { *r = vqmovuns_s32(*v0); }
void VqnegS8(int8x8_t* r, int8x8_t* v0) { *r = vqneg_s8(*v0); }
void VqnegS16(int16x4_t* r, int16x4_t* v0) { *r = vqneg_s16(*v0); }
void VqnegS32(int32x2_t* r, int32x2_t* v0) { *r = vqneg_s32(*v0); }
void VqnegS64(int64x1_t* r, int64x1_t* v0) { *r = vqneg_s64(*v0); }
void VqnegbS8(int8_t* r, int8_t* v0) { *r = vqnegb_s8(*v0); }
void VqnegdS64(int64_t* r, int64_t* v0) { *r = vqnegd_s64(*v0); }
void VqneghS16(int16_t* r, int16_t* v0) { *r = vqnegh_s16(*v0); }
void VqnegqS8(int8x16_t* r, int8x16_t* v0) { *r = vqnegq_s8(*v0); }
void VqnegqS16(int16x8_t* r, int16x8_t* v0) { *r = vqnegq_s16(*v0); }
void VqnegqS32(int32x4_t* r, int32x4_t* v0) { *r = vqnegq_s32(*v0); }
void VqnegqS64(int64x2_t* r, int64x2_t* v0) { *r = vqnegq_s64(*v0); }
void VqnegsS32(int32_t* r, int32_t* v0) { *r = vqnegs_s32(*v0); }
void VqrdmlahS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vqrdmlah_s16(*v0, *v1, *v2); }
void VqrdmlahS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vqrdmlah_s32(*v0, *v1, *v2); }
void VqrdmlahhS16(int16_t* r, int16_t* v0, int16_t* v1, int16_t* v2) { *r = vqrdmlahh_s16(*v0, *v1, *v2); }
void VqrdmlahqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vqrdmlahq_s16(*v0, *v1, *v2); }
void VqrdmlahqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vqrdmlahq_s32(*v0, *v1, *v2); }
void VqrdmlahsS32(int32_t* r, int32_t* v0, int32_t* v1, int32_t* v2) { *r = vqrdmlahs_s32(*v0, *v1, *v2); }
void VqrdmlshS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* v2) { *r = vqrdmlsh_s16(*v0, *v1, *v2); }
void VqrdmlshS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* v2) { *r = vqrdmlsh_s32(*v0, *v1, *v2); }
void VqrdmlshhS16(int16_t* r, int16_t* v0, int16_t* v1, int16_t* v2) { *r = vqrdmlshh_s16(*v0, *v1, *v2); }
void VqrdmlshqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vqrdmlshq_s16(*v0, *v1, *v2); }
void VqrdmlshqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vqrdmlshq_s32(*v0, *v1, *v2); }
void VqrdmlshsS32(int32_t* r, int32_t* v0, int32_t* v1, int32_t* v2) { *r = vqrdmlshs_s32(*v0, *v1, *v2); }
void VqrdmulhS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqrdmulh_s16(*v0, *v1); }
void VqrdmulhS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqrdmulh_s32(*v0, *v1); }
void VqrdmulhNS16(int16x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vqrdmulh_n_s16(*v0, *v1); }
void VqrdmulhNS32(int32x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vqrdmulh_n_s32(*v0, *v1); }
void VqrdmulhhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqrdmulhh_s16(*v0, *v1); }
void VqrdmulhqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqrdmulhq_s16(*v0, *v1); }
void VqrdmulhqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqrdmulhq_s32(*v0, *v1); }
void VqrdmulhqNS16(int16x8_t* r, int16x8_t* v0, int16_t* v1) { *r = vqrdmulhq_n_s16(*v0, *v1); }
void VqrdmulhqNS32(int32x4_t* r, int32x4_t* v0, int32_t* v1) { *r = vqrdmulhq_n_s32(*v0, *v1); }
void VqrdmulhsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqrdmulhs_s32(*v0, *v1); }
void VqrshlS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vqrshl_s8(*v0, *v1); }
void VqrshlS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqrshl_s16(*v0, *v1); }
void VqrshlS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqrshl_s32(*v0, *v1); }
void VqrshlS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vqrshl_s64(*v0, *v1); }
void VqrshlU8(uint8x8_t* r, uint8x8_t* v0, int8x8_t* v1) { *r = vqrshl_u8(*v0, *v1); }
void VqrshlU16(uint16x4_t* r, uint16x4_t* v0, int16x4_t* v1) { *r = vqrshl_u16(*v0, *v1); }
void VqrshlU32(uint32x2_t* r, uint32x2_t* v0, int32x2_t* v1) { *r = vqrshl_u32(*v0, *v1); }
void VqrshlU64(uint64x1_t* r, uint64x1_t* v0, int64x1_t* v1) { *r = vqrshl_u64(*v0, *v1); }
void VqrshlbS8(int8_t* r, int8_t* v0, int8_t* v1) { *r = vqrshlb_s8(*v0, *v1); }
void VqrshlbU8(uint8_t* r, uint8_t* v0, int8_t* v1) { *r = vqrshlb_u8(*v0, *v1); }
void VqrshldS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vqrshld_s64(*v0, *v1); }
void VqrshldU64(uint64_t* r, uint64_t* v0, int64_t* v1) { *r = vqrshld_u64(*v0, *v1); }
void VqrshlhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqrshlh_s16(*v0, *v1); }
void VqrshlhU16(uint16_t* r, uint16_t* v0, int16_t* v1) { *r = vqrshlh_u16(*v0, *v1); }
void VqrshlqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vqrshlq_s8(*v0, *v1); }
void VqrshlqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqrshlq_s16(*v0, *v1); }
void VqrshlqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqrshlq_s32(*v0, *v1); }
void VqrshlqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vqrshlq_s64(*v0, *v1); }
void VqrshlqU8(uint8x16_t* r, uint8x16_t* v0, int8x16_t* v1) { *r = vqrshlq_u8(*v0, *v1); }
void VqrshlqU16(uint16x8_t* r, uint16x8_t* v0, int16x8_t* v1) { *r = vqrshlq_u16(*v0, *v1); }
void VqrshlqU32(uint32x4_t* r, uint32x4_t* v0, int32x4_t* v1) { *r = vqrshlq_u32(*v0, *v1); }
void VqrshlqU64(uint64x2_t* r, uint64x2_t* v0, int64x2_t* v1) { *r = vqrshlq_u64(*v0, *v1); }
void VqrshlsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqrshls_s32(*v0, *v1); }
void VqrshlsU32(uint32_t* r, uint32_t* v0, int32_t* v1) { *r = vqrshls_u32(*v0, *v1); }
void VqshlS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vqshl_s8(*v0, *v1); }
void VqshlS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqshl_s16(*v0, *v1); }
void VqshlS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqshl_s32(*v0, *v1); }
void VqshlS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vqshl_s64(*v0, *v1); }
void VqshlU8(uint8x8_t* r, uint8x8_t* v0, int8x8_t* v1) { *r = vqshl_u8(*v0, *v1); }
void VqshlU16(uint16x4_t* r, uint16x4_t* v0, int16x4_t* v1) { *r = vqshl_u16(*v0, *v1); }
void VqshlU32(uint32x2_t* r, uint32x2_t* v0, int32x2_t* v1) { *r = vqshl_u32(*v0, *v1); }
void VqshlU64(uint64x1_t* r, uint64x1_t* v0, int64x1_t* v1) { *r = vqshl_u64(*v0, *v1); }
void VqshlbS8(int8_t* r, int8_t* v0, int8_t* v1) { *r = vqshlb_s8(*v0, *v1); }
void VqshlbU8(uint8_t* r, uint8_t* v0, int8_t* v1) { *r = vqshlb_u8(*v0, *v1); }
void VqshldS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vqshld_s64(*v0, *v1); }
void VqshldU64(uint64_t* r, uint64_t* v0, int64_t* v1) { *r = vqshld_u64(*v0, *v1); }
void VqshlhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqshlh_s16(*v0, *v1); }
void VqshlhU16(uint16_t* r, uint16_t* v0, int16_t* v1) { *r = vqshlh_u16(*v0, *v1); }
void VqshlqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vqshlq_s8(*v0, *v1); }
void VqshlqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqshlq_s16(*v0, *v1); }
void VqshlqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqshlq_s32(*v0, *v1); }
void VqshlqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vqshlq_s64(*v0, *v1); }
void VqshlqU8(uint8x16_t* r, uint8x16_t* v0, int8x16_t* v1) { *r = vqshlq_u8(*v0, *v1); }
void VqshlqU16(uint16x8_t* r, uint16x8_t* v0, int16x8_t* v1) { *r = vqshlq_u16(*v0, *v1); }
void VqshlqU32(uint32x4_t* r, uint32x4_t* v0, int32x4_t* v1) { *r = vqshlq_u32(*v0, *v1); }
void VqshlqU64(uint64x2_t* r, uint64x2_t* v0, int64x2_t* v1) { *r = vqshlq_u64(*v0, *v1); }
void VqshlsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqshls_s32(*v0, *v1); }
void VqshlsU32(uint32_t* r, uint32_t* v0, int32_t* v1) { *r = vqshls_u32(*v0, *v1); }
void VqsubS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vqsub_s8(*v0, *v1); }
void VqsubS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqsub_s16(*v0, *v1); }
void VqsubS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqsub_s32(*v0, *v1); }
void VqsubS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vqsub_s64(*v0, *v1); }
void VqsubU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vqsub_u8(*v0, *v1); }
void VqsubU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vqsub_u16(*v0, *v1); }
void VqsubU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vqsub_u32(*v0, *v1); }
void VqsubU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vqsub_u64(*v0, *v1); }
void VqsubbS8(int8_t* r, int8_t* v0, int8_t* v1) { *r = vqsubb_s8(*v0, *v1); }
void VqsubbU8(uint8_t* r, uint8_t* v0, uint8_t* v1) { *r = vqsubb_u8(*v0, *v1); }
void VqsubdS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vqsubd_s64(*v0, *v1); }
void VqsubdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vqsubd_u64(*v0, *v1); }
void VqsubhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqsubh_s16(*v0, *v1); }
void VqsubhU16(uint16_t* r, uint16_t* v0, uint16_t* v1) { *r = vqsubh_u16(*v0, *v1); }
void VqsubqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vqsubq_s8(*v0, *v1); }
void VqsubqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqsubq_s16(*v0, *v1); }
void VqsubqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqsubq_s32(*v0, *v1); }
void VqsubqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vqsubq_s64(*v0, *v1); }
void VqsubqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vqsubq_u8(*v0, *v1); }
void VqsubqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vqsubq_u16(*v0, *v1); }
void VqsubqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vqsubq_u32(*v0, *v1); }
void VqsubqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vqsubq_u64(*v0, *v1); }
void VqsubsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqsubs_s32(*v0, *v1); }
void VqsubsU32(uint32_t* r, uint32_t* v0, uint32_t* v1) { *r = vqsubs_u32(*v0, *v1); }
void Vqtbl1S8(int8x8_t* r, int8x16_t* v0, uint8x8_t* v1) { *r = vqtbl1_s8(*v0, *v1); }
void Vqtbl1U8(uint8x8_t* r, uint8x16_t* v0, uint8x8_t* v1) { *r = vqtbl1_u8(*v0, *v1); }
void Vqtbl1P8(poly8x8_t* r, poly8x16_t* v0, uint8x8_t* v1) { *r = vqtbl1_p8(*v0, *v1); }
void Vqtbl1QS8(int8x16_t* r, int8x16_t* v0, uint8x16_t* v1) { *r = vqtbl1q_s8(*v0, *v1); }
void Vqtbl1QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vqtbl1q_u8(*v0, *v1); }
void Vqtbl1QP8(poly8x16_t* r, poly8x16_t* v0, uint8x16_t* v1) { *r = vqtbl1q_p8(*v0, *v1); }
void Vqtbl2S8(int8x8_t* r, int8x16x2_t* v0, uint8x8_t* v1) { *r = vqtbl2_s8(*v0, *v1); }
void Vqtbl2U8(uint8x8_t* r, uint8x16x2_t* v0, uint8x8_t* v1) { *r = vqtbl2_u8(*v0, *v1); }
void Vqtbl2P8(poly8x8_t* r, poly8x16x2_t* v0, uint8x8_t* v1) { *r = vqtbl2_p8(*v0, *v1); }
void Vqtbl2QS8(int8x16_t* r, int8x16x2_t* v0, uint8x16_t* v1) { *r = vqtbl2q_s8(*v0, *v1); }
void Vqtbl2QU8(uint8x16_t* r, uint8x16x2_t* v0, uint8x16_t* v1) { *r = vqtbl2q_u8(*v0, *v1); }
void Vqtbl2QP8(poly8x16_t* r, poly8x16x2_t* v0, uint8x16_t* v1) { *r = vqtbl2q_p8(*v0, *v1); }
void Vqtbl3S8(int8x8_t* r, int8x16x3_t* v0, uint8x8_t* v1) { *r = vqtbl3_s8(*v0, *v1); }
void Vqtbl3U8(uint8x8_t* r, uint8x16x3_t* v0, uint8x8_t* v1) { *r = vqtbl3_u8(*v0, *v1); }
void Vqtbl3P8(poly8x8_t* r, poly8x16x3_t* v0, uint8x8_t* v1) { *r = vqtbl3_p8(*v0, *v1); }
void Vqtbl3QS8(int8x16_t* r, int8x16x3_t* v0, uint8x16_t* v1) { *r = vqtbl3q_s8(*v0, *v1); }
void Vqtbl3QU8(uint8x16_t* r, uint8x16x3_t* v0, uint8x16_t* v1) { *r = vqtbl3q_u8(*v0, *v1); }
void Vqtbl3QP8(poly8x16_t* r, poly8x16x3_t* v0, uint8x16_t* v1) { *r = vqtbl3q_p8(*v0, *v1); }
void Vqtbl4S8(int8x8_t* r, int8x16x4_t* v0, uint8x8_t* v1) { *r = vqtbl4_s8(*v0, *v1); }
void Vqtbl4U8(uint8x8_t* r, uint8x16x4_t* v0, uint8x8_t* v1) { *r = vqtbl4_u8(*v0, *v1); }
void Vqtbl4P8(poly8x8_t* r, poly8x16x4_t* v0, uint8x8_t* v1) { *r = vqtbl4_p8(*v0, *v1); }
void Vqtbl4QS8(int8x16_t* r, int8x16x4_t* v0, uint8x16_t* v1) { *r = vqtbl4q_s8(*v0, *v1); }
void Vqtbl4QU8(uint8x16_t* r, uint8x16x4_t* v0, uint8x16_t* v1) { *r = vqtbl4q_u8(*v0, *v1); }
void Vqtbl4QP8(poly8x16_t* r, poly8x16x4_t* v0, uint8x16_t* v1) { *r = vqtbl4q_p8(*v0, *v1); }
void Vqtbx1S8(int8x8_t* r, int8x8_t* v0, int8x16_t* v1, uint8x8_t* v2) { *r = vqtbx1_s8(*v0, *v1, *v2); }
void Vqtbx1U8(uint8x8_t* r, uint8x8_t* v0, uint8x16_t* v1, uint8x8_t* v2) { *r = vqtbx1_u8(*v0, *v1, *v2); }
void Vqtbx1P8(poly8x8_t* r, poly8x8_t* v0, poly8x16_t* v1, uint8x8_t* v2) { *r = vqtbx1_p8(*v0, *v1, *v2); }
void Vqtbx1QS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, uint8x16_t* v2) { *r = vqtbx1q_s8(*v0, *v1, *v2); }
void Vqtbx1QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* v2) { *r = vqtbx1q_u8(*v0, *v1, *v2); }
void Vqtbx1QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16_t* v1, uint8x16_t* v2) { *r = vqtbx1q_p8(*v0, *v1, *v2); }
void Vqtbx2S8(int8x8_t* r, int8x8_t* v0, int8x16x2_t* v1, uint8x8_t* v2) { *r = vqtbx2_s8(*v0, *v1, *v2); }
void Vqtbx2U8(uint8x8_t* r, uint8x8_t* v0, uint8x16x2_t* v1, uint8x8_t* v2) { *r = vqtbx2_u8(*v0, *v1, *v2); }
void Vqtbx2P8(poly8x8_t* r, poly8x8_t* v0, poly8x16x2_t* v1, uint8x8_t* v2) { *r = vqtbx2_p8(*v0, *v1, *v2); }
void Vqtbx2QS8(int8x16_t* r, int8x16_t* v0, int8x16x2_t* v1, uint8x16_t* v2) { *r = vqtbx2q_s8(*v0, *v1, *v2); }
void Vqtbx2QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16x2_t* v1, uint8x16_t* v2) { *r = vqtbx2q_u8(*v0, *v1, *v2); }
void Vqtbx2QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16x2_t* v1, uint8x16_t* v2) { *r = vqtbx2q_p8(*v0, *v1, *v2); }
void Vqtbx3S8(int8x8_t* r, int8x8_t* v0, int8x16x3_t* v1, uint8x8_t* v2) { *r = vqtbx3_s8(*v0, *v1, *v2); }
void Vqtbx3U8(uint8x8_t* r, uint8x8_t* v0, uint8x16x3_t* v1, uint8x8_t* v2) { *r = vqtbx3_u8(*v0, *v1, *v2); }
void Vqtbx3P8(poly8x8_t* r, poly8x8_t* v0, poly8x16x3_t* v1, uint8x8_t* v2) { *r = vqtbx3_p8(*v0, *v1, *v2); }
void Vqtbx3QS8(int8x16_t* r, int8x16_t* v0, int8x16x3_t* v1, uint8x16_t* v2) { *r = vqtbx3q_s8(*v0, *v1, *v2); }
void Vqtbx3QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16x3_t* v1, uint8x16_t* v2) { *r = vqtbx3q_u8(*v0, *v1, *v2); }
void Vqtbx3QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16x3_t* v1, uint8x16_t* v2) { *r = vqtbx3q_p8(*v0, *v1, *v2); }
void Vqtbx4S8(int8x8_t* r, int8x8_t* v0, int8x16x4_t* v1, uint8x8_t* v2) { *r = vqtbx4_s8(*v0, *v1, *v2); }
void Vqtbx4U8(uint8x8_t* r, uint8x8_t* v0, uint8x16x4_t* v1, uint8x8_t* v2) { *r = vqtbx4_u8(*v0, *v1, *v2); }
void Vqtbx4P8(poly8x8_t* r, poly8x8_t* v0, poly8x16x4_t* v1, uint8x8_t* v2) { *r = vqtbx4_p8(*v0, *v1, *v2); }
void Vqtbx4QS8(int8x16_t* r, int8x16_t* v0, int8x16x4_t* v1, uint8x16_t* v2) { *r = vqtbx4q_s8(*v0, *v1, *v2); }
void Vqtbx4QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16x4_t* v1, uint8x16_t* v2) { *r = vqtbx4q_u8(*v0, *v1, *v2); }
void Vqtbx4QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16x4_t* v1, uint8x16_t* v2) { *r = vqtbx4q_p8(*v0, *v1, *v2); }
void VraddhnS16(int8x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vraddhn_s16(*v0, *v1); }
void VraddhnS32(int16x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vraddhn_s32(*v0, *v1); }
void VraddhnS64(int32x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vraddhn_s64(*v0, *v1); }
void VraddhnU16(uint8x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vraddhn_u16(*v0, *v1); }
void VraddhnU32(uint16x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vraddhn_u32(*v0, *v1); }
void VraddhnU64(uint32x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vraddhn_u64(*v0, *v1); }
void VraddhnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vraddhn_high_s16(*v0, *v1, *v2); }
void VraddhnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vraddhn_high_s32(*v0, *v1, *v2); }
void VraddhnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1, int64x2_t* v2) { *r = vraddhn_high_s64(*v0, *v1, *v2); }
void VraddhnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vraddhn_high_u16(*v0, *v1, *v2); }
void VraddhnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vraddhn_high_u32(*v0, *v1, *v2); }
void VraddhnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = vraddhn_high_u64(*v0, *v1, *v2); }
void Vrax1QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vrax1q_u64(*v0, *v1); }
void VrbitS8(int8x8_t* r, int8x8_t* v0) { *r = vrbit_s8(*v0); }
void VrbitU8(uint8x8_t* r, uint8x8_t* v0) { *r = vrbit_u8(*v0); }
void VrbitP8(poly8x8_t* r, poly8x8_t* v0) { *r = vrbit_p8(*v0); }
void VrbitqS8(int8x16_t* r, int8x16_t* v0) { *r = vrbitq_s8(*v0); }
void VrbitqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vrbitq_u8(*v0); }
void VrbitqP8(poly8x16_t* r, poly8x16_t* v0) { *r = vrbitq_p8(*v0); }
void VrecpeU32(uint32x2_t* r, uint32x2_t* v0) { *r = vrecpe_u32(*v0); }
void VrecpeF32(float32x2_t* r, float32x2_t* v0) { *r = vrecpe_f32(*v0); }
void VrecpeF64(float64x1_t* r, float64x1_t* v0) { *r = vrecpe_f64(*v0); }
void VrecpedF64(float64_t* r, float64_t* v0) { *r = vrecped_f64(*v0); }
void VrecpeqU32(uint32x4_t* r, uint32x4_t* v0) { *r = vrecpeq_u32(*v0); }
void VrecpeqF32(float32x4_t* r, float32x4_t* v0) { *r = vrecpeq_f32(*v0); }
void VrecpeqF64(float64x2_t* r, float64x2_t* v0) { *r = vrecpeq_f64(*v0); }
void VrecpesF32(float32_t* r, float32_t* v0) { *r = vrecpes_f32(*v0); }
void VrecpsF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vrecps_f32(*v0, *v1); }
void VrecpsF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vrecps_f64(*v0, *v1); }
void VrecpsdF64(float64_t* r, float64_t* v0, float64_t* v1) { *r = vrecpsd_f64(*v0, *v1); }
void VrecpsqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vrecpsq_f32(*v0, *v1); }
void VrecpsqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vrecpsq_f64(*v0, *v1); }
void VrecpssF32(float32_t* r, float32_t* v0, float32_t* v1) { *r = vrecpss_f32(*v0, *v1); }
void VrecpxdF64(float64_t* r, float64_t* v0) { *r = vrecpxd_f64(*v0); }
void VrecpxsF32(float32_t* r, float32_t* v0) { *r = vrecpxs_f32(*v0); }
void VreinterpretF32S8(float32x2_t* r, int8x8_t* v0) { *r = vreinterpret_f32_s8(*v0); }
void VreinterpretF32S16(float32x2_t* r, int16x4_t* v0) { *r = vreinterpret_f32_s16(*v0); }
void VreinterpretF32S32(float32x2_t* r, int32x2_t* v0) { *r = vreinterpret_f32_s32(*v0); }
void VreinterpretF32S64(float32x2_t* r, int64x1_t* v0) { *r = vreinterpret_f32_s64(*v0); }
void VreinterpretF32U8(float32x2_t* r, uint8x8_t* v0) { *r = vreinterpret_f32_u8(*v0); }
void VreinterpretF32U16(float32x2_t* r, uint16x4_t* v0) { *r = vreinterpret_f32_u16(*v0); }
void VreinterpretF32U32(float32x2_t* r, uint32x2_t* v0) { *r = vreinterpret_f32_u32(*v0); }
void VreinterpretF32U64(float32x2_t* r, uint64x1_t* v0) { *r = vreinterpret_f32_u64(*v0); }
void VreinterpretF32F64(float32x2_t* r, float64x1_t* v0) { *r = vreinterpret_f32_f64(*v0); }
void VreinterpretF32P16(float32x2_t* r, poly16x4_t* v0) { *r = vreinterpret_f32_p16(*v0); }
void VreinterpretF32P64(float32x2_t* r, poly64x1_t* v0) { *r = vreinterpret_f32_p64(*v0); }
void VreinterpretF32P8(float32x2_t* r, poly8x8_t* v0) { *r = vreinterpret_f32_p8(*v0); }
void VreinterpretF64S8(float64x1_t* r, int8x8_t* v0) { *r = vreinterpret_f64_s8(*v0); }
void VreinterpretF64S16(float64x1_t* r, int16x4_t* v0) { *r = vreinterpret_f64_s16(*v0); }
void VreinterpretF64S32(float64x1_t* r, int32x2_t* v0) { *r = vreinterpret_f64_s32(*v0); }
void VreinterpretF64S64(float64x1_t* r, int64x1_t* v0) { *r = vreinterpret_f64_s64(*v0); }
void VreinterpretF64U8(float64x1_t* r, uint8x8_t* v0) { *r = vreinterpret_f64_u8(*v0); }
void VreinterpretF64U16(float64x1_t* r, uint16x4_t* v0) { *r = vreinterpret_f64_u16(*v0); }
void VreinterpretF64U32(float64x1_t* r, uint32x2_t* v0) { *r = vreinterpret_f64_u32(*v0); }
void VreinterpretF64U64(float64x1_t* r, uint64x1_t* v0) { *r = vreinterpret_f64_u64(*v0); }
void VreinterpretF64F32(float64x1_t* r, float32x2_t* v0) { *r = vreinterpret_f64_f32(*v0); }
void VreinterpretF64P16(float64x1_t* r, poly16x4_t* v0) { *r = vreinterpret_f64_p16(*v0); }
void VreinterpretF64P64(float64x1_t* r, poly64x1_t* v0) { *r = vreinterpret_f64_p64(*v0); }
void VreinterpretF64P8(float64x1_t* r, poly8x8_t* v0) { *r = vreinterpret_f64_p8(*v0); }
void VreinterpretP16S8(poly16x4_t* r, int8x8_t* v0) { *r = vreinterpret_p16_s8(*v0); }
void VreinterpretP16S16(poly16x4_t* r, int16x4_t* v0) { *r = vreinterpret_p16_s16(*v0); }
void VreinterpretP16S32(poly16x4_t* r, int32x2_t* v0) { *r = vreinterpret_p16_s32(*v0); }
void VreinterpretP16S64(poly16x4_t* r, int64x1_t* v0) { *r = vreinterpret_p16_s64(*v0); }
void VreinterpretP16U8(poly16x4_t* r, uint8x8_t* v0) { *r = vreinterpret_p16_u8(*v0); }
void VreinterpretP16U16(poly16x4_t* r, uint16x4_t* v0) { *r = vreinterpret_p16_u16(*v0); }
void VreinterpretP16U32(poly16x4_t* r, uint32x2_t* v0) { *r = vreinterpret_p16_u32(*v0); }
void VreinterpretP16U64(poly16x4_t* r, uint64x1_t* v0) { *r = vreinterpret_p16_u64(*v0); }
void VreinterpretP16F32(poly16x4_t* r, float32x2_t* v0) { *r = vreinterpret_p16_f32(*v0); }
void VreinterpretP16F64(poly16x4_t* r, float64x1_t* v0) { *r = vreinterpret_p16_f64(*v0); }
void VreinterpretP16P64(poly16x4_t* r, poly64x1_t* v0) { *r = vreinterpret_p16_p64(*v0); }
void VreinterpretP16P8(poly16x4_t* r, poly8x8_t* v0) { *r = vreinterpret_p16_p8(*v0); }
void VreinterpretP64S8(poly64x1_t* r, int8x8_t* v0) { *r = vreinterpret_p64_s8(*v0); }
void VreinterpretP64S16(poly64x1_t* r, int16x4_t* v0) { *r = vreinterpret_p64_s16(*v0); }
void VreinterpretP64S32(poly64x1_t* r, int32x2_t* v0) { *r = vreinterpret_p64_s32(*v0); }
void VreinterpretP64S64(poly64x1_t* r, int64x1_t* v0) { *r = vreinterpret_p64_s64(*v0); }
void VreinterpretP64U8(poly64x1_t* r, uint8x8_t* v0) { *r = vreinterpret_p64_u8(*v0); }
void VreinterpretP64U16(poly64x1_t* r, uint16x4_t* v0) { *r = vreinterpret_p64_u16(*v0); }
void VreinterpretP64U32(poly64x1_t* r, uint32x2_t* v0) { *r = vreinterpret_p64_u32(*v0); }
void VreinterpretP64U64(poly64x1_t* r, uint64x1_t* v0) { *r = vreinterpret_p64_u64(*v0); }
void VreinterpretP64F32(poly64x1_t* r, float32x2_t* v0) { *r = vreinterpret_p64_f32(*v0); }
void VreinterpretP64F64(poly64x1_t* r, float64x1_t* v0) { *r = vreinterpret_p64_f64(*v0); }
void VreinterpretP64P16(poly64x1_t* r, poly16x4_t* v0) { *r = vreinterpret_p64_p16(*v0); }
void VreinterpretP64P8(poly64x1_t* r, poly8x8_t* v0) { *r = vreinterpret_p64_p8(*v0); }
void VreinterpretP8S8(poly8x8_t* r, int8x8_t* v0) { *r = vreinterpret_p8_s8(*v0); }
void VreinterpretP8S16(poly8x8_t* r, int16x4_t* v0) { *r = vreinterpret_p8_s16(*v0); }
void VreinterpretP8S32(poly8x8_t* r, int32x2_t* v0) { *r = vreinterpret_p8_s32(*v0); }
void VreinterpretP8S64(poly8x8_t* r, int64x1_t* v0) { *r = vreinterpret_p8_s64(*v0); }
void VreinterpretP8U8(poly8x8_t* r, uint8x8_t* v0) { *r = vreinterpret_p8_u8(*v0); }
void VreinterpretP8U16(poly8x8_t* r, uint16x4_t* v0) { *r = vreinterpret_p8_u16(*v0); }
void VreinterpretP8U32(poly8x8_t* r, uint32x2_t* v0) { *r = vreinterpret_p8_u32(*v0); }
void VreinterpretP8U64(poly8x8_t* r, uint64x1_t* v0) { *r = vreinterpret_p8_u64(*v0); }
void VreinterpretP8F32(poly8x8_t* r, float32x2_t* v0) { *r = vreinterpret_p8_f32(*v0); }
void VreinterpretP8F64(poly8x8_t* r, float64x1_t* v0) { *r = vreinterpret_p8_f64(*v0); }
void VreinterpretP8P16(poly8x8_t* r, poly16x4_t* v0) { *r = vreinterpret_p8_p16(*v0); }
void VreinterpretP8P64(poly8x8_t* r, poly64x1_t* v0) { *r = vreinterpret_p8_p64(*v0); }
void VreinterpretS16S8(int16x4_t* r, int8x8_t* v0) { *r = vreinterpret_s16_s8(*v0); }
void VreinterpretS16S32(int16x4_t* r, int32x2_t* v0) { *r = vreinterpret_s16_s32(*v0); }
void VreinterpretS16S64(int16x4_t* r, int64x1_t* v0) { *r = vreinterpret_s16_s64(*v0); }
void VreinterpretS16U8(int16x4_t* r, uint8x8_t* v0) { *r = vreinterpret_s16_u8(*v0); }
void VreinterpretS16U16(int16x4_t* r, uint16x4_t* v0) { *r = vreinterpret_s16_u16(*v0); }
void VreinterpretS16U32(int16x4_t* r, uint32x2_t* v0) { *r = vreinterpret_s16_u32(*v0); }
void VreinterpretS16U64(int16x4_t* r, uint64x1_t* v0) { *r = vreinterpret_s16_u64(*v0); }
void VreinterpretS16F32(int16x4_t* r, float32x2_t* v0) { *r = vreinterpret_s16_f32(*v0); }
void VreinterpretS16F64(int16x4_t* r, float64x1_t* v0) { *r = vreinterpret_s16_f64(*v0); }
void VreinterpretS16P16(int16x4_t* r, poly16x4_t* v0) { *r = vreinterpret_s16_p16(*v0); }
void VreinterpretS16P64(int16x4_t* r, poly64x1_t* v0) { *r = vreinterpret_s16_p64(*v0); }
void VreinterpretS16P8(int16x4_t* r, poly8x8_t* v0) { *r = vreinterpret_s16_p8(*v0); }
void VreinterpretS32S8(int32x2_t* r, int8x8_t* v0) { *r = vreinterpret_s32_s8(*v0); }
void VreinterpretS32S16(int32x2_t* r, int16x4_t* v0) { *r = vreinterpret_s32_s16(*v0); }
void VreinterpretS32S64(int32x2_t* r, int64x1_t* v0) { *r = vreinterpret_s32_s64(*v0); }
void VreinterpretS32U8(int32x2_t* r, uint8x8_t* v0) { *r = vreinterpret_s32_u8(*v0); }
void VreinterpretS32U16(int32x2_t* r, uint16x4_t* v0) { *r = vreinterpret_s32_u16(*v0); }
void VreinterpretS32U32(int32x2_t* r, uint32x2_t* v0) { *r = vreinterpret_s32_u32(*v0); }
void VreinterpretS32U64(int32x2_t* r, uint64x1_t* v0) { *r = vreinterpret_s32_u64(*v0); }
void VreinterpretS32F32(int32x2_t* r, float32x2_t* v0) { *r = vreinterpret_s32_f32(*v0); }
void VreinterpretS32F64(int32x2_t* r, float64x1_t* v0) { *r = vreinterpret_s32_f64(*v0); }
void VreinterpretS32P16(int32x2_t* r, poly16x4_t* v0) { *r = vreinterpret_s32_p16(*v0); }
void VreinterpretS32P64(int32x2_t* r, poly64x1_t* v0) { *r = vreinterpret_s32_p64(*v0); }
void VreinterpretS32P8(int32x2_t* r, poly8x8_t* v0) { *r = vreinterpret_s32_p8(*v0); }
void VreinterpretS64S8(int64x1_t* r, int8x8_t* v0) { *r = vreinterpret_s64_s8(*v0); }
void VreinterpretS64S16(int64x1_t* r, int16x4_t* v0) { *r = vreinterpret_s64_s16(*v0); }
void VreinterpretS64S32(int64x1_t* r, int32x2_t* v0) { *r = vreinterpret_s64_s32(*v0); }
void VreinterpretS64U8(int64x1_t* r, uint8x8_t* v0) { *r = vreinterpret_s64_u8(*v0); }
void VreinterpretS64U16(int64x1_t* r, uint16x4_t* v0) { *r = vreinterpret_s64_u16(*v0); }
void VreinterpretS64U32(int64x1_t* r, uint32x2_t* v0) { *r = vreinterpret_s64_u32(*v0); }
void VreinterpretS64U64(int64x1_t* r, uint64x1_t* v0) { *r = vreinterpret_s64_u64(*v0); }
void VreinterpretS64F32(int64x1_t* r, float32x2_t* v0) { *r = vreinterpret_s64_f32(*v0); }
void VreinterpretS64F64(int64x1_t* r, float64x1_t* v0) { *r = vreinterpret_s64_f64(*v0); }
void VreinterpretS64P16(int64x1_t* r, poly16x4_t* v0) { *r = vreinterpret_s64_p16(*v0); }
void VreinterpretS64P64(int64x1_t* r, poly64x1_t* v0) { *r = vreinterpret_s64_p64(*v0); }
void VreinterpretS64P8(int64x1_t* r, poly8x8_t* v0) { *r = vreinterpret_s64_p8(*v0); }
void VreinterpretS8S16(int8x8_t* r, int16x4_t* v0) { *r = vreinterpret_s8_s16(*v0); }
void VreinterpretS8S32(int8x8_t* r, int32x2_t* v0) { *r = vreinterpret_s8_s32(*v0); }
void VreinterpretS8S64(int8x8_t* r, int64x1_t* v0) { *r = vreinterpret_s8_s64(*v0); }
void VreinterpretS8U8(int8x8_t* r, uint8x8_t* v0) { *r = vreinterpret_s8_u8(*v0); }
void VreinterpretS8U16(int8x8_t* r, uint16x4_t* v0) { *r = vreinterpret_s8_u16(*v0); }
void VreinterpretS8U32(int8x8_t* r, uint32x2_t* v0) { *r = vreinterpret_s8_u32(*v0); }
void VreinterpretS8U64(int8x8_t* r, uint64x1_t* v0) { *r = vreinterpret_s8_u64(*v0); }
void VreinterpretS8F32(int8x8_t* r, float32x2_t* v0) { *r = vreinterpret_s8_f32(*v0); }
void VreinterpretS8F64(int8x8_t* r, float64x1_t* v0) { *r = vreinterpret_s8_f64(*v0); }
void VreinterpretS8P16(int8x8_t* r, poly16x4_t* v0) { *r = vreinterpret_s8_p16(*v0); }
void VreinterpretS8P64(int8x8_t* r, poly64x1_t* v0) { *r = vreinterpret_s8_p64(*v0); }
void VreinterpretS8P8(int8x8_t* r, poly8x8_t* v0) { *r = vreinterpret_s8_p8(*v0); }
void VreinterpretU16S8(uint16x4_t* r, int8x8_t* v0) { *r = vreinterpret_u16_s8(*v0); }
void VreinterpretU16S16(uint16x4_t* r, int16x4_t* v0) { *r = vreinterpret_u16_s16(*v0); }
void VreinterpretU16S32(uint16x4_t* r, int32x2_t* v0) { *r = vreinterpret_u16_s32(*v0); }
void VreinterpretU16S64(uint16x4_t* r, int64x1_t* v0) { *r = vreinterpret_u16_s64(*v0); }
void VreinterpretU16U8(uint16x4_t* r, uint8x8_t* v0) { *r = vreinterpret_u16_u8(*v0); }
void VreinterpretU16U32(uint16x4_t* r, uint32x2_t* v0) { *r = vreinterpret_u16_u32(*v0); }
void VreinterpretU16U64(uint16x4_t* r, uint64x1_t* v0) { *r = vreinterpret_u16_u64(*v0); }
void VreinterpretU16F32(uint16x4_t* r, float32x2_t* v0) { *r = vreinterpret_u16_f32(*v0); }
void VreinterpretU16F64(uint16x4_t* r, float64x1_t* v0) { *r = vreinterpret_u16_f64(*v0); }
void VreinterpretU16P16(uint16x4_t* r, poly16x4_t* v0) { *r = vreinterpret_u16_p16(*v0); }
void VreinterpretU16P64(uint16x4_t* r, poly64x1_t* v0) { *r = vreinterpret_u16_p64(*v0); }
void VreinterpretU16P8(uint16x4_t* r, poly8x8_t* v0) { *r = vreinterpret_u16_p8(*v0); }
void VreinterpretU32S8(uint32x2_t* r, int8x8_t* v0) { *r = vreinterpret_u32_s8(*v0); }
void VreinterpretU32S16(uint32x2_t* r, int16x4_t* v0) { *r = vreinterpret_u32_s16(*v0); }
void VreinterpretU32S32(uint32x2_t* r, int32x2_t* v0) { *r = vreinterpret_u32_s32(*v0); }
void VreinterpretU32S64(uint32x2_t* r, int64x1_t* v0) { *r = vreinterpret_u32_s64(*v0); }
void VreinterpretU32U8(uint32x2_t* r, uint8x8_t* v0) { *r = vreinterpret_u32_u8(*v0); }
void VreinterpretU32U16(uint32x2_t* r, uint16x4_t* v0) { *r = vreinterpret_u32_u16(*v0); }
void VreinterpretU32U64(uint32x2_t* r, uint64x1_t* v0) { *r = vreinterpret_u32_u64(*v0); }
void VreinterpretU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vreinterpret_u32_f32(*v0); }
void VreinterpretU32F64(uint32x2_t* r, float64x1_t* v0) { *r = vreinterpret_u32_f64(*v0); }
void VreinterpretU32P16(uint32x2_t* r, poly16x4_t* v0) { *r = vreinterpret_u32_p16(*v0); }
void VreinterpretU32P64(uint32x2_t* r, poly64x1_t* v0) { *r = vreinterpret_u32_p64(*v0); }
void VreinterpretU32P8(uint32x2_t* r, poly8x8_t* v0) { *r = vreinterpret_u32_p8(*v0); }
void VreinterpretU64S8(uint64x1_t* r, int8x8_t* v0) { *r = vreinterpret_u64_s8(*v0); }
void VreinterpretU64S16(uint64x1_t* r, int16x4_t* v0) { *r = vreinterpret_u64_s16(*v0); }
void VreinterpretU64S32(uint64x1_t* r, int32x2_t* v0) { *r = vreinterpret_u64_s32(*v0); }
void VreinterpretU64S64(uint64x1_t* r, int64x1_t* v0) { *r = vreinterpret_u64_s64(*v0); }
void VreinterpretU64U8(uint64x1_t* r, uint8x8_t* v0) { *r = vreinterpret_u64_u8(*v0); }
void VreinterpretU64U16(uint64x1_t* r, uint16x4_t* v0) { *r = vreinterpret_u64_u16(*v0); }
void VreinterpretU64U32(uint64x1_t* r, uint32x2_t* v0) { *r = vreinterpret_u64_u32(*v0); }
void VreinterpretU64F32(uint64x1_t* r, float32x2_t* v0) { *r = vreinterpret_u64_f32(*v0); }
void VreinterpretU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vreinterpret_u64_f64(*v0); }
void VreinterpretU64P16(uint64x1_t* r, poly16x4_t* v0) { *r = vreinterpret_u64_p16(*v0); }
void VreinterpretU64P64(uint64x1_t* r, poly64x1_t* v0) { *r = vreinterpret_u64_p64(*v0); }
void VreinterpretU64P8(uint64x1_t* r, poly8x8_t* v0) { *r = vreinterpret_u64_p8(*v0); }
void VreinterpretU8S8(uint8x8_t* r, int8x8_t* v0) { *r = vreinterpret_u8_s8(*v0); }
void VreinterpretU8S16(uint8x8_t* r, int16x4_t* v0) { *r = vreinterpret_u8_s16(*v0); }
void VreinterpretU8S32(uint8x8_t* r, int32x2_t* v0) { *r = vreinterpret_u8_s32(*v0); }
void VreinterpretU8S64(uint8x8_t* r, int64x1_t* v0) { *r = vreinterpret_u8_s64(*v0); }
void VreinterpretU8U16(uint8x8_t* r, uint16x4_t* v0) { *r = vreinterpret_u8_u16(*v0); }
void VreinterpretU8U32(uint8x8_t* r, uint32x2_t* v0) { *r = vreinterpret_u8_u32(*v0); }
void VreinterpretU8U64(uint8x8_t* r, uint64x1_t* v0) { *r = vreinterpret_u8_u64(*v0); }
void VreinterpretU8F32(uint8x8_t* r, float32x2_t* v0) { *r = vreinterpret_u8_f32(*v0); }
void VreinterpretU8F64(uint8x8_t* r, float64x1_t* v0) { *r = vreinterpret_u8_f64(*v0); }
void VreinterpretU8P16(uint8x8_t* r, poly16x4_t* v0) { *r = vreinterpret_u8_p16(*v0); }
void VreinterpretU8P64(uint8x8_t* r, poly64x1_t* v0) { *r = vreinterpret_u8_p64(*v0); }
void VreinterpretU8P8(uint8x8_t* r, poly8x8_t* v0) { *r = vreinterpret_u8_p8(*v0); }
void VreinterpretqF32S8(float32x4_t* r, int8x16_t* v0) { *r = vreinterpretq_f32_s8(*v0); }
void VreinterpretqF32S16(float32x4_t* r, int16x8_t* v0) { *r = vreinterpretq_f32_s16(*v0); }
void VreinterpretqF32S32(float32x4_t* r, int32x4_t* v0) { *r = vreinterpretq_f32_s32(*v0); }
void VreinterpretqF32S64(float32x4_t* r, int64x2_t* v0) { *r = vreinterpretq_f32_s64(*v0); }
void VreinterpretqF32U8(float32x4_t* r, uint8x16_t* v0) { *r = vreinterpretq_f32_u8(*v0); }
void VreinterpretqF32U16(float32x4_t* r, uint16x8_t* v0) { *r = vreinterpretq_f32_u16(*v0); }
void VreinterpretqF32U32(float32x4_t* r, uint32x4_t* v0) { *r = vreinterpretq_f32_u32(*v0); }
void VreinterpretqF32U64(float32x4_t* r, uint64x2_t* v0) { *r = vreinterpretq_f32_u64(*v0); }
void VreinterpretqF32F64(float32x4_t* r, float64x2_t* v0) { *r = vreinterpretq_f32_f64(*v0); }
void VreinterpretqF32P128(float32x4_t* r, poly128_t* v0) { *r = vreinterpretq_f32_p128(*v0); }
void VreinterpretqF32P16(float32x4_t* r, poly16x8_t* v0) { *r = vreinterpretq_f32_p16(*v0); }
void VreinterpretqF32P64(float32x4_t* r, poly64x2_t* v0) { *r = vreinterpretq_f32_p64(*v0); }
void VreinterpretqF32P8(float32x4_t* r, poly8x16_t* v0) { *r = vreinterpretq_f32_p8(*v0); }
void VreinterpretqF64S8(float64x2_t* r, int8x16_t* v0) { *r = vreinterpretq_f64_s8(*v0); }
void VreinterpretqF64S16(float64x2_t* r, int16x8_t* v0) { *r = vreinterpretq_f64_s16(*v0); }
void VreinterpretqF64S32(float64x2_t* r, int32x4_t* v0) { *r = vreinterpretq_f64_s32(*v0); }
void VreinterpretqF64S64(float64x2_t* r, int64x2_t* v0) { *r = vreinterpretq_f64_s64(*v0); }
void VreinterpretqF64U8(float64x2_t* r, uint8x16_t* v0) { *r = vreinterpretq_f64_u8(*v0); }
void VreinterpretqF64U16(float64x2_t* r, uint16x8_t* v0) { *r = vreinterpretq_f64_u16(*v0); }
void VreinterpretqF64U32(float64x2_t* r, uint32x4_t* v0) { *r = vreinterpretq_f64_u32(*v0); }
void VreinterpretqF64U64(float64x2_t* r, uint64x2_t* v0) { *r = vreinterpretq_f64_u64(*v0); }
void VreinterpretqF64F32(float64x2_t* r, float32x4_t* v0) { *r = vreinterpretq_f64_f32(*v0); }
void VreinterpretqF64P128(float64x2_t* r, poly128_t* v0) { *r = vreinterpretq_f64_p128(*v0); }
void VreinterpretqF64P16(float64x2_t* r, poly16x8_t* v0) { *r = vreinterpretq_f64_p16(*v0); }
void VreinterpretqF64P64(float64x2_t* r, poly64x2_t* v0) { *r = vreinterpretq_f64_p64(*v0); }
void VreinterpretqF64P8(float64x2_t* r, poly8x16_t* v0) { *r = vreinterpretq_f64_p8(*v0); }
void VreinterpretqP128S8(poly128_t* r, int8x16_t* v0) { *r = vreinterpretq_p128_s8(*v0); }
void VreinterpretqP128S16(poly128_t* r, int16x8_t* v0) { *r = vreinterpretq_p128_s16(*v0); }
void VreinterpretqP128S32(poly128_t* r, int32x4_t* v0) { *r = vreinterpretq_p128_s32(*v0); }
void VreinterpretqP128S64(poly128_t* r, int64x2_t* v0) { *r = vreinterpretq_p128_s64(*v0); }
void VreinterpretqP128U8(poly128_t* r, uint8x16_t* v0) { *r = vreinterpretq_p128_u8(*v0); }
void VreinterpretqP128U16(poly128_t* r, uint16x8_t* v0) { *r = vreinterpretq_p128_u16(*v0); }
void VreinterpretqP128U32(poly128_t* r, uint32x4_t* v0) { *r = vreinterpretq_p128_u32(*v0); }
void VreinterpretqP128U64(poly128_t* r, uint64x2_t* v0) { *r = vreinterpretq_p128_u64(*v0); }
void VreinterpretqP128F32(poly128_t* r, float32x4_t* v0) { *r = vreinterpretq_p128_f32(*v0); }
void VreinterpretqP128F64(poly128_t* r, float64x2_t* v0) { *r = vreinterpretq_p128_f64(*v0); }
void VreinterpretqP128P16(poly128_t* r, poly16x8_t* v0) { *r = vreinterpretq_p128_p16(*v0); }
void VreinterpretqP128P64(poly128_t* r, poly64x2_t* v0) { *r = vreinterpretq_p128_p64(*v0); }
void VreinterpretqP128P8(poly128_t* r, poly8x16_t* v0) { *r = vreinterpretq_p128_p8(*v0); }
void VreinterpretqP16S8(poly16x8_t* r, int8x16_t* v0) { *r = vreinterpretq_p16_s8(*v0); }
void VreinterpretqP16S16(poly16x8_t* r, int16x8_t* v0) { *r = vreinterpretq_p16_s16(*v0); }
void VreinterpretqP16S32(poly16x8_t* r, int32x4_t* v0) { *r = vreinterpretq_p16_s32(*v0); }
void VreinterpretqP16S64(poly16x8_t* r, int64x2_t* v0) { *r = vreinterpretq_p16_s64(*v0); }
void VreinterpretqP16U8(poly16x8_t* r, uint8x16_t* v0) { *r = vreinterpretq_p16_u8(*v0); }
void VreinterpretqP16U16(poly16x8_t* r, uint16x8_t* v0) { *r = vreinterpretq_p16_u16(*v0); }
void VreinterpretqP16U32(poly16x8_t* r, uint32x4_t* v0) { *r = vreinterpretq_p16_u32(*v0); }
void VreinterpretqP16U64(poly16x8_t* r, uint64x2_t* v0) { *r = vreinterpretq_p16_u64(*v0); }
void VreinterpretqP16F32(poly16x8_t* r, float32x4_t* v0) { *r = vreinterpretq_p16_f32(*v0); }
void VreinterpretqP16F64(poly16x8_t* r, float64x2_t* v0) { *r = vreinterpretq_p16_f64(*v0); }
void VreinterpretqP16P128(poly16x8_t* r, poly128_t* v0) { *r = vreinterpretq_p16_p128(*v0); }
void VreinterpretqP16P64(poly16x8_t* r, poly64x2_t* v0) { *r = vreinterpretq_p16_p64(*v0); }
void VreinterpretqP16P8(poly16x8_t* r, poly8x16_t* v0) { *r = vreinterpretq_p16_p8(*v0); }
void VreinterpretqP64S8(poly64x2_t* r, int8x16_t* v0) { *r = vreinterpretq_p64_s8(*v0); }
void VreinterpretqP64S16(poly64x2_t* r, int16x8_t* v0) { *r = vreinterpretq_p64_s16(*v0); }
void VreinterpretqP64S32(poly64x2_t* r, int32x4_t* v0) { *r = vreinterpretq_p64_s32(*v0); }
void VreinterpretqP64S64(poly64x2_t* r, int64x2_t* v0) { *r = vreinterpretq_p64_s64(*v0); }
void VreinterpretqP64U8(poly64x2_t* r, uint8x16_t* v0) { *r = vreinterpretq_p64_u8(*v0); }
void VreinterpretqP64U16(poly64x2_t* r, uint16x8_t* v0) { *r = vreinterpretq_p64_u16(*v0); }
void VreinterpretqP64U32(poly64x2_t* r, uint32x4_t* v0) { *r = vreinterpretq_p64_u32(*v0); }
void VreinterpretqP64U64(poly64x2_t* r, uint64x2_t* v0) { *r = vreinterpretq_p64_u64(*v0); }
void VreinterpretqP64F32(poly64x2_t* r, float32x4_t* v0) { *r = vreinterpretq_p64_f32(*v0); }
void VreinterpretqP64F64(poly64x2_t* r, float64x2_t* v0) { *r = vreinterpretq_p64_f64(*v0); }
void VreinterpretqP64P128(poly64x2_t* r, poly128_t* v0) { *r = vreinterpretq_p64_p128(*v0); }
void VreinterpretqP64P16(poly64x2_t* r, poly16x8_t* v0) { *r = vreinterpretq_p64_p16(*v0); }
void VreinterpretqP64P8(poly64x2_t* r, poly8x16_t* v0) { *r = vreinterpretq_p64_p8(*v0); }
void VreinterpretqP8S8(poly8x16_t* r, int8x16_t* v0) { *r = vreinterpretq_p8_s8(*v0); }
void VreinterpretqP8S16(poly8x16_t* r, int16x8_t* v0) { *r = vreinterpretq_p8_s16(*v0); }
void VreinterpretqP8S32(poly8x16_t* r, int32x4_t* v0) { *r = vreinterpretq_p8_s32(*v0); }
void VreinterpretqP8S64(poly8x16_t* r, int64x2_t* v0) { *r = vreinterpretq_p8_s64(*v0); }
void VreinterpretqP8U8(poly8x16_t* r, uint8x16_t* v0) { *r = vreinterpretq_p8_u8(*v0); }
void VreinterpretqP8U16(poly8x16_t* r, uint16x8_t* v0) { *r = vreinterpretq_p8_u16(*v0); }
void VreinterpretqP8U32(poly8x16_t* r, uint32x4_t* v0) { *r = vreinterpretq_p8_u32(*v0); }
void VreinterpretqP8U64(poly8x16_t* r, uint64x2_t* v0) { *r = vreinterpretq_p8_u64(*v0); }
void VreinterpretqP8F32(poly8x16_t* r, float32x4_t* v0) { *r = vreinterpretq_p8_f32(*v0); }
void VreinterpretqP8F64(poly8x16_t* r, float64x2_t* v0) { *r = vreinterpretq_p8_f64(*v0); }
void VreinterpretqP8P128(poly8x16_t* r, poly128_t* v0) { *r = vreinterpretq_p8_p128(*v0); }
void VreinterpretqP8P16(poly8x16_t* r, poly16x8_t* v0) { *r = vreinterpretq_p8_p16(*v0); }
void VreinterpretqP8P64(poly8x16_t* r, poly64x2_t* v0) { *r = vreinterpretq_p8_p64(*v0); }
void VreinterpretqS16S8(int16x8_t* r, int8x16_t* v0) { *r = vreinterpretq_s16_s8(*v0); }
void VreinterpretqS16S32(int16x8_t* r, int32x4_t* v0) { *r = vreinterpretq_s16_s32(*v0); }
void VreinterpretqS16S64(int16x8_t* r, int64x2_t* v0) { *r = vreinterpretq_s16_s64(*v0); }
void VreinterpretqS16U8(int16x8_t* r, uint8x16_t* v0) { *r = vreinterpretq_s16_u8(*v0); }
void VreinterpretqS16U16(int16x8_t* r, uint16x8_t* v0) { *r = vreinterpretq_s16_u16(*v0); }
void VreinterpretqS16U32(int16x8_t* r, uint32x4_t* v0) { *r = vreinterpretq_s16_u32(*v0); }
void VreinterpretqS16U64(int16x8_t* r, uint64x2_t* v0) { *r = vreinterpretq_s16_u64(*v0); }
void VreinterpretqS16F32(int16x8_t* r, float32x4_t* v0) { *r = vreinterpretq_s16_f32(*v0); }
void VreinterpretqS16F64(int16x8_t* r, float64x2_t* v0) { *r = vreinterpretq_s16_f64(*v0); }
void VreinterpretqS16P128(int16x8_t* r, poly128_t* v0) { *r = vreinterpretq_s16_p128(*v0); }
void VreinterpretqS16P16(int16x8_t* r, poly16x8_t* v0) { *r = vreinterpretq_s16_p16(*v0); }
void VreinterpretqS16P64(int16x8_t* r, poly64x2_t* v0) { *r = vreinterpretq_s16_p64(*v0); }
void VreinterpretqS16P8(int16x8_t* r, poly8x16_t* v0) { *r = vreinterpretq_s16_p8(*v0); }
void VreinterpretqS32S8(int32x4_t* r, int8x16_t* v0) { *r = vreinterpretq_s32_s8(*v0); }
void VreinterpretqS32S16(int32x4_t* r, int16x8_t* v0) { *r = vreinterpretq_s32_s16(*v0); }
void VreinterpretqS32S64(int32x4_t* r, int64x2_t* v0) { *r = vreinterpretq_s32_s64(*v0); }
void VreinterpretqS32U8(int32x4_t* r, uint8x16_t* v0) { *r = vreinterpretq_s32_u8(*v0); }
void VreinterpretqS32U16(int32x4_t* r, uint16x8_t* v0) { *r = vreinterpretq_s32_u16(*v0); }
void VreinterpretqS32U32(int32x4_t* r, uint32x4_t* v0) { *r = vreinterpretq_s32_u32(*v0); }
void VreinterpretqS32U64(int32x4_t* r, uint64x2_t* v0) { *r = vreinterpretq_s32_u64(*v0); }
void VreinterpretqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vreinterpretq_s32_f32(*v0); }
void VreinterpretqS32F64(int32x4_t* r, float64x2_t* v0) { *r = vreinterpretq_s32_f64(*v0); }
void VreinterpretqS32P128(int32x4_t* r, poly128_t* v0) { *r = vreinterpretq_s32_p128(*v0); }
void VreinterpretqS32P16(int32x4_t* r, poly16x8_t* v0) { *r = vreinterpretq_s32_p16(*v0); }
void VreinterpretqS32P64(int32x4_t* r, poly64x2_t* v0) { *r = vreinterpretq_s32_p64(*v0); }
void VreinterpretqS32P8(int32x4_t* r, poly8x16_t* v0) { *r = vreinterpretq_s32_p8(*v0); }
void VreinterpretqS64S8(int64x2_t* r, int8x16_t* v0) { *r = vreinterpretq_s64_s8(*v0); }
void VreinterpretqS64S16(int64x2_t* r, int16x8_t* v0) { *r = vreinterpretq_s64_s16(*v0); }
void VreinterpretqS64S32(int64x2_t* r, int32x4_t* v0) { *r = vreinterpretq_s64_s32(*v0); }
void VreinterpretqS64U8(int64x2_t* r, uint8x16_t* v0) { *r = vreinterpretq_s64_u8(*v0); }
void VreinterpretqS64U16(int64x2_t* r, uint16x8_t* v0) { *r = vreinterpretq_s64_u16(*v0); }
void VreinterpretqS64U32(int64x2_t* r, uint32x4_t* v0) { *r = vreinterpretq_s64_u32(*v0); }
void VreinterpretqS64U64(int64x2_t* r, uint64x2_t* v0) { *r = vreinterpretq_s64_u64(*v0); }
void VreinterpretqS64F32(int64x2_t* r, float32x4_t* v0) { *r = vreinterpretq_s64_f32(*v0); }
void VreinterpretqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vreinterpretq_s64_f64(*v0); }
void VreinterpretqS64P128(int64x2_t* r, poly128_t* v0) { *r = vreinterpretq_s64_p128(*v0); }
void VreinterpretqS64P16(int64x2_t* r, poly16x8_t* v0) { *r = vreinterpretq_s64_p16(*v0); }
void VreinterpretqS64P64(int64x2_t* r, poly64x2_t* v0) { *r = vreinterpretq_s64_p64(*v0); }
void VreinterpretqS64P8(int64x2_t* r, poly8x16_t* v0) { *r = vreinterpretq_s64_p8(*v0); }
void VreinterpretqS8S16(int8x16_t* r, int16x8_t* v0) { *r = vreinterpretq_s8_s16(*v0); }
void VreinterpretqS8S32(int8x16_t* r, int32x4_t* v0) { *r = vreinterpretq_s8_s32(*v0); }
void VreinterpretqS8S64(int8x16_t* r, int64x2_t* v0) { *r = vreinterpretq_s8_s64(*v0); }
void VreinterpretqS8U8(int8x16_t* r, uint8x16_t* v0) { *r = vreinterpretq_s8_u8(*v0); }
void VreinterpretqS8U16(int8x16_t* r, uint16x8_t* v0) { *r = vreinterpretq_s8_u16(*v0); }
void VreinterpretqS8U32(int8x16_t* r, uint32x4_t* v0) { *r = vreinterpretq_s8_u32(*v0); }
void VreinterpretqS8U64(int8x16_t* r, uint64x2_t* v0) { *r = vreinterpretq_s8_u64(*v0); }
void VreinterpretqS8F32(int8x16_t* r, float32x4_t* v0) { *r = vreinterpretq_s8_f32(*v0); }
void VreinterpretqS8F64(int8x16_t* r, float64x2_t* v0) { *r = vreinterpretq_s8_f64(*v0); }
void VreinterpretqS8P128(int8x16_t* r, poly128_t* v0) { *r = vreinterpretq_s8_p128(*v0); }
void VreinterpretqS8P16(int8x16_t* r, poly16x8_t* v0) { *r = vreinterpretq_s8_p16(*v0); }
void VreinterpretqS8P64(int8x16_t* r, poly64x2_t* v0) { *r = vreinterpretq_s8_p64(*v0); }
void VreinterpretqS8P8(int8x16_t* r, poly8x16_t* v0) { *r = vreinterpretq_s8_p8(*v0); }
void VreinterpretqU16S8(uint16x8_t* r, int8x16_t* v0) { *r = vreinterpretq_u16_s8(*v0); }
void VreinterpretqU16S16(uint16x8_t* r, int16x8_t* v0) { *r = vreinterpretq_u16_s16(*v0); }
void VreinterpretqU16S32(uint16x8_t* r, int32x4_t* v0) { *r = vreinterpretq_u16_s32(*v0); }
void VreinterpretqU16S64(uint16x8_t* r, int64x2_t* v0) { *r = vreinterpretq_u16_s64(*v0); }
void VreinterpretqU16U8(uint16x8_t* r, uint8x16_t* v0) { *r = vreinterpretq_u16_u8(*v0); }
void VreinterpretqU16U32(uint16x8_t* r, uint32x4_t* v0) { *r = vreinterpretq_u16_u32(*v0); }
void VreinterpretqU16U64(uint16x8_t* r, uint64x2_t* v0) { *r = vreinterpretq_u16_u64(*v0); }
void VreinterpretqU16F32(uint16x8_t* r, float32x4_t* v0) { *r = vreinterpretq_u16_f32(*v0); }
void VreinterpretqU16F64(uint16x8_t* r, float64x2_t* v0) { *r = vreinterpretq_u16_f64(*v0); }
void VreinterpretqU16P128(uint16x8_t* r, poly128_t* v0) { *r = vreinterpretq_u16_p128(*v0); }
void VreinterpretqU16P16(uint16x8_t* r, poly16x8_t* v0) { *r = vreinterpretq_u16_p16(*v0); }
void VreinterpretqU16P64(uint16x8_t* r, poly64x2_t* v0) { *r = vreinterpretq_u16_p64(*v0); }
void VreinterpretqU16P8(uint16x8_t* r, poly8x16_t* v0) { *r = vreinterpretq_u16_p8(*v0); }
void VreinterpretqU32S8(uint32x4_t* r, int8x16_t* v0) { *r = vreinterpretq_u32_s8(*v0); }
void VreinterpretqU32S16(uint32x4_t* r, int16x8_t* v0) { *r = vreinterpretq_u32_s16(*v0); }
void VreinterpretqU32S32(uint32x4_t* r, int32x4_t* v0) { *r = vreinterpretq_u32_s32(*v0); }
void VreinterpretqU32S64(uint32x4_t* r, int64x2_t* v0) { *r = vreinterpretq_u32_s64(*v0); }
void VreinterpretqU32U8(uint32x4_t* r, uint8x16_t* v0) { *r = vreinterpretq_u32_u8(*v0); }
void VreinterpretqU32U16(uint32x4_t* r, uint16x8_t* v0) { *r = vreinterpretq_u32_u16(*v0); }
void VreinterpretqU32U64(uint32x4_t* r, uint64x2_t* v0) { *r = vreinterpretq_u32_u64(*v0); }
void VreinterpretqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vreinterpretq_u32_f32(*v0); }
void VreinterpretqU32F64(uint32x4_t* r, float64x2_t* v0) { *r = vreinterpretq_u32_f64(*v0); }
void VreinterpretqU32P128(uint32x4_t* r, poly128_t* v0) { *r = vreinterpretq_u32_p128(*v0); }
void VreinterpretqU32P16(uint32x4_t* r, poly16x8_t* v0) { *r = vreinterpretq_u32_p16(*v0); }
void VreinterpretqU32P64(uint32x4_t* r, poly64x2_t* v0) { *r = vreinterpretq_u32_p64(*v0); }
void VreinterpretqU32P8(uint32x4_t* r, poly8x16_t* v0) { *r = vreinterpretq_u32_p8(*v0); }
void VreinterpretqU64S8(uint64x2_t* r, int8x16_t* v0) { *r = vreinterpretq_u64_s8(*v0); }
void VreinterpretqU64S16(uint64x2_t* r, int16x8_t* v0) { *r = vreinterpretq_u64_s16(*v0); }
void VreinterpretqU64S32(uint64x2_t* r, int32x4_t* v0) { *r = vreinterpretq_u64_s32(*v0); }
void VreinterpretqU64S64(uint64x2_t* r, int64x2_t* v0) { *r = vreinterpretq_u64_s64(*v0); }
void VreinterpretqU64U8(uint64x2_t* r, uint8x16_t* v0) { *r = vreinterpretq_u64_u8(*v0); }
void VreinterpretqU64U16(uint64x2_t* r, uint16x8_t* v0) { *r = vreinterpretq_u64_u16(*v0); }
void VreinterpretqU64U32(uint64x2_t* r, uint32x4_t* v0) { *r = vreinterpretq_u64_u32(*v0); }
void VreinterpretqU64F32(uint64x2_t* r, float32x4_t* v0) { *r = vreinterpretq_u64_f32(*v0); }
void VreinterpretqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vreinterpretq_u64_f64(*v0); }
void VreinterpretqU64P128(uint64x2_t* r, poly128_t* v0) { *r = vreinterpretq_u64_p128(*v0); }
void VreinterpretqU64P16(uint64x2_t* r, poly16x8_t* v0) { *r = vreinterpretq_u64_p16(*v0); }
void VreinterpretqU64P64(uint64x2_t* r, poly64x2_t* v0) { *r = vreinterpretq_u64_p64(*v0); }
void VreinterpretqU64P8(uint64x2_t* r, poly8x16_t* v0) { *r = vreinterpretq_u64_p8(*v0); }
void VreinterpretqU8S8(uint8x16_t* r, int8x16_t* v0) { *r = vreinterpretq_u8_s8(*v0); }
void VreinterpretqU8S16(uint8x16_t* r, int16x8_t* v0) { *r = vreinterpretq_u8_s16(*v0); }
void VreinterpretqU8S32(uint8x16_t* r, int32x4_t* v0) { *r = vreinterpretq_u8_s32(*v0); }
void VreinterpretqU8S64(uint8x16_t* r, int64x2_t* v0) { *r = vreinterpretq_u8_s64(*v0); }
void VreinterpretqU8U16(uint8x16_t* r, uint16x8_t* v0) { *r = vreinterpretq_u8_u16(*v0); }
void VreinterpretqU8U32(uint8x16_t* r, uint32x4_t* v0) { *r = vreinterpretq_u8_u32(*v0); }
void VreinterpretqU8U64(uint8x16_t* r, uint64x2_t* v0) { *r = vreinterpretq_u8_u64(*v0); }
void VreinterpretqU8F32(uint8x16_t* r, float32x4_t* v0) { *r = vreinterpretq_u8_f32(*v0); }
void VreinterpretqU8F64(uint8x16_t* r, float64x2_t* v0) { *r = vreinterpretq_u8_f64(*v0); }
void VreinterpretqU8P128(uint8x16_t* r, poly128_t* v0) { *r = vreinterpretq_u8_p128(*v0); }
void VreinterpretqU8P16(uint8x16_t* r, poly16x8_t* v0) { *r = vreinterpretq_u8_p16(*v0); }
void VreinterpretqU8P64(uint8x16_t* r, poly64x2_t* v0) { *r = vreinterpretq_u8_p64(*v0); }
void VreinterpretqU8P8(uint8x16_t* r, poly8x16_t* v0) { *r = vreinterpretq_u8_p8(*v0); }
void Vrev16S8(int8x8_t* r, int8x8_t* v0) { *r = vrev16_s8(*v0); }
void Vrev16U8(uint8x8_t* r, uint8x8_t* v0) { *r = vrev16_u8(*v0); }
void Vrev16P8(poly8x8_t* r, poly8x8_t* v0) { *r = vrev16_p8(*v0); }
void Vrev16QS8(int8x16_t* r, int8x16_t* v0) { *r = vrev16q_s8(*v0); }
void Vrev16QU8(uint8x16_t* r, uint8x16_t* v0) { *r = vrev16q_u8(*v0); }
void Vrev16QP8(poly8x16_t* r, poly8x16_t* v0) { *r = vrev16q_p8(*v0); }
void Vrev32S8(int8x8_t* r, int8x8_t* v0) { *r = vrev32_s8(*v0); }
void Vrev32S16(int16x4_t* r, int16x4_t* v0) { *r = vrev32_s16(*v0); }
void Vrev32U8(uint8x8_t* r, uint8x8_t* v0) { *r = vrev32_u8(*v0); }
void Vrev32U16(uint16x4_t* r, uint16x4_t* v0) { *r = vrev32_u16(*v0); }
void Vrev32P16(poly16x4_t* r, poly16x4_t* v0) { *r = vrev32_p16(*v0); }
void Vrev32P8(poly8x8_t* r, poly8x8_t* v0) { *r = vrev32_p8(*v0); }
void Vrev32QS8(int8x16_t* r, int8x16_t* v0) { *r = vrev32q_s8(*v0); }
void Vrev32QS16(int16x8_t* r, int16x8_t* v0) { *r = vrev32q_s16(*v0); }
void Vrev32QU8(uint8x16_t* r, uint8x16_t* v0) { *r = vrev32q_u8(*v0); }
void Vrev32QU16(uint16x8_t* r, uint16x8_t* v0) { *r = vrev32q_u16(*v0); }
void Vrev32QP16(poly16x8_t* r, poly16x8_t* v0) { *r = vrev32q_p16(*v0); }
void Vrev32QP8(poly8x16_t* r, poly8x16_t* v0) { *r = vrev32q_p8(*v0); }
void Vrev64S8(int8x8_t* r, int8x8_t* v0) { *r = vrev64_s8(*v0); }
void Vrev64S16(int16x4_t* r, int16x4_t* v0) { *r = vrev64_s16(*v0); }
void Vrev64S32(int32x2_t* r, int32x2_t* v0) { *r = vrev64_s32(*v0); }
void Vrev64U8(uint8x8_t* r, uint8x8_t* v0) { *r = vrev64_u8(*v0); }
void Vrev64U16(uint16x4_t* r, uint16x4_t* v0) { *r = vrev64_u16(*v0); }
void Vrev64U32(uint32x2_t* r, uint32x2_t* v0) { *r = vrev64_u32(*v0); }
void Vrev64F32(float32x2_t* r, float32x2_t* v0) { *r = vrev64_f32(*v0); }
void Vrev64P16(poly16x4_t* r, poly16x4_t* v0) { *r = vrev64_p16(*v0); }
void Vrev64P8(poly8x8_t* r, poly8x8_t* v0) { *r = vrev64_p8(*v0); }
void Vrev64QS8(int8x16_t* r, int8x16_t* v0) { *r = vrev64q_s8(*v0); }
void Vrev64QS16(int16x8_t* r, int16x8_t* v0) { *r = vrev64q_s16(*v0); }
void Vrev64QS32(int32x4_t* r, int32x4_t* v0) { *r = vrev64q_s32(*v0); }
void Vrev64QU8(uint8x16_t* r, uint8x16_t* v0) { *r = vrev64q_u8(*v0); }
void Vrev64QU16(uint16x8_t* r, uint16x8_t* v0) { *r = vrev64q_u16(*v0); }
void Vrev64QU32(uint32x4_t* r, uint32x4_t* v0) { *r = vrev64q_u32(*v0); }
void Vrev64QF32(float32x4_t* r, float32x4_t* v0) { *r = vrev64q_f32(*v0); }
void Vrev64QP16(poly16x8_t* r, poly16x8_t* v0) { *r = vrev64q_p16(*v0); }
void Vrev64QP8(poly8x16_t* r, poly8x16_t* v0) { *r = vrev64q_p8(*v0); }
void VrhaddS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vrhadd_s8(*v0, *v1); }
void VrhaddS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vrhadd_s16(*v0, *v1); }
void VrhaddS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vrhadd_s32(*v0, *v1); }
void VrhaddU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vrhadd_u8(*v0, *v1); }
void VrhaddU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vrhadd_u16(*v0, *v1); }
void VrhaddU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vrhadd_u32(*v0, *v1); }
void VrhaddqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vrhaddq_s8(*v0, *v1); }
void VrhaddqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vrhaddq_s16(*v0, *v1); }
void VrhaddqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vrhaddq_s32(*v0, *v1); }
void VrhaddqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vrhaddq_u8(*v0, *v1); }
void VrhaddqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vrhaddq_u16(*v0, *v1); }
void VrhaddqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vrhaddq_u32(*v0, *v1); }
void VrndF32(float32x2_t* r, float32x2_t* v0) { *r = vrnd_f32(*v0); }
void VrndF64(float64x1_t* r, float64x1_t* v0) { *r = vrnd_f64(*v0); }
void Vrnd32XF32(float32x2_t* r, float32x2_t* v0) { *r = vrnd32x_f32(*v0); }
void Vrnd32XF64(float64x1_t* r, float64x1_t* v0) { *r = vrnd32x_f64(*v0); }
void Vrnd32XqF32(float32x4_t* r, float32x4_t* v0) { *r = vrnd32xq_f32(*v0); }
void Vrnd32XqF64(float64x2_t* r, float64x2_t* v0) { *r = vrnd32xq_f64(*v0); }
void Vrnd32ZF32(float32x2_t* r, float32x2_t* v0) { *r = vrnd32z_f32(*v0); }
void Vrnd32ZF64(float64x1_t* r, float64x1_t* v0) { *r = vrnd32z_f64(*v0); }
void Vrnd32ZqF32(float32x4_t* r, float32x4_t* v0) { *r = vrnd32zq_f32(*v0); }
void Vrnd32ZqF64(float64x2_t* r, float64x2_t* v0) { *r = vrnd32zq_f64(*v0); }
void Vrnd64XF32(float32x2_t* r, float32x2_t* v0) { *r = vrnd64x_f32(*v0); }
void Vrnd64XF64(float64x1_t* r, float64x1_t* v0) { *r = vrnd64x_f64(*v0); }
void Vrnd64XqF32(float32x4_t* r, float32x4_t* v0) { *r = vrnd64xq_f32(*v0); }
void Vrnd64XqF64(float64x2_t* r, float64x2_t* v0) { *r = vrnd64xq_f64(*v0); }
void Vrnd64ZF32(float32x2_t* r, float32x2_t* v0) { *r = vrnd64z_f32(*v0); }
void Vrnd64ZF64(float64x1_t* r, float64x1_t* v0) { *r = vrnd64z_f64(*v0); }
void Vrnd64ZqF32(float32x4_t* r, float32x4_t* v0) { *r = vrnd64zq_f32(*v0); }
void Vrnd64ZqF64(float64x2_t* r, float64x2_t* v0) { *r = vrnd64zq_f64(*v0); }
void VrndaF32(float32x2_t* r, float32x2_t* v0) { *r = vrnda_f32(*v0); }
void VrndaF64(float64x1_t* r, float64x1_t* v0) { *r = vrnda_f64(*v0); }
void VrndaqF32(float32x4_t* r, float32x4_t* v0) { *r = vrndaq_f32(*v0); }
void VrndaqF64(float64x2_t* r, float64x2_t* v0) { *r = vrndaq_f64(*v0); }
void VrndiF32(float32x2_t* r, float32x2_t* v0) { *r = vrndi_f32(*v0); }
void VrndiF64(float64x1_t* r, float64x1_t* v0) { *r = vrndi_f64(*v0); }
void VrndiqF32(float32x4_t* r, float32x4_t* v0) { *r = vrndiq_f32(*v0); }
void VrndiqF64(float64x2_t* r, float64x2_t* v0) { *r = vrndiq_f64(*v0); }
void VrndmF32(float32x2_t* r, float32x2_t* v0) { *r = vrndm_f32(*v0); }
void VrndmF64(float64x1_t* r, float64x1_t* v0) { *r = vrndm_f64(*v0); }
void VrndmqF32(float32x4_t* r, float32x4_t* v0) { *r = vrndmq_f32(*v0); }
void VrndmqF64(float64x2_t* r, float64x2_t* v0) { *r = vrndmq_f64(*v0); }
void VrndnF32(float32x2_t* r, float32x2_t* v0) { *r = vrndn_f32(*v0); }
void VrndnF64(float64x1_t* r, float64x1_t* v0) { *r = vrndn_f64(*v0); }
void VrndnqF32(float32x4_t* r, float32x4_t* v0) { *r = vrndnq_f32(*v0); }
void VrndnqF64(float64x2_t* r, float64x2_t* v0) { *r = vrndnq_f64(*v0); }
void VrndnsF32(float32_t* r, float32_t* v0) { *r = vrndns_f32(*v0); }
void VrndpF32(float32x2_t* r, float32x2_t* v0) { *r = vrndp_f32(*v0); }
void VrndpF64(float64x1_t* r, float64x1_t* v0) { *r = vrndp_f64(*v0); }
void VrndpqF32(float32x4_t* r, float32x4_t* v0) { *r = vrndpq_f32(*v0); }
void VrndpqF64(float64x2_t* r, float64x2_t* v0) { *r = vrndpq_f64(*v0); }
void VrndqF32(float32x4_t* r, float32x4_t* v0) { *r = vrndq_f32(*v0); }
void VrndqF64(float64x2_t* r, float64x2_t* v0) { *r = vrndq_f64(*v0); }
void VrndxF32(float32x2_t* r, float32x2_t* v0) { *r = vrndx_f32(*v0); }
void VrndxF64(float64x1_t* r, float64x1_t* v0) { *r = vrndx_f64(*v0); }
void VrndxqF32(float32x4_t* r, float32x4_t* v0) { *r = vrndxq_f32(*v0); }
void VrndxqF64(float64x2_t* r, float64x2_t* v0) { *r = vrndxq_f64(*v0); }
void VrshlS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vrshl_s8(*v0, *v1); }
void VrshlS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vrshl_s16(*v0, *v1); }
void VrshlS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vrshl_s32(*v0, *v1); }
void VrshlS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vrshl_s64(*v0, *v1); }
void VrshlU8(uint8x8_t* r, uint8x8_t* v0, int8x8_t* v1) { *r = vrshl_u8(*v0, *v1); }
void VrshlU16(uint16x4_t* r, uint16x4_t* v0, int16x4_t* v1) { *r = vrshl_u16(*v0, *v1); }
void VrshlU32(uint32x2_t* r, uint32x2_t* v0, int32x2_t* v1) { *r = vrshl_u32(*v0, *v1); }
void VrshlU64(uint64x1_t* r, uint64x1_t* v0, int64x1_t* v1) { *r = vrshl_u64(*v0, *v1); }
void VrshldS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vrshld_s64(*v0, *v1); }
void VrshldU64(uint64_t* r, uint64_t* v0, int64_t* v1) { *r = vrshld_u64(*v0, *v1); }
void VrshlqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vrshlq_s8(*v0, *v1); }
void VrshlqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vrshlq_s16(*v0, *v1); }
void VrshlqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vrshlq_s32(*v0, *v1); }
void VrshlqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vrshlq_s64(*v0, *v1); }
void VrshlqU8(uint8x16_t* r, uint8x16_t* v0, int8x16_t* v1) { *r = vrshlq_u8(*v0, *v1); }
void VrshlqU16(uint16x8_t* r, uint16x8_t* v0, int16x8_t* v1) { *r = vrshlq_u16(*v0, *v1); }
void VrshlqU32(uint32x4_t* r, uint32x4_t* v0, int32x4_t* v1) { *r = vrshlq_u32(*v0, *v1); }
void VrshlqU64(uint64x2_t* r, uint64x2_t* v0, int64x2_t* v1) { *r = vrshlq_u64(*v0, *v1); }
void VrsqrteU32(uint32x2_t* r, uint32x2_t* v0) { *r = vrsqrte_u32(*v0); }
void VrsqrteF32(float32x2_t* r, float32x2_t* v0) { *r = vrsqrte_f32(*v0); }
void VrsqrteF64(float64x1_t* r, float64x1_t* v0) { *r = vrsqrte_f64(*v0); }
void VrsqrtedF64(float64_t* r, float64_t* v0) { *r = vrsqrted_f64(*v0); }
void VrsqrteqU32(uint32x4_t* r, uint32x4_t* v0) { *r = vrsqrteq_u32(*v0); }
void VrsqrteqF32(float32x4_t* r, float32x4_t* v0) { *r = vrsqrteq_f32(*v0); }
void VrsqrteqF64(float64x2_t* r, float64x2_t* v0) { *r = vrsqrteq_f64(*v0); }
void VrsqrtesF32(float32_t* r, float32_t* v0) { *r = vrsqrtes_f32(*v0); }
void VrsqrtsF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vrsqrts_f32(*v0, *v1); }
void VrsqrtsF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vrsqrts_f64(*v0, *v1); }
void VrsqrtsdF64(float64_t* r, float64_t* v0, float64_t* v1) { *r = vrsqrtsd_f64(*v0, *v1); }
void VrsqrtsqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vrsqrtsq_f32(*v0, *v1); }
void VrsqrtsqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vrsqrtsq_f64(*v0, *v1); }
void VrsqrtssF32(float32_t* r, float32_t* v0, float32_t* v1) { *r = vrsqrtss_f32(*v0, *v1); }
void VrsubhnS16(int8x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vrsubhn_s16(*v0, *v1); }
void VrsubhnS32(int16x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vrsubhn_s32(*v0, *v1); }
void VrsubhnS64(int32x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vrsubhn_s64(*v0, *v1); }
void VrsubhnU16(uint8x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vrsubhn_u16(*v0, *v1); }
void VrsubhnU32(uint16x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vrsubhn_u32(*v0, *v1); }
void VrsubhnU64(uint32x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vrsubhn_u64(*v0, *v1); }
void VrsubhnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vrsubhn_high_s16(*v0, *v1, *v2); }
void VrsubhnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vrsubhn_high_s32(*v0, *v1, *v2); }
void VrsubhnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1, int64x2_t* v2) { *r = vrsubhn_high_s64(*v0, *v1, *v2); }
void VrsubhnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vrsubhn_high_u16(*v0, *v1, *v2); }
void VrsubhnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vrsubhn_high_u32(*v0, *v1, *v2); }
void VrsubhnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = vrsubhn_high_u64(*v0, *v1, *v2); }
void Vsha1CqU32(uint32x4_t* r, uint32x4_t* v0, uint32_t* v1, uint32x4_t* v2) { *r = vsha1cq_u32(*v0, *v1, *v2); }
void Vsha1HU32(uint32_t* r, uint32_t* v0) { *r = vsha1h_u32(*v0); }
void Vsha1MqU32(uint32x4_t* r, uint32x4_t* v0, uint32_t* v1, uint32x4_t* v2) { *r = vsha1mq_u32(*v0, *v1, *v2); }
void Vsha1PqU32(uint32x4_t* r, uint32x4_t* v0, uint32_t* v1, uint32x4_t* v2) { *r = vsha1pq_u32(*v0, *v1, *v2); }
void Vsha1Su0QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vsha1su0q_u32(*v0, *v1, *v2); }
void Vsha1Su1QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vsha1su1q_u32(*v0, *v1); }
void Vsha256H2QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vsha256h2q_u32(*v0, *v1, *v2); }
void Vsha256HqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vsha256hq_u32(*v0, *v1, *v2); }
void Vsha256Su0QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vsha256su0q_u32(*v0, *v1); }
void Vsha256Su1QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vsha256su1q_u32(*v0, *v1, *v2); }
void Vsha512H2QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = vsha512h2q_u64(*v0, *v1, *v2); }
void Vsha512HqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = vsha512hq_u64(*v0, *v1, *v2); }
void Vsha512Su0QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vsha512su0q_u64(*v0, *v1); }
void Vsha512Su1QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = vsha512su1q_u64(*v0, *v1, *v2); }
void VshlS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vshl_s8(*v0, *v1); }
void VshlS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vshl_s16(*v0, *v1); }
void VshlS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vshl_s32(*v0, *v1); }
void VshlS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vshl_s64(*v0, *v1); }
void VshlU8(uint8x8_t* r, uint8x8_t* v0, int8x8_t* v1) { *r = vshl_u8(*v0, *v1); }
void VshlU16(uint16x4_t* r, uint16x4_t* v0, int16x4_t* v1) { *r = vshl_u16(*v0, *v1); }
void VshlU32(uint32x2_t* r, uint32x2_t* v0, int32x2_t* v1) { *r = vshl_u32(*v0, *v1); }
void VshlU64(uint64x1_t* r, uint64x1_t* v0, int64x1_t* v1) { *r = vshl_u64(*v0, *v1); }
void VshldS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vshld_s64(*v0, *v1); }
void VshldU64(uint64_t* r, uint64_t* v0, int64_t* v1) { *r = vshld_u64(*v0, *v1); }
void VshlqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vshlq_s8(*v0, *v1); }
void VshlqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vshlq_s16(*v0, *v1); }
void VshlqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vshlq_s32(*v0, *v1); }
void VshlqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vshlq_s64(*v0, *v1); }
void VshlqU8(uint8x16_t* r, uint8x16_t* v0, int8x16_t* v1) { *r = vshlq_u8(*v0, *v1); }
void VshlqU16(uint16x8_t* r, uint16x8_t* v0, int16x8_t* v1) { *r = vshlq_u16(*v0, *v1); }
void VshlqU32(uint32x4_t* r, uint32x4_t* v0, int32x4_t* v1) { *r = vshlq_u32(*v0, *v1); }
void VshlqU64(uint64x2_t* r, uint64x2_t* v0, int64x2_t* v1) { *r = vshlq_u64(*v0, *v1); }
void Vsm3Partw1QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vsm3partw1q_u32(*v0, *v1, *v2); }
void Vsm3Partw2QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vsm3partw2q_u32(*v0, *v1, *v2); }
void Vsm3Ss1QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vsm3ss1q_u32(*v0, *v1, *v2); }
void Vsm4EkeyqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vsm4ekeyq_u32(*v0, *v1); }
void Vsm4EqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vsm4eq_u32(*v0, *v1); }
void VsqaddU8(uint8x8_t* r, uint8x8_t* v0, int8x8_t* v1) { *r = vsqadd_u8(*v0, *v1); }
void VsqaddU16(uint16x4_t* r, uint16x4_t* v0, int16x4_t* v1) { *r = vsqadd_u16(*v0, *v1); }
void VsqaddU32(uint32x2_t* r, uint32x2_t* v0, int32x2_t* v1) { *r = vsqadd_u32(*v0, *v1); }
void VsqaddU64(uint64x1_t* r, uint64x1_t* v0, int64x1_t* v1) { *r = vsqadd_u64(*v0, *v1); }
void VsqaddbU8(uint8_t* r, uint8_t* v0, int8_t* v1) { *r = vsqaddb_u8(*v0, *v1); }
void VsqadddU64(uint64_t* r, uint64_t* v0, int64_t* v1) { *r = vsqaddd_u64(*v0, *v1); }
void VsqaddhU16(uint16_t* r, uint16_t* v0, int16_t* v1) { *r = vsqaddh_u16(*v0, *v1); }
void VsqaddqU8(uint8x16_t* r, uint8x16_t* v0, int8x16_t* v1) { *r = vsqaddq_u8(*v0, *v1); }
void VsqaddqU16(uint16x8_t* r, uint16x8_t* v0, int16x8_t* v1) { *r = vsqaddq_u16(*v0, *v1); }
void VsqaddqU32(uint32x4_t* r, uint32x4_t* v0, int32x4_t* v1) { *r = vsqaddq_u32(*v0, *v1); }
void VsqaddqU64(uint64x2_t* r, uint64x2_t* v0, int64x2_t* v1) { *r = vsqaddq_u64(*v0, *v1); }
void VsqaddsU32(uint32_t* r, uint32_t* v0, int32_t* v1) { *r = vsqadds_u32(*v0, *v1); }
void VsqrtF32(float32x2_t* r, float32x2_t* v0) { *r = vsqrt_f32(*v0); }
void VsqrtF64(float64x1_t* r, float64x1_t* v0) { *r = vsqrt_f64(*v0); }
void VsqrtqF32(float32x4_t* r, float32x4_t* v0) { *r = vsqrtq_f32(*v0); }
void VsqrtqF64(float64x2_t* r, float64x2_t* v0) { *r = vsqrtq_f64(*v0); }
void VsubS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vsub_s8(*v0, *v1); }
void VsubS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vsub_s16(*v0, *v1); }
void VsubS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vsub_s32(*v0, *v1); }
void VsubS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vsub_s64(*v0, *v1); }
void VsubU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vsub_u8(*v0, *v1); }
void VsubU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vsub_u16(*v0, *v1); }
void VsubU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vsub_u32(*v0, *v1); }
void VsubU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vsub_u64(*v0, *v1); }
void VsubF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vsub_f32(*v0, *v1); }
void VsubF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vsub_f64(*v0, *v1); }
void VsubdS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vsubd_s64(*v0, *v1); }
void VsubdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vsubd_u64(*v0, *v1); }
void VsubhnS16(int8x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vsubhn_s16(*v0, *v1); }
void VsubhnS32(int16x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vsubhn_s32(*v0, *v1); }
void VsubhnS64(int32x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vsubhn_s64(*v0, *v1); }
void VsubhnU16(uint8x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vsubhn_u16(*v0, *v1); }
void VsubhnU32(uint16x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vsubhn_u32(*v0, *v1); }
void VsubhnU64(uint32x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vsubhn_u64(*v0, *v1); }
void VsubhnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1, int16x8_t* v2) { *r = vsubhn_high_s16(*v0, *v1, *v2); }
void VsubhnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1, int32x4_t* v2) { *r = vsubhn_high_s32(*v0, *v1, *v2); }
void VsubhnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1, int64x2_t* v2) { *r = vsubhn_high_s64(*v0, *v1, *v2); }
void VsubhnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1, uint16x8_t* v2) { *r = vsubhn_high_u16(*v0, *v1, *v2); }
void VsubhnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1, uint32x4_t* v2) { *r = vsubhn_high_u32(*v0, *v1, *v2); }
void VsubhnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1, uint64x2_t* v2) { *r = vsubhn_high_u64(*v0, *v1, *v2); }
void VsublS8(int16x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vsubl_s8(*v0, *v1); }
void VsublS16(int32x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vsubl_s16(*v0, *v1); }
void VsublS32(int64x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vsubl_s32(*v0, *v1); }
void VsublU8(uint16x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vsubl_u8(*v0, *v1); }
void VsublU16(uint32x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vsubl_u16(*v0, *v1); }
void VsublU32(uint64x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vsubl_u32(*v0, *v1); }
void VsublHighS8(int16x8_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vsubl_high_s8(*v0, *v1); }
void VsublHighS16(int32x4_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vsubl_high_s16(*v0, *v1); }
void VsublHighS32(int64x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vsubl_high_s32(*v0, *v1); }
void VsublHighU8(uint16x8_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vsubl_high_u8(*v0, *v1); }
void VsublHighU16(uint32x4_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vsubl_high_u16(*v0, *v1); }
void VsublHighU32(uint64x2_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vsubl_high_u32(*v0, *v1); }
void VsubqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vsubq_s8(*v0, *v1); }
void VsubqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vsubq_s16(*v0, *v1); }
void VsubqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vsubq_s32(*v0, *v1); }
void VsubqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vsubq_s64(*v0, *v1); }
void VsubqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vsubq_u8(*v0, *v1); }
void VsubqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vsubq_u16(*v0, *v1); }
void VsubqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vsubq_u32(*v0, *v1); }
void VsubqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vsubq_u64(*v0, *v1); }
void VsubqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vsubq_f32(*v0, *v1); }
void VsubqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vsubq_f64(*v0, *v1); }
void VsubwS8(int16x8_t* r, int16x8_t* v0, int8x8_t* v1) { *r = vsubw_s8(*v0, *v1); }
void VsubwS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1) { *r = vsubw_s16(*v0, *v1); }
void VsubwS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1) { *r = vsubw_s32(*v0, *v1); }
void VsubwU8(uint16x8_t* r, uint16x8_t* v0, uint8x8_t* v1) { *r = vsubw_u8(*v0, *v1); }
void VsubwU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1) { *r = vsubw_u16(*v0, *v1); }
void VsubwU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1) { *r = vsubw_u32(*v0, *v1); }
void VsubwHighS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1) { *r = vsubw_high_s8(*v0, *v1); }
void VsubwHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1) { *r = vsubw_high_s16(*v0, *v1); }
void VsubwHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1) { *r = vsubw_high_s32(*v0, *v1); }
void VsubwHighU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1) { *r = vsubw_high_u8(*v0, *v1); }
void VsubwHighU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1) { *r = vsubw_high_u16(*v0, *v1); }
void VsubwHighU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1) { *r = vsubw_high_u32(*v0, *v1); }
void Vtbl1S8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vtbl1_s8(*v0, *v1); }
void Vtbl1U8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vtbl1_u8(*v0, *v1); }
void Vtbl1P8(poly8x8_t* r, poly8x8_t* v0, uint8x8_t* v1) { *r = vtbl1_p8(*v0, *v1); }
void Vtbl2S8(int8x8_t* r, int8x8x2_t* v0, int8x8_t* v1) { *r = vtbl2_s8(*v0, *v1); }
void Vtbl2U8(uint8x8_t* r, uint8x8x2_t* v0, uint8x8_t* v1) { *r = vtbl2_u8(*v0, *v1); }
void Vtbl2P8(poly8x8_t* r, poly8x8x2_t* v0, uint8x8_t* v1) { *r = vtbl2_p8(*v0, *v1); }
void Vtbl3S8(int8x8_t* r, int8x8x3_t* v0, int8x8_t* v1) { *r = vtbl3_s8(*v0, *v1); }
void Vtbl3U8(uint8x8_t* r, uint8x8x3_t* v0, uint8x8_t* v1) { *r = vtbl3_u8(*v0, *v1); }
void Vtbl3P8(poly8x8_t* r, poly8x8x3_t* v0, uint8x8_t* v1) { *r = vtbl3_p8(*v0, *v1); }
void Vtbl4S8(int8x8_t* r, int8x8x4_t* v0, int8x8_t* v1) { *r = vtbl4_s8(*v0, *v1); }
void Vtbl4U8(uint8x8_t* r, uint8x8x4_t* v0, uint8x8_t* v1) { *r = vtbl4_u8(*v0, *v1); }
void Vtbl4P8(poly8x8_t* r, poly8x8x4_t* v0, uint8x8_t* v1) { *r = vtbl4_p8(*v0, *v1); }
void Vtbx1S8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vtbx1_s8(*v0, *v1, *v2); }
void Vtbx1U8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) { *r = vtbx1_u8(*v0, *v1, *v2); }
void Vtbx1P8(poly8x8_t* r, poly8x8_t* v0, poly8x8_t* v1, uint8x8_t* v2) { *r = vtbx1_p8(*v0, *v1, *v2); }
void Vtbx2S8(int8x8_t* r, int8x8_t* v0, int8x8x2_t* v1, int8x8_t* v2) { *r = vtbx2_s8(*v0, *v1, *v2); }
void Vtbx2U8(uint8x8_t* r, uint8x8_t* v0, uint8x8x2_t* v1, uint8x8_t* v2) { *r = vtbx2_u8(*v0, *v1, *v2); }
void Vtbx2P8(poly8x8_t* r, poly8x8_t* v0, poly8x8x2_t* v1, uint8x8_t* v2) { *r = vtbx2_p8(*v0, *v1, *v2); }
void Vtbx3S8(int8x8_t* r, int8x8_t* v0, int8x8x3_t* v1, int8x8_t* v2) { *r = vtbx3_s8(*v0, *v1, *v2); }
void Vtbx3U8(uint8x8_t* r, uint8x8_t* v0, uint8x8x3_t* v1, uint8x8_t* v2) { *r = vtbx3_u8(*v0, *v1, *v2); }
void Vtbx3P8(poly8x8_t* r, poly8x8_t* v0, poly8x8x3_t* v1, uint8x8_t* v2) { *r = vtbx3_p8(*v0, *v1, *v2); }
void Vtbx4S8(int8x8_t* r, int8x8_t* v0, int8x8x4_t* v1, int8x8_t* v2) { *r = vtbx4_s8(*v0, *v1, *v2); }
void Vtbx4U8(uint8x8_t* r, uint8x8_t* v0, uint8x8x4_t* v1, uint8x8_t* v2) { *r = vtbx4_u8(*v0, *v1, *v2); }
void Vtbx4P8(poly8x8_t* r, poly8x8_t* v0, poly8x8x4_t* v1, uint8x8_t* v2) { *r = vtbx4_p8(*v0, *v1, *v2); }
void VtrnS8(int8x8x2_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vtrn_s8(*v0, *v1); }
void VtrnS16(int16x4x2_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vtrn_s16(*v0, *v1); }
void VtrnS32(int32x2x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vtrn_s32(*v0, *v1); }
void VtrnU8(uint8x8x2_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vtrn_u8(*v0, *v1); }
void VtrnU16(uint16x4x2_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vtrn_u16(*v0, *v1); }
void VtrnU32(uint32x2x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vtrn_u32(*v0, *v1); }
void VtrnF32(float32x2x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vtrn_f32(*v0, *v1); }
void Vtrn1S8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vtrn1_s8(*v0, *v1); }
void Vtrn1S16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vtrn1_s16(*v0, *v1); }
void Vtrn1S32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vtrn1_s32(*v0, *v1); }
void Vtrn1U8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vtrn1_u8(*v0, *v1); }
void Vtrn1U16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vtrn1_u16(*v0, *v1); }
void Vtrn1U32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vtrn1_u32(*v0, *v1); }
void Vtrn1F32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vtrn1_f32(*v0, *v1); }
void Vtrn1P16(poly16x4_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = vtrn1_p16(*v0, *v1); }
void Vtrn1P8(poly8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vtrn1_p8(*v0, *v1); }
void Vtrn1QS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vtrn1q_s8(*v0, *v1); }
void Vtrn1QS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vtrn1q_s16(*v0, *v1); }
void Vtrn1QS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vtrn1q_s32(*v0, *v1); }
void Vtrn1QS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vtrn1q_s64(*v0, *v1); }
void Vtrn1QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vtrn1q_u8(*v0, *v1); }
void Vtrn1QU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vtrn1q_u16(*v0, *v1); }
void Vtrn1QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vtrn1q_u32(*v0, *v1); }
void Vtrn1QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vtrn1q_u64(*v0, *v1); }
void Vtrn1QF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vtrn1q_f32(*v0, *v1); }
void Vtrn1QF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vtrn1q_f64(*v0, *v1); }
void Vtrn1QP16(poly16x8_t* r, poly16x8_t* v0, poly16x8_t* v1) { *r = vtrn1q_p16(*v0, *v1); }
void Vtrn1QP64(poly64x2_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = vtrn1q_p64(*v0, *v1); }
void Vtrn1QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vtrn1q_p8(*v0, *v1); }
void Vtrn2S8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vtrn2_s8(*v0, *v1); }
void Vtrn2S16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vtrn2_s16(*v0, *v1); }
void Vtrn2S32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vtrn2_s32(*v0, *v1); }
void Vtrn2U8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vtrn2_u8(*v0, *v1); }
void Vtrn2U16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vtrn2_u16(*v0, *v1); }
void Vtrn2U32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vtrn2_u32(*v0, *v1); }
void Vtrn2F32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vtrn2_f32(*v0, *v1); }
void Vtrn2P16(poly16x4_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = vtrn2_p16(*v0, *v1); }
void Vtrn2P8(poly8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vtrn2_p8(*v0, *v1); }
void Vtrn2QS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vtrn2q_s8(*v0, *v1); }
void Vtrn2QS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vtrn2q_s16(*v0, *v1); }
void Vtrn2QS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vtrn2q_s32(*v0, *v1); }
void Vtrn2QS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vtrn2q_s64(*v0, *v1); }
void Vtrn2QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vtrn2q_u8(*v0, *v1); }
void Vtrn2QU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vtrn2q_u16(*v0, *v1); }
void Vtrn2QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vtrn2q_u32(*v0, *v1); }
void Vtrn2QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vtrn2q_u64(*v0, *v1); }
void Vtrn2QF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vtrn2q_f32(*v0, *v1); }
void Vtrn2QF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vtrn2q_f64(*v0, *v1); }
void Vtrn2QP16(poly16x8_t* r, poly16x8_t* v0, poly16x8_t* v1) { *r = vtrn2q_p16(*v0, *v1); }
void Vtrn2QP64(poly64x2_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = vtrn2q_p64(*v0, *v1); }
void Vtrn2QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vtrn2q_p8(*v0, *v1); }
void VtrnP16(poly16x4x2_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = vtrn_p16(*v0, *v1); }
void VtrnP8(poly8x8x2_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vtrn_p8(*v0, *v1); }
void VtrnqS8(int8x16x2_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vtrnq_s8(*v0, *v1); }
void VtrnqS16(int16x8x2_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vtrnq_s16(*v0, *v1); }
void VtrnqS32(int32x4x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vtrnq_s32(*v0, *v1); }
void VtrnqU8(uint8x16x2_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vtrnq_u8(*v0, *v1); }
void VtrnqU16(uint16x8x2_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vtrnq_u16(*v0, *v1); }
void VtrnqU32(uint32x4x2_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vtrnq_u32(*v0, *v1); }
void VtrnqF32(float32x4x2_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vtrnq_f32(*v0, *v1); }
void VtrnqP16(poly16x8x2_t* r, poly16x8_t* v0, poly16x8_t* v1) { *r = vtrnq_p16(*v0, *v1); }
void VtrnqP8(poly8x16x2_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vtrnq_p8(*v0, *v1); }
void VtstS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vtst_s8(*v0, *v1); }
void VtstS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vtst_s16(*v0, *v1); }
void VtstS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vtst_s32(*v0, *v1); }
void VtstS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vtst_s64(*v0, *v1); }
void VtstU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vtst_u8(*v0, *v1); }
void VtstU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vtst_u16(*v0, *v1); }
void VtstU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vtst_u32(*v0, *v1); }
void VtstU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vtst_u64(*v0, *v1); }
void VtstP16(uint16x4_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = vtst_p16(*v0, *v1); }
void VtstP64(uint64x1_t* r, poly64x1_t* v0, poly64x1_t* v1) { *r = vtst_p64(*v0, *v1); }
void VtstP8(uint8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vtst_p8(*v0, *v1); }
void VtstdS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vtstd_s64(*v0, *v1); }
void VtstdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vtstd_u64(*v0, *v1); }
void VtstqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vtstq_s8(*v0, *v1); }
void VtstqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vtstq_s16(*v0, *v1); }
void VtstqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vtstq_s32(*v0, *v1); }
void VtstqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vtstq_s64(*v0, *v1); }
void VtstqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vtstq_u8(*v0, *v1); }
void VtstqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vtstq_u16(*v0, *v1); }
void VtstqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vtstq_u32(*v0, *v1); }
void VtstqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vtstq_u64(*v0, *v1); }
void VtstqP16(uint16x8_t* r, poly16x8_t* v0, poly16x8_t* v1) { *r = vtstq_p16(*v0, *v1); }
void VtstqP64(uint64x2_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = vtstq_p64(*v0, *v1); }
void VtstqP8(uint8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vtstq_p8(*v0, *v1); }
void VuqaddS8(int8x8_t* r, int8x8_t* v0, uint8x8_t* v1) { *r = vuqadd_s8(*v0, *v1); }
void VuqaddS16(int16x4_t* r, int16x4_t* v0, uint16x4_t* v1) { *r = vuqadd_s16(*v0, *v1); }
void VuqaddS32(int32x2_t* r, int32x2_t* v0, uint32x2_t* v1) { *r = vuqadd_s32(*v0, *v1); }
void VuqaddS64(int64x1_t* r, int64x1_t* v0, uint64x1_t* v1) { *r = vuqadd_s64(*v0, *v1); }
void VuqaddbS8(int8_t* r, int8_t* v0, uint8_t* v1) { *r = vuqaddb_s8(*v0, *v1); }
void VuqadddS64(int64_t* r, int64_t* v0, uint64_t* v1) { *r = vuqaddd_s64(*v0, *v1); }
void VuqaddhS16(int16_t* r, int16_t* v0, uint16_t* v1) { *r = vuqaddh_s16(*v0, *v1); }
void VuqaddqS8(int8x16_t* r, int8x16_t* v0, uint8x16_t* v1) { *r = vuqaddq_s8(*v0, *v1); }
void VuqaddqS16(int16x8_t* r, int16x8_t* v0, uint16x8_t* v1) { *r = vuqaddq_s16(*v0, *v1); }
void VuqaddqS32(int32x4_t* r, int32x4_t* v0, uint32x4_t* v1) { *r = vuqaddq_s32(*v0, *v1); }
void VuqaddqS64(int64x2_t* r, int64x2_t* v0, uint64x2_t* v1) { *r = vuqaddq_s64(*v0, *v1); }
void VuqaddsS32(int32_t* r, int32_t* v0, uint32_t* v1) { *r = vuqadds_s32(*v0, *v1); }
void VusdotS32(int32x2_t* r, int32x2_t* v0, uint8x8_t* v1, int8x8_t* v2) { *r = vusdot_s32(*v0, *v1, *v2); }
void VusdotqS32(int32x4_t* r, int32x4_t* v0, uint8x16_t* v1, int8x16_t* v2) { *r = vusdotq_s32(*v0, *v1, *v2); }
void VusmmlaqS32(int32x4_t* r, int32x4_t* v0, uint8x16_t* v1, int8x16_t* v2) { *r = vusmmlaq_s32(*v0, *v1, *v2); }
void VuzpS8(int8x8x2_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vuzp_s8(*v0, *v1); }
void VuzpS16(int16x4x2_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vuzp_s16(*v0, *v1); }
void VuzpS32(int32x2x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vuzp_s32(*v0, *v1); }
void VuzpU8(uint8x8x2_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vuzp_u8(*v0, *v1); }
void VuzpU16(uint16x4x2_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vuzp_u16(*v0, *v1); }
void VuzpU32(uint32x2x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vuzp_u32(*v0, *v1); }
void VuzpF32(float32x2x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vuzp_f32(*v0, *v1); }
void Vuzp1S8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vuzp1_s8(*v0, *v1); }
void Vuzp1S16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vuzp1_s16(*v0, *v1); }
void Vuzp1S32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vuzp1_s32(*v0, *v1); }
void Vuzp1U8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vuzp1_u8(*v0, *v1); }
void Vuzp1U16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vuzp1_u16(*v0, *v1); }
void Vuzp1U32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vuzp1_u32(*v0, *v1); }
void Vuzp1F32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vuzp1_f32(*v0, *v1); }
void Vuzp1P16(poly16x4_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = vuzp1_p16(*v0, *v1); }
void Vuzp1P8(poly8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vuzp1_p8(*v0, *v1); }
void Vuzp1QS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vuzp1q_s8(*v0, *v1); }
void Vuzp1QS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vuzp1q_s16(*v0, *v1); }
void Vuzp1QS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vuzp1q_s32(*v0, *v1); }
void Vuzp1QS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vuzp1q_s64(*v0, *v1); }
void Vuzp1QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vuzp1q_u8(*v0, *v1); }
void Vuzp1QU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vuzp1q_u16(*v0, *v1); }
void Vuzp1QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vuzp1q_u32(*v0, *v1); }
void Vuzp1QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vuzp1q_u64(*v0, *v1); }
void Vuzp1QF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vuzp1q_f32(*v0, *v1); }
void Vuzp1QF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vuzp1q_f64(*v0, *v1); }
void Vuzp1QP16(poly16x8_t* r, poly16x8_t* v0, poly16x8_t* v1) { *r = vuzp1q_p16(*v0, *v1); }
void Vuzp1QP64(poly64x2_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = vuzp1q_p64(*v0, *v1); }
void Vuzp1QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vuzp1q_p8(*v0, *v1); }
void Vuzp2S8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vuzp2_s8(*v0, *v1); }
void Vuzp2S16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vuzp2_s16(*v0, *v1); }
void Vuzp2S32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vuzp2_s32(*v0, *v1); }
void Vuzp2U8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vuzp2_u8(*v0, *v1); }
void Vuzp2U16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vuzp2_u16(*v0, *v1); }
void Vuzp2U32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vuzp2_u32(*v0, *v1); }
void Vuzp2F32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vuzp2_f32(*v0, *v1); }
void Vuzp2P16(poly16x4_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = vuzp2_p16(*v0, *v1); }
void Vuzp2P8(poly8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vuzp2_p8(*v0, *v1); }
void Vuzp2QS8(int8x16_t* r, int8x16_t* v0,
Download .txt
gitextract_6cse9yuu/

├── .gitignore
├── LICENSE
├── README.md
├── arm/
│   ├── generate.go
│   ├── neon/
│   │   ├── functions.c
│   │   ├── functions.go
│   │   ├── functions_bypass.go
│   │   ├── functions_cgo.go
│   │   ├── functions_test.go
│   │   ├── loops.c
│   │   ├── loops.go
│   │   └── loops_test.go
│   └── types.go
├── example/
│   ├── neon/
│   │   └── main.go
│   └── sse2/
│       └── main.go
├── generator/
│   ├── arm/
│   │   ├── arm.go
│   │   ├── main.go
│   │   └── sort.go
│   ├── scanner/
│   │   ├── scan.go
│   │   ├── scan_test.go
│   │   └── util.go
│   ├── types/
│   │   ├── function.go
│   │   └── type.go
│   ├── utils/
│   │   ├── download.go
│   │   ├── filter.go
│   │   └── slice.go
│   ├── writer/
│   │   ├── cgo.go
│   │   ├── function.go
│   │   ├── package.go
│   │   ├── package_test.go
│   │   ├── type.go
│   │   └── writer.go
│   └── x86/
│       ├── info.go
│       └── main.go
├── go.mod
├── go.sum
└── x86/
    ├── aes/
    │   ├── functions.c
    │   └── functions.go
    ├── avx/
    │   ├── functions.c
    │   └── functions.go
    ├── avx2/
    │   ├── functions.c
    │   └── functions.go
    ├── bmi/
    │   ├── functions.c
    │   └── functions.go
    ├── bmi2/
    │   ├── functions.c
    │   └── functions.go
    ├── crc32/
    │   ├── functions.c
    │   └── functions.go
    ├── f16c/
    │   ├── functions.c
    │   └── functions.go
    ├── fma/
    │   ├── functions.c
    │   └── functions.go
    ├── fsgsbase/
    │   ├── functions.c
    │   └── functions.go
    ├── generate.go
    ├── lzcnt/
    │   ├── functions.c
    │   └── functions.go
    ├── mmx/
    │   ├── functions.c
    │   └── functions.go
    ├── mmx_sse/
    │   ├── functions.c
    │   └── functions.go
    ├── mmx_sse2/
    │   ├── functions.c
    │   └── functions.go
    ├── mmx_ssse3/
    │   ├── functions.c
    │   └── functions.go
    ├── popcnt/
    │   ├── functions.c
    │   └── functions.go
    ├── sse/
    │   ├── functions.c
    │   └── functions.go
    ├── sse2/
    │   ├── functions.c
    │   └── functions.go
    ├── sse3/
    │   ├── functions.c
    │   └── functions.go
    ├── ssse3/
    │   ├── functions.c
    │   └── functions.go
    └── types.go
Download .txt
Showing preview only (748K chars total). Download the full file or copy to clipboard to get everything.
SYMBOL INDEX (7261 symbols across 66 files)

FILE: arm/neon/functions.c
  function VabaS8 (line 3) | void VabaS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r ...
  function VabaS16 (line 4) | void VabaS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* v2) ...
  function VabaS32 (line 5) | void VabaS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* v2) ...
  function VabaU8 (line 6) | void VabaU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) {...
  function VabaU16 (line 7) | void VabaU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16x4_t* ...
  function VabaU32 (line 8) | void VabaU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32x2_t* ...
  function VabalS8 (line 9) | void VabalS8(int16x8_t* r, int16x8_t* v0, int8x8_t* v1, int8x8_t* v2) { ...
  function VabalS16 (line 10) | void VabalS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v2)...
  function VabalS32 (line 11) | void VabalS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v2)...
  function VabalU8 (line 12) | void VabalU8(uint16x8_t* r, uint16x8_t* v0, uint8x8_t* v1, uint8x8_t* v2...
  function VabalU16 (line 13) | void VabalU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16x4_t*...
  function VabalU32 (line 14) | void VabalU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32x2_t*...
  function VabalHighS8 (line 15) | void VabalHighS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1, int8x16_t* ...
  function VabalHighS16 (line 16) | void VabalHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_t*...
  function VabalHighS32 (line 17) | void VabalHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_t*...
  function VabalHighU8 (line 18) | void VabalHighU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1, uint8x16...
  function VabalHighU16 (line 19) | void VabalHighU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16x...
  function VabalHighU32 (line 20) | void VabalHighU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32x...
  function VabaqS8 (line 21) | void VabaqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2) ...
  function VabaqS16 (line 22) | void VabaqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2)...
  function VabaqS32 (line 23) | void VabaqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2)...
  function VabaqU8 (line 24) | void VabaqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* ...
  function VabaqU16 (line 25) | void VabaqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t*...
  function VabaqU32 (line 26) | void VabaqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t*...
  function VabdS8 (line 27) | void VabdS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vabd_s8(*v0,...
  function VabdS16 (line 28) | void VabdS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vabd_s16...
  function VabdS32 (line 29) | void VabdS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vabd_s32...
  function VabdU8 (line 30) | void VabdU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vabd_u8(*...
  function VabdU16 (line 31) | void VabdU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vabd_...
  function VabdU32 (line 32) | void VabdU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vabd_...
  function VabdF32 (line 33) | void VabdF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = va...
  function VabdF64 (line 34) | void VabdF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = va...
  function VabddF64 (line 35) | void VabddF64(float64_t* r, float64_t* v0, float64_t* v1) { *r = vabdd_f...
  function VabdlS8 (line 36) | void VabdlS8(int16x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vabdl_s8(*...
  function VabdlS16 (line 37) | void VabdlS16(int32x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vabdl_s...
  function VabdlS32 (line 38) | void VabdlS32(int64x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vabdl_s...
  function VabdlU8 (line 39) | void VabdlU8(uint16x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vabdl_u...
  function VabdlU16 (line 40) | void VabdlU16(uint32x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vabd...
  function VabdlU32 (line 41) | void VabdlU32(uint64x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vabd...
  function VabdlHighS8 (line 42) | void VabdlHighS8(int16x8_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vabd...
  function VabdlHighS16 (line 43) | void VabdlHighS16(int32x4_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vab...
  function VabdlHighS32 (line 44) | void VabdlHighS32(int64x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vab...
  function VabdlHighU8 (line 45) | void VabdlHighU8(uint16x8_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = v...
  function VabdlHighU16 (line 46) | void VabdlHighU16(uint32x4_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = ...
  function VabdlHighU32 (line 47) | void VabdlHighU32(uint64x2_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = ...
  function VabdqS8 (line 48) | void VabdqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vabdq_s8...
  function VabdqS16 (line 49) | void VabdqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vabdq_s...
  function VabdqS32 (line 50) | void VabdqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vabdq_s...
  function VabdqU8 (line 51) | void VabdqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vabdq...
  function VabdqU16 (line 52) | void VabdqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vabd...
  function VabdqU32 (line 53) | void VabdqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vabd...
  function VabdqF32 (line 54) | void VabdqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VabdqF64 (line 55) | void VabdqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VabdsF32 (line 56) | void VabdsF32(float32_t* r, float32_t* v0, float32_t* v1) { *r = vabds_f...
  function VabsS8 (line 57) | void VabsS8(int8x8_t* r, int8x8_t* v0) { *r = vabs_s8(*v0); }
  function VabsS16 (line 58) | void VabsS16(int16x4_t* r, int16x4_t* v0) { *r = vabs_s16(*v0); }
  function VabsS32 (line 59) | void VabsS32(int32x2_t* r, int32x2_t* v0) { *r = vabs_s32(*v0); }
  function VabsS64 (line 60) | void VabsS64(int64x1_t* r, int64x1_t* v0) { *r = vabs_s64(*v0); }
  function VabsF32 (line 61) | void VabsF32(float32x2_t* r, float32x2_t* v0) { *r = vabs_f32(*v0); }
  function VabsF64 (line 62) | void VabsF64(float64x1_t* r, float64x1_t* v0) { *r = vabs_f64(*v0); }
  function VabsdS64 (line 63) | void VabsdS64(int64_t* r, int64_t* v0) { *r = vabsd_s64(*v0); }
  function VabsqS8 (line 64) | void VabsqS8(int8x16_t* r, int8x16_t* v0) { *r = vabsq_s8(*v0); }
  function VabsqS16 (line 65) | void VabsqS16(int16x8_t* r, int16x8_t* v0) { *r = vabsq_s16(*v0); }
  function VabsqS32 (line 66) | void VabsqS32(int32x4_t* r, int32x4_t* v0) { *r = vabsq_s32(*v0); }
  function VabsqS64 (line 67) | void VabsqS64(int64x2_t* r, int64x2_t* v0) { *r = vabsq_s64(*v0); }
  function VabsqF32 (line 68) | void VabsqF32(float32x4_t* r, float32x4_t* v0) { *r = vabsq_f32(*v0); }
  function VabsqF64 (line 69) | void VabsqF64(float64x2_t* r, float64x2_t* v0) { *r = vabsq_f64(*v0); }
  function VaddS8 (line 70) | void VaddS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vadd_s8(*v0,...
  function VaddS16 (line 71) | void VaddS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vadd_s16...
  function VaddS32 (line 72) | void VaddS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vadd_s32...
  function VaddS64 (line 73) | void VaddS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vadd_s64...
  function VaddU8 (line 74) | void VaddU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vadd_u8(*...
  function VaddU16 (line 75) | void VaddU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vadd_...
  function VaddU32 (line 76) | void VaddU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vadd_...
  function VaddU64 (line 77) | void VaddU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vadd_...
  function VaddF32 (line 78) | void VaddF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = va...
  function VaddF64 (line 79) | void VaddF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = va...
  function VaddP16 (line 80) | void VaddP16(poly16x4_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = vadd_...
  function VaddP64 (line 81) | void VaddP64(poly64x1_t* r, poly64x1_t* v0, poly64x1_t* v1) { *r = vadd_...
  function VaddP8 (line 82) | void VaddP8(poly8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vadd_p8(*...
  function VadddS64 (line 83) | void VadddS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vaddd_s64(*v0...
  function VadddU64 (line 84) | void VadddU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vaddd_u64(...
  function VaddhnS16 (line 85) | void VaddhnS16(int8x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vaddhn_...
  function VaddhnS32 (line 86) | void VaddhnS32(int16x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vaddhn...
  function VaddhnS64 (line 87) | void VaddhnS64(int32x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vaddhn...
  function VaddhnU16 (line 88) | void VaddhnU16(uint8x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vadd...
  function VaddhnU32 (line 89) | void VaddhnU32(uint16x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vad...
  function VaddhnU64 (line 90) | void VaddhnU64(uint32x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vad...
  function VaddhnHighS16 (line 91) | void VaddhnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1, int16x8_t*...
  function VaddhnHighS32 (line 92) | void VaddhnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1, int32x4_t...
  function VaddhnHighS64 (line 93) | void VaddhnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1, int64x2_t...
  function VaddhnHighU16 (line 94) | void VaddhnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1, uint16x...
  function VaddhnHighU32 (line 95) | void VaddhnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1, uint32...
  function VaddhnHighU64 (line 96) | void VaddhnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1, uint64...
  function VaddlS8 (line 97) | void VaddlS8(int16x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vaddl_s8(*...
  function VaddlS16 (line 98) | void VaddlS16(int32x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vaddl_s...
  function VaddlS32 (line 99) | void VaddlS32(int64x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vaddl_s...
  function VaddlU8 (line 100) | void VaddlU8(uint16x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vaddl_u...
  function VaddlU16 (line 101) | void VaddlU16(uint32x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vadd...
  function VaddlU32 (line 102) | void VaddlU32(uint64x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vadd...
  function VaddlHighS8 (line 103) | void VaddlHighS8(int16x8_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vadd...
  function VaddlHighS16 (line 104) | void VaddlHighS16(int32x4_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vad...
  function VaddlHighS32 (line 105) | void VaddlHighS32(int64x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vad...
  function VaddlHighU8 (line 106) | void VaddlHighU8(uint16x8_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = v...
  function VaddlHighU16 (line 107) | void VaddlHighU16(uint32x4_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = ...
  function VaddlHighU32 (line 108) | void VaddlHighU32(uint64x2_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = ...
  function VaddlvS8 (line 109) | void VaddlvS8(int16_t* r, int8x8_t* v0) { *r = vaddlv_s8(*v0); }
  function VaddlvS16 (line 110) | void VaddlvS16(int32_t* r, int16x4_t* v0) { *r = vaddlv_s16(*v0); }
  function VaddlvS32 (line 111) | void VaddlvS32(int64_t* r, int32x2_t* v0) { *r = vaddlv_s32(*v0); }
  function VaddlvU8 (line 112) | void VaddlvU8(uint16_t* r, uint8x8_t* v0) { *r = vaddlv_u8(*v0); }
  function VaddlvU16 (line 113) | void VaddlvU16(uint32_t* r, uint16x4_t* v0) { *r = vaddlv_u16(*v0); }
  function VaddlvU32 (line 114) | void VaddlvU32(uint64_t* r, uint32x2_t* v0) { *r = vaddlv_u32(*v0); }
  function VaddlvqS8 (line 115) | void VaddlvqS8(int16_t* r, int8x16_t* v0) { *r = vaddlvq_s8(*v0); }
  function VaddlvqS16 (line 116) | void VaddlvqS16(int32_t* r, int16x8_t* v0) { *r = vaddlvq_s16(*v0); }
  function VaddlvqS32 (line 117) | void VaddlvqS32(int64_t* r, int32x4_t* v0) { *r = vaddlvq_s32(*v0); }
  function VaddlvqU8 (line 118) | void VaddlvqU8(uint16_t* r, uint8x16_t* v0) { *r = vaddlvq_u8(*v0); }
  function VaddlvqU16 (line 119) | void VaddlvqU16(uint32_t* r, uint16x8_t* v0) { *r = vaddlvq_u16(*v0); }
  function VaddlvqU32 (line 120) | void VaddlvqU32(uint64_t* r, uint32x4_t* v0) { *r = vaddlvq_u32(*v0); }
  function VaddqS8 (line 121) | void VaddqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vaddq_s8...
  function VaddqS16 (line 122) | void VaddqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vaddq_s...
  function VaddqS32 (line 123) | void VaddqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vaddq_s...
  function VaddqS64 (line 124) | void VaddqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vaddq_s...
  function VaddqU8 (line 125) | void VaddqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vaddq...
  function VaddqU16 (line 126) | void VaddqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vadd...
  function VaddqU32 (line 127) | void VaddqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vadd...
  function VaddqU64 (line 128) | void VaddqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vadd...
  function VaddqF32 (line 129) | void VaddqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VaddqF64 (line 130) | void VaddqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VaddqP128 (line 131) | void VaddqP128(poly128_t* r, poly128_t* v0, poly128_t* v1) { *r = vaddq_...
  function VaddqP16 (line 132) | void VaddqP16(poly16x8_t* r, poly16x8_t* v0, poly16x8_t* v1) { *r = vadd...
  function VaddqP64 (line 133) | void VaddqP64(poly64x2_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = vadd...
  function VaddqP8 (line 134) | void VaddqP8(poly8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vaddq...
  function VaddvS8 (line 135) | void VaddvS8(int8_t* r, int8x8_t* v0) { *r = vaddv_s8(*v0); }
  function VaddvS16 (line 136) | void VaddvS16(int16_t* r, int16x4_t* v0) { *r = vaddv_s16(*v0); }
  function VaddvS32 (line 137) | void VaddvS32(int32_t* r, int32x2_t* v0) { *r = vaddv_s32(*v0); }
  function VaddvU8 (line 138) | void VaddvU8(uint8_t* r, uint8x8_t* v0) { *r = vaddv_u8(*v0); }
  function VaddvU16 (line 139) | void VaddvU16(uint16_t* r, uint16x4_t* v0) { *r = vaddv_u16(*v0); }
  function VaddvU32 (line 140) | void VaddvU32(uint32_t* r, uint32x2_t* v0) { *r = vaddv_u32(*v0); }
  function VaddvF32 (line 141) | void VaddvF32(float32_t* r, float32x2_t* v0) { *r = vaddv_f32(*v0); }
  function VaddvqS8 (line 142) | void VaddvqS8(int8_t* r, int8x16_t* v0) { *r = vaddvq_s8(*v0); }
  function VaddvqS16 (line 143) | void VaddvqS16(int16_t* r, int16x8_t* v0) { *r = vaddvq_s16(*v0); }
  function VaddvqS32 (line 144) | void VaddvqS32(int32_t* r, int32x4_t* v0) { *r = vaddvq_s32(*v0); }
  function VaddvqS64 (line 145) | void VaddvqS64(int64_t* r, int64x2_t* v0) { *r = vaddvq_s64(*v0); }
  function VaddvqU8 (line 146) | void VaddvqU8(uint8_t* r, uint8x16_t* v0) { *r = vaddvq_u8(*v0); }
  function VaddvqU16 (line 147) | void VaddvqU16(uint16_t* r, uint16x8_t* v0) { *r = vaddvq_u16(*v0); }
  function VaddvqU32 (line 148) | void VaddvqU32(uint32_t* r, uint32x4_t* v0) { *r = vaddvq_u32(*v0); }
  function VaddvqU64 (line 149) | void VaddvqU64(uint64_t* r, uint64x2_t* v0) { *r = vaddvq_u64(*v0); }
  function VaddvqF32 (line 150) | void VaddvqF32(float32_t* r, float32x4_t* v0) { *r = vaddvq_f32(*v0); }
  function VaddvqF64 (line 151) | void VaddvqF64(float64_t* r, float64x2_t* v0) { *r = vaddvq_f64(*v0); }
  function VaddwS8 (line 152) | void VaddwS8(int16x8_t* r, int16x8_t* v0, int8x8_t* v1) { *r = vaddw_s8(...
  function VaddwS16 (line 153) | void VaddwS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1) { *r = vaddw_s...
  function VaddwS32 (line 154) | void VaddwS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1) { *r = vaddw_s...
  function VaddwU8 (line 155) | void VaddwU8(uint16x8_t* r, uint16x8_t* v0, uint8x8_t* v1) { *r = vaddw_...
  function VaddwU16 (line 156) | void VaddwU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1) { *r = vadd...
  function VaddwU32 (line 157) | void VaddwU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1) { *r = vadd...
  function VaddwHighS8 (line 158) | void VaddwHighS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1) { *r = vadd...
  function VaddwHighS16 (line 159) | void VaddwHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1) { *r = vad...
  function VaddwHighS32 (line 160) | void VaddwHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1) { *r = vad...
  function VaddwHighU8 (line 161) | void VaddwHighU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1) { *r = v...
  function VaddwHighU16 (line 162) | void VaddwHighU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1) { *r = ...
  function VaddwHighU32 (line 163) | void VaddwHighU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1) { *r = ...
  function VaesdqU8 (line 164) | void VaesdqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vaes...
  function VaeseqU8 (line 165) | void VaeseqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vaes...
  function VaesimcqU8 (line 166) | void VaesimcqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vaesimcq_u8(*v0); }
  function VaesmcqU8 (line 167) | void VaesmcqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vaesmcq_u8(*v0); }
  function VandS8 (line 168) | void VandS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vand_s8(*v0,...
  function VandS16 (line 169) | void VandS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vand_s16...
  function VandS32 (line 170) | void VandS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vand_s32...
  function VandS64 (line 171) | void VandS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vand_s64...
  function VandU8 (line 172) | void VandU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vand_u8(*...
  function VandU16 (line 173) | void VandU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vand_...
  function VandU32 (line 174) | void VandU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vand_...
  function VandU64 (line 175) | void VandU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vand_...
  function VandqS8 (line 176) | void VandqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vandq_s8...
  function VandqS16 (line 177) | void VandqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vandq_s...
  function VandqS32 (line 178) | void VandqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vandq_s...
  function VandqS64 (line 179) | void VandqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vandq_s...
  function VandqU8 (line 180) | void VandqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vandq...
  function VandqU16 (line 181) | void VandqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vand...
  function VandqU32 (line 182) | void VandqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vand...
  function VandqU64 (line 183) | void VandqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vand...
  function VbcaxqS8 (line 184) | void VbcaxqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2)...
  function VbcaxqS16 (line 185) | void VbcaxqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2...
  function VbcaxqS32 (line 186) | void VbcaxqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2...
  function VbcaxqS64 (line 187) | void VbcaxqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1, int64x2_t* v2...
  function VbcaxqU8 (line 188) | void VbcaxqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t*...
  function VbcaxqU16 (line 189) | void VbcaxqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t...
  function VbcaxqU32 (line 190) | void VbcaxqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t...
  function VbcaxqU64 (line 191) | void VbcaxqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1, uint64x2_t...
  function VbicS8 (line 192) | void VbicS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vbic_s8(*v0,...
  function VbicS16 (line 193) | void VbicS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vbic_s16...
  function VbicS32 (line 194) | void VbicS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vbic_s32...
  function VbicS64 (line 195) | void VbicS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vbic_s64...
  function VbicU8 (line 196) | void VbicU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vbic_u8(*...
  function VbicU16 (line 197) | void VbicU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vbic_...
  function VbicU32 (line 198) | void VbicU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vbic_...
  function VbicU64 (line 199) | void VbicU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vbic_...
  function VbicqS8 (line 200) | void VbicqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vbicq_s8...
  function VbicqS16 (line 201) | void VbicqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vbicq_s...
  function VbicqS32 (line 202) | void VbicqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vbicq_s...
  function VbicqS64 (line 203) | void VbicqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vbicq_s...
  function VbicqU8 (line 204) | void VbicqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vbicq...
  function VbicqU16 (line 205) | void VbicqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vbic...
  function VbicqU32 (line 206) | void VbicqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vbic...
  function VbicqU64 (line 207) | void VbicqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vbic...
  function VbslS8 (line 208) | void VbslS8(int8x8_t* r, uint8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r...
  function VbslS16 (line 209) | void VbslS16(int16x4_t* r, uint16x4_t* v0, int16x4_t* v1, int16x4_t* v2)...
  function VbslS32 (line 210) | void VbslS32(int32x2_t* r, uint32x2_t* v0, int32x2_t* v1, int32x2_t* v2)...
  function VbslS64 (line 211) | void VbslS64(int64x1_t* r, uint64x1_t* v0, int64x1_t* v1, int64x1_t* v2)...
  function VbslU8 (line 212) | void VbslU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) {...
  function VbslU16 (line 213) | void VbslU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16x4_t* ...
  function VbslU32 (line 214) | void VbslU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32x2_t* ...
  function VbslU64 (line 215) | void VbslU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1, uint64x1_t* ...
  function VbslF32 (line 216) | void VbslF32(float32x2_t* r, uint32x2_t* v0, float32x2_t* v1, float32x2_...
  function VbslF64 (line 217) | void VbslF64(float64x1_t* r, uint64x1_t* v0, float64x1_t* v1, float64x1_...
  function VbslP16 (line 218) | void VbslP16(poly16x4_t* r, uint16x4_t* v0, poly16x4_t* v1, poly16x4_t* ...
  function VbslP64 (line 219) | void VbslP64(poly64x1_t* r, uint64x1_t* v0, poly64x1_t* v1, poly64x1_t* ...
  function VbslP8 (line 220) | void VbslP8(poly8x8_t* r, uint8x8_t* v0, poly8x8_t* v1, poly8x8_t* v2) {...
  function VbslqS8 (line 221) | void VbslqS8(int8x16_t* r, uint8x16_t* v0, int8x16_t* v1, int8x16_t* v2)...
  function VbslqS16 (line 222) | void VbslqS16(int16x8_t* r, uint16x8_t* v0, int16x8_t* v1, int16x8_t* v2...
  function VbslqS32 (line 223) | void VbslqS32(int32x4_t* r, uint32x4_t* v0, int32x4_t* v1, int32x4_t* v2...
  function VbslqS64 (line 224) | void VbslqS64(int64x2_t* r, uint64x2_t* v0, int64x2_t* v1, int64x2_t* v2...
  function VbslqU8 (line 225) | void VbslqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* ...
  function VbslqU16 (line 226) | void VbslqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t*...
  function VbslqU32 (line 227) | void VbslqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t*...
  function VbslqU64 (line 228) | void VbslqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1, uint64x2_t*...
  function VbslqF32 (line 229) | void VbslqF32(float32x4_t* r, uint32x4_t* v0, float32x4_t* v1, float32x4...
  function VbslqF64 (line 230) | void VbslqF64(float64x2_t* r, uint64x2_t* v0, float64x2_t* v1, float64x2...
  function VbslqP16 (line 231) | void VbslqP16(poly16x8_t* r, uint16x8_t* v0, poly16x8_t* v1, poly16x8_t*...
  function VbslqP64 (line 232) | void VbslqP64(poly64x2_t* r, uint64x2_t* v0, poly64x2_t* v1, poly64x2_t*...
  function VbslqP8 (line 233) | void VbslqP8(poly8x16_t* r, uint8x16_t* v0, poly8x16_t* v1, poly8x16_t* ...
  function VcaddRot270F32 (line 234) | void VcaddRot270F32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { ...
  function VcaddRot90F32 (line 235) | void VcaddRot90F32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *...
  function VcaddqRot270F32 (line 236) | void VcaddqRot270F32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) {...
  function VcaddqRot270F64 (line 237) | void VcaddqRot270F64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) {...
  function VcaddqRot90F32 (line 238) | void VcaddqRot90F32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { ...
  function VcaddqRot90F64 (line 239) | void VcaddqRot90F64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { ...
  function VcageF32 (line 240) | void VcageF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vc...
  function VcageF64 (line 241) | void VcageF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vc...
  function VcagedF64 (line 242) | void VcagedF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcaged_...
  function VcageqF32 (line 243) | void VcageqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VcageqF64 (line 244) | void VcageqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VcagesF32 (line 245) | void VcagesF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcages_...
  function VcagtF32 (line 246) | void VcagtF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vc...
  function VcagtF64 (line 247) | void VcagtF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vc...
  function VcagtdF64 (line 248) | void VcagtdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcagtd_...
  function VcagtqF32 (line 249) | void VcagtqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VcagtqF64 (line 250) | void VcagtqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VcagtsF32 (line 251) | void VcagtsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcagts_...
  function VcaleF32 (line 252) | void VcaleF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vc...
  function VcaleF64 (line 253) | void VcaleF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vc...
  function VcaledF64 (line 254) | void VcaledF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcaled_...
  function VcaleqF32 (line 255) | void VcaleqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VcaleqF64 (line 256) | void VcaleqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VcalesF32 (line 257) | void VcalesF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcales_...
  function VcaltF32 (line 258) | void VcaltF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vc...
  function VcaltF64 (line 259) | void VcaltF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vc...
  function VcaltdF64 (line 260) | void VcaltdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcaltd_...
  function VcaltqF32 (line 261) | void VcaltqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VcaltqF64 (line 262) | void VcaltqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VcaltsF32 (line 263) | void VcaltsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcalts_...
  function VceqS8 (line 264) | void VceqS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vceq_s8(*v0...
  function VceqS16 (line 265) | void VceqS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vceq_s1...
  function VceqS32 (line 266) | void VceqS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vceq_s3...
  function VceqS64 (line 267) | void VceqS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vceq_s6...
  function VceqU8 (line 268) | void VceqU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vceq_u8(*...
  function VceqU16 (line 269) | void VceqU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vceq_...
  function VceqU32 (line 270) | void VceqU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vceq_...
  function VceqU64 (line 271) | void VceqU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vceq_...
  function VceqF32 (line 272) | void VceqF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vce...
  function VceqF64 (line 273) | void VceqF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vce...
  function VceqP64 (line 274) | void VceqP64(uint64x1_t* r, poly64x1_t* v0, poly64x1_t* v1) { *r = vceq_...
  function VceqP8 (line 275) | void VceqP8(uint8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vceq_p8(*...
  function VceqdS64 (line 276) | void VceqdS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vceqd_s64(*v...
  function VceqdU64 (line 277) | void VceqdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vceqd_u64(...
  function VceqdF64 (line 278) | void VceqdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vceqd_f6...
  function VceqqS8 (line 279) | void VceqqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vceqq_s...
  function VceqqS16 (line 280) | void VceqqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vceqq_...
  function VceqqS32 (line 281) | void VceqqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vceqq_...
  function VceqqS64 (line 282) | void VceqqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vceqq_...
  function VceqqU8 (line 283) | void VceqqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vceqq...
  function VceqqU16 (line 284) | void VceqqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vceq...
  function VceqqU32 (line 285) | void VceqqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vceq...
  function VceqqU64 (line 286) | void VceqqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vceq...
  function VceqqF32 (line 287) | void VceqqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vc...
  function VceqqF64 (line 288) | void VceqqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vc...
  function VceqqP64 (line 289) | void VceqqP64(uint64x2_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = vceq...
  function VceqqP8 (line 290) | void VceqqP8(uint8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vceqq...
  function VceqsF32 (line 291) | void VceqsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vceqs_f3...
  function VceqzS8 (line 292) | void VceqzS8(uint8x8_t* r, int8x8_t* v0) { *r = vceqz_s8(*v0); }
  function VceqzS16 (line 293) | void VceqzS16(uint16x4_t* r, int16x4_t* v0) { *r = vceqz_s16(*v0); }
  function VceqzS32 (line 294) | void VceqzS32(uint32x2_t* r, int32x2_t* v0) { *r = vceqz_s32(*v0); }
  function VceqzS64 (line 295) | void VceqzS64(uint64x1_t* r, int64x1_t* v0) { *r = vceqz_s64(*v0); }
  function VceqzU8 (line 296) | void VceqzU8(uint8x8_t* r, uint8x8_t* v0) { *r = vceqz_u8(*v0); }
  function VceqzU16 (line 297) | void VceqzU16(uint16x4_t* r, uint16x4_t* v0) { *r = vceqz_u16(*v0); }
  function VceqzU32 (line 298) | void VceqzU32(uint32x2_t* r, uint32x2_t* v0) { *r = vceqz_u32(*v0); }
  function VceqzU64 (line 299) | void VceqzU64(uint64x1_t* r, uint64x1_t* v0) { *r = vceqz_u64(*v0); }
  function VceqzF32 (line 300) | void VceqzF32(uint32x2_t* r, float32x2_t* v0) { *r = vceqz_f32(*v0); }
  function VceqzF64 (line 301) | void VceqzF64(uint64x1_t* r, float64x1_t* v0) { *r = vceqz_f64(*v0); }
  function VceqzP64 (line 302) | void VceqzP64(uint64x1_t* r, poly64x1_t* v0) { *r = vceqz_p64(*v0); }
  function VceqzP8 (line 303) | void VceqzP8(uint8x8_t* r, poly8x8_t* v0) { *r = vceqz_p8(*v0); }
  function VceqzdS64 (line 304) | void VceqzdS64(uint64_t* r, int64_t* v0) { *r = vceqzd_s64(*v0); }
  function VceqzdU64 (line 305) | void VceqzdU64(uint64_t* r, uint64_t* v0) { *r = vceqzd_u64(*v0); }
  function VceqzdF64 (line 306) | void VceqzdF64(uint64_t* r, float64_t* v0) { *r = vceqzd_f64(*v0); }
  function VceqzqS8 (line 307) | void VceqzqS8(uint8x16_t* r, int8x16_t* v0) { *r = vceqzq_s8(*v0); }
  function VceqzqS16 (line 308) | void VceqzqS16(uint16x8_t* r, int16x8_t* v0) { *r = vceqzq_s16(*v0); }
  function VceqzqS32 (line 309) | void VceqzqS32(uint32x4_t* r, int32x4_t* v0) { *r = vceqzq_s32(*v0); }
  function VceqzqS64 (line 310) | void VceqzqS64(uint64x2_t* r, int64x2_t* v0) { *r = vceqzq_s64(*v0); }
  function VceqzqU8 (line 311) | void VceqzqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vceqzq_u8(*v0); }
  function VceqzqU16 (line 312) | void VceqzqU16(uint16x8_t* r, uint16x8_t* v0) { *r = vceqzq_u16(*v0); }
  function VceqzqU32 (line 313) | void VceqzqU32(uint32x4_t* r, uint32x4_t* v0) { *r = vceqzq_u32(*v0); }
  function VceqzqU64 (line 314) | void VceqzqU64(uint64x2_t* r, uint64x2_t* v0) { *r = vceqzq_u64(*v0); }
  function VceqzqF32 (line 315) | void VceqzqF32(uint32x4_t* r, float32x4_t* v0) { *r = vceqzq_f32(*v0); }
  function VceqzqF64 (line 316) | void VceqzqF64(uint64x2_t* r, float64x2_t* v0) { *r = vceqzq_f64(*v0); }
  function VceqzqP64 (line 317) | void VceqzqP64(uint64x2_t* r, poly64x2_t* v0) { *r = vceqzq_p64(*v0); }
  function VceqzqP8 (line 318) | void VceqzqP8(uint8x16_t* r, poly8x16_t* v0) { *r = vceqzq_p8(*v0); }
  function VceqzsF32 (line 319) | void VceqzsF32(uint32_t* r, float32_t* v0) { *r = vceqzs_f32(*v0); }
  function VcgeS8 (line 320) | void VcgeS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vcge_s8(*v0...
  function VcgeS16 (line 321) | void VcgeS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vcge_s1...
  function VcgeS32 (line 322) | void VcgeS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vcge_s3...
  function VcgeS64 (line 323) | void VcgeS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vcge_s6...
  function VcgeU8 (line 324) | void VcgeU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vcge_u8(*...
  function VcgeU16 (line 325) | void VcgeU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vcge_...
  function VcgeU32 (line 326) | void VcgeU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vcge_...
  function VcgeU64 (line 327) | void VcgeU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vcge_...
  function VcgeF32 (line 328) | void VcgeF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcg...
  function VcgeF64 (line 329) | void VcgeF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcg...
  function VcgedS64 (line 330) | void VcgedS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vcged_s64(*v...
  function VcgedU64 (line 331) | void VcgedU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vcged_u64(...
  function VcgedF64 (line 332) | void VcgedF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcged_f6...
  function VcgeqS8 (line 333) | void VcgeqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vcgeq_s...
  function VcgeqS16 (line 334) | void VcgeqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vcgeq_...
  function VcgeqS32 (line 335) | void VcgeqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vcgeq_...
  function VcgeqS64 (line 336) | void VcgeqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vcgeq_...
  function VcgeqU8 (line 337) | void VcgeqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vcgeq...
  function VcgeqU16 (line 338) | void VcgeqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vcge...
  function VcgeqU32 (line 339) | void VcgeqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vcge...
  function VcgeqU64 (line 340) | void VcgeqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vcge...
  function VcgeqF32 (line 341) | void VcgeqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vc...
  function VcgeqF64 (line 342) | void VcgeqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vc...
  function VcgesF32 (line 343) | void VcgesF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcges_f3...
  function VcgezS8 (line 344) | void VcgezS8(uint8x8_t* r, int8x8_t* v0) { *r = vcgez_s8(*v0); }
  function VcgezS16 (line 345) | void VcgezS16(uint16x4_t* r, int16x4_t* v0) { *r = vcgez_s16(*v0); }
  function VcgezS32 (line 346) | void VcgezS32(uint32x2_t* r, int32x2_t* v0) { *r = vcgez_s32(*v0); }
  function VcgezS64 (line 347) | void VcgezS64(uint64x1_t* r, int64x1_t* v0) { *r = vcgez_s64(*v0); }
  function VcgezF32 (line 348) | void VcgezF32(uint32x2_t* r, float32x2_t* v0) { *r = vcgez_f32(*v0); }
  function VcgezF64 (line 349) | void VcgezF64(uint64x1_t* r, float64x1_t* v0) { *r = vcgez_f64(*v0); }
  function VcgezdS64 (line 350) | void VcgezdS64(uint64_t* r, int64_t* v0) { *r = vcgezd_s64(*v0); }
  function VcgezdF64 (line 351) | void VcgezdF64(uint64_t* r, float64_t* v0) { *r = vcgezd_f64(*v0); }
  function VcgezqS8 (line 352) | void VcgezqS8(uint8x16_t* r, int8x16_t* v0) { *r = vcgezq_s8(*v0); }
  function VcgezqS16 (line 353) | void VcgezqS16(uint16x8_t* r, int16x8_t* v0) { *r = vcgezq_s16(*v0); }
  function VcgezqS32 (line 354) | void VcgezqS32(uint32x4_t* r, int32x4_t* v0) { *r = vcgezq_s32(*v0); }
  function VcgezqS64 (line 355) | void VcgezqS64(uint64x2_t* r, int64x2_t* v0) { *r = vcgezq_s64(*v0); }
  function VcgezqF32 (line 356) | void VcgezqF32(uint32x4_t* r, float32x4_t* v0) { *r = vcgezq_f32(*v0); }
  function VcgezqF64 (line 357) | void VcgezqF64(uint64x2_t* r, float64x2_t* v0) { *r = vcgezq_f64(*v0); }
  function VcgezsF32 (line 358) | void VcgezsF32(uint32_t* r, float32_t* v0) { *r = vcgezs_f32(*v0); }
  function VcgtS8 (line 359) | void VcgtS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vcgt_s8(*v0...
  function VcgtS16 (line 360) | void VcgtS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vcgt_s1...
  function VcgtS32 (line 361) | void VcgtS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vcgt_s3...
  function VcgtS64 (line 362) | void VcgtS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vcgt_s6...
  function VcgtU8 (line 363) | void VcgtU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vcgt_u8(*...
  function VcgtU16 (line 364) | void VcgtU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vcgt_...
  function VcgtU32 (line 365) | void VcgtU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vcgt_...
  function VcgtU64 (line 366) | void VcgtU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vcgt_...
  function VcgtF32 (line 367) | void VcgtF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcg...
  function VcgtF64 (line 368) | void VcgtF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcg...
  function VcgtdS64 (line 369) | void VcgtdS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vcgtd_s64(*v...
  function VcgtdU64 (line 370) | void VcgtdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vcgtd_u64(...
  function VcgtdF64 (line 371) | void VcgtdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcgtd_f6...
  function VcgtqS8 (line 372) | void VcgtqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vcgtq_s...
  function VcgtqS16 (line 373) | void VcgtqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vcgtq_...
  function VcgtqS32 (line 374) | void VcgtqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vcgtq_...
  function VcgtqS64 (line 375) | void VcgtqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vcgtq_...
  function VcgtqU8 (line 376) | void VcgtqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vcgtq...
  function VcgtqU16 (line 377) | void VcgtqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vcgt...
  function VcgtqU32 (line 378) | void VcgtqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vcgt...
  function VcgtqU64 (line 379) | void VcgtqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vcgt...
  function VcgtqF32 (line 380) | void VcgtqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vc...
  function VcgtqF64 (line 381) | void VcgtqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vc...
  function VcgtsF32 (line 382) | void VcgtsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcgts_f3...
  function VcgtzS8 (line 383) | void VcgtzS8(uint8x8_t* r, int8x8_t* v0) { *r = vcgtz_s8(*v0); }
  function VcgtzS16 (line 384) | void VcgtzS16(uint16x4_t* r, int16x4_t* v0) { *r = vcgtz_s16(*v0); }
  function VcgtzS32 (line 385) | void VcgtzS32(uint32x2_t* r, int32x2_t* v0) { *r = vcgtz_s32(*v0); }
  function VcgtzS64 (line 386) | void VcgtzS64(uint64x1_t* r, int64x1_t* v0) { *r = vcgtz_s64(*v0); }
  function VcgtzF32 (line 387) | void VcgtzF32(uint32x2_t* r, float32x2_t* v0) { *r = vcgtz_f32(*v0); }
  function VcgtzF64 (line 388) | void VcgtzF64(uint64x1_t* r, float64x1_t* v0) { *r = vcgtz_f64(*v0); }
  function VcgtzdS64 (line 389) | void VcgtzdS64(uint64_t* r, int64_t* v0) { *r = vcgtzd_s64(*v0); }
  function VcgtzdF64 (line 390) | void VcgtzdF64(uint64_t* r, float64_t* v0) { *r = vcgtzd_f64(*v0); }
  function VcgtzqS8 (line 391) | void VcgtzqS8(uint8x16_t* r, int8x16_t* v0) { *r = vcgtzq_s8(*v0); }
  function VcgtzqS16 (line 392) | void VcgtzqS16(uint16x8_t* r, int16x8_t* v0) { *r = vcgtzq_s16(*v0); }
  function VcgtzqS32 (line 393) | void VcgtzqS32(uint32x4_t* r, int32x4_t* v0) { *r = vcgtzq_s32(*v0); }
  function VcgtzqS64 (line 394) | void VcgtzqS64(uint64x2_t* r, int64x2_t* v0) { *r = vcgtzq_s64(*v0); }
  function VcgtzqF32 (line 395) | void VcgtzqF32(uint32x4_t* r, float32x4_t* v0) { *r = vcgtzq_f32(*v0); }
  function VcgtzqF64 (line 396) | void VcgtzqF64(uint64x2_t* r, float64x2_t* v0) { *r = vcgtzq_f64(*v0); }
  function VcgtzsF32 (line 397) | void VcgtzsF32(uint32_t* r, float32_t* v0) { *r = vcgtzs_f32(*v0); }
  function VcleS8 (line 398) | void VcleS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vcle_s8(*v0...
  function VcleS16 (line 399) | void VcleS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vcle_s1...
  function VcleS32 (line 400) | void VcleS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vcle_s3...
  function VcleS64 (line 401) | void VcleS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vcle_s6...
  function VcleU8 (line 402) | void VcleU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vcle_u8(*...
  function VcleU16 (line 403) | void VcleU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vcle_...
  function VcleU32 (line 404) | void VcleU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vcle_...
  function VcleU64 (line 405) | void VcleU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vcle_...
  function VcleF32 (line 406) | void VcleF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcl...
  function VcleF64 (line 407) | void VcleF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcl...
  function VcledS64 (line 408) | void VcledS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vcled_s64(*v...
  function VcledU64 (line 409) | void VcledU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vcled_u64(...
  function VcledF64 (line 410) | void VcledF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcled_f6...
  function VcleqS8 (line 411) | void VcleqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vcleq_s...
  function VcleqS16 (line 412) | void VcleqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vcleq_...
  function VcleqS32 (line 413) | void VcleqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vcleq_...
  function VcleqS64 (line 414) | void VcleqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vcleq_...
  function VcleqU8 (line 415) | void VcleqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vcleq...
  function VcleqU16 (line 416) | void VcleqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vcle...
  function VcleqU32 (line 417) | void VcleqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vcle...
  function VcleqU64 (line 418) | void VcleqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vcle...
  function VcleqF32 (line 419) | void VcleqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vc...
  function VcleqF64 (line 420) | void VcleqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vc...
  function VclesF32 (line 421) | void VclesF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vcles_f3...
  function VclezS8 (line 422) | void VclezS8(uint8x8_t* r, int8x8_t* v0) { *r = vclez_s8(*v0); }
  function VclezS16 (line 423) | void VclezS16(uint16x4_t* r, int16x4_t* v0) { *r = vclez_s16(*v0); }
  function VclezS32 (line 424) | void VclezS32(uint32x2_t* r, int32x2_t* v0) { *r = vclez_s32(*v0); }
  function VclezS64 (line 425) | void VclezS64(uint64x1_t* r, int64x1_t* v0) { *r = vclez_s64(*v0); }
  function VclezF32 (line 426) | void VclezF32(uint32x2_t* r, float32x2_t* v0) { *r = vclez_f32(*v0); }
  function VclezF64 (line 427) | void VclezF64(uint64x1_t* r, float64x1_t* v0) { *r = vclez_f64(*v0); }
  function VclezdS64 (line 428) | void VclezdS64(uint64_t* r, int64_t* v0) { *r = vclezd_s64(*v0); }
  function VclezdF64 (line 429) | void VclezdF64(uint64_t* r, float64_t* v0) { *r = vclezd_f64(*v0); }
  function VclezqS8 (line 430) | void VclezqS8(uint8x16_t* r, int8x16_t* v0) { *r = vclezq_s8(*v0); }
  function VclezqS16 (line 431) | void VclezqS16(uint16x8_t* r, int16x8_t* v0) { *r = vclezq_s16(*v0); }
  function VclezqS32 (line 432) | void VclezqS32(uint32x4_t* r, int32x4_t* v0) { *r = vclezq_s32(*v0); }
  function VclezqS64 (line 433) | void VclezqS64(uint64x2_t* r, int64x2_t* v0) { *r = vclezq_s64(*v0); }
  function VclezqF32 (line 434) | void VclezqF32(uint32x4_t* r, float32x4_t* v0) { *r = vclezq_f32(*v0); }
  function VclezqF64 (line 435) | void VclezqF64(uint64x2_t* r, float64x2_t* v0) { *r = vclezq_f64(*v0); }
  function VclezsF32 (line 436) | void VclezsF32(uint32_t* r, float32_t* v0) { *r = vclezs_f32(*v0); }
  function VclsS8 (line 437) | void VclsS8(int8x8_t* r, int8x8_t* v0) { *r = vcls_s8(*v0); }
  function VclsS16 (line 438) | void VclsS16(int16x4_t* r, int16x4_t* v0) { *r = vcls_s16(*v0); }
  function VclsS32 (line 439) | void VclsS32(int32x2_t* r, int32x2_t* v0) { *r = vcls_s32(*v0); }
  function VclsU8 (line 440) | void VclsU8(int8x8_t* r, uint8x8_t* v0) { *r = vcls_u8(*v0); }
  function VclsU16 (line 441) | void VclsU16(int16x4_t* r, uint16x4_t* v0) { *r = vcls_u16(*v0); }
  function VclsU32 (line 442) | void VclsU32(int32x2_t* r, uint32x2_t* v0) { *r = vcls_u32(*v0); }
  function VclsqS8 (line 443) | void VclsqS8(int8x16_t* r, int8x16_t* v0) { *r = vclsq_s8(*v0); }
  function VclsqS16 (line 444) | void VclsqS16(int16x8_t* r, int16x8_t* v0) { *r = vclsq_s16(*v0); }
  function VclsqS32 (line 445) | void VclsqS32(int32x4_t* r, int32x4_t* v0) { *r = vclsq_s32(*v0); }
  function VclsqU8 (line 446) | void VclsqU8(int8x16_t* r, uint8x16_t* v0) { *r = vclsq_u8(*v0); }
  function VclsqU16 (line 447) | void VclsqU16(int16x8_t* r, uint16x8_t* v0) { *r = vclsq_u16(*v0); }
  function VclsqU32 (line 448) | void VclsqU32(int32x4_t* r, uint32x4_t* v0) { *r = vclsq_u32(*v0); }
  function VcltS8 (line 449) | void VcltS8(uint8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vclt_s8(*v0...
  function VcltS16 (line 450) | void VcltS16(uint16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vclt_s1...
  function VcltS32 (line 451) | void VcltS32(uint32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vclt_s3...
  function VcltS64 (line 452) | void VcltS64(uint64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vclt_s6...
  function VcltU8 (line 453) | void VcltU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vclt_u8(*...
  function VcltU16 (line 454) | void VcltU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vclt_...
  function VcltU32 (line 455) | void VcltU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vclt_...
  function VcltU64 (line 456) | void VcltU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vclt_...
  function VcltF32 (line 457) | void VcltF32(uint32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vcl...
  function VcltF64 (line 458) | void VcltF64(uint64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vcl...
  function VcltdS64 (line 459) | void VcltdS64(uint64_t* r, int64_t* v0, int64_t* v1) { *r = vcltd_s64(*v...
  function VcltdU64 (line 460) | void VcltdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vcltd_u64(...
  function VcltdF64 (line 461) | void VcltdF64(uint64_t* r, float64_t* v0, float64_t* v1) { *r = vcltd_f6...
  function VcltqS8 (line 462) | void VcltqS8(uint8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vcltq_s...
  function VcltqS16 (line 463) | void VcltqS16(uint16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vcltq_...
  function VcltqS32 (line 464) | void VcltqS32(uint32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vcltq_...
  function VcltqS64 (line 465) | void VcltqS64(uint64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vcltq_...
  function VcltqU8 (line 466) | void VcltqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vcltq...
  function VcltqU16 (line 467) | void VcltqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vclt...
  function VcltqU32 (line 468) | void VcltqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vclt...
  function VcltqU64 (line 469) | void VcltqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vclt...
  function VcltqF32 (line 470) | void VcltqF32(uint32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = vc...
  function VcltqF64 (line 471) | void VcltqF64(uint64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = vc...
  function VcltsF32 (line 472) | void VcltsF32(uint32_t* r, float32_t* v0, float32_t* v1) { *r = vclts_f3...
  function VcltzS8 (line 473) | void VcltzS8(uint8x8_t* r, int8x8_t* v0) { *r = vcltz_s8(*v0); }
  function VcltzS16 (line 474) | void VcltzS16(uint16x4_t* r, int16x4_t* v0) { *r = vcltz_s16(*v0); }
  function VcltzS32 (line 475) | void VcltzS32(uint32x2_t* r, int32x2_t* v0) { *r = vcltz_s32(*v0); }
  function VcltzS64 (line 476) | void VcltzS64(uint64x1_t* r, int64x1_t* v0) { *r = vcltz_s64(*v0); }
  function VcltzF32 (line 477) | void VcltzF32(uint32x2_t* r, float32x2_t* v0) { *r = vcltz_f32(*v0); }
  function VcltzF64 (line 478) | void VcltzF64(uint64x1_t* r, float64x1_t* v0) { *r = vcltz_f64(*v0); }
  function VcltzdS64 (line 479) | void VcltzdS64(uint64_t* r, int64_t* v0) { *r = vcltzd_s64(*v0); }
  function VcltzdF64 (line 480) | void VcltzdF64(uint64_t* r, float64_t* v0) { *r = vcltzd_f64(*v0); }
  function VcltzqS8 (line 481) | void VcltzqS8(uint8x16_t* r, int8x16_t* v0) { *r = vcltzq_s8(*v0); }
  function VcltzqS16 (line 482) | void VcltzqS16(uint16x8_t* r, int16x8_t* v0) { *r = vcltzq_s16(*v0); }
  function VcltzqS32 (line 483) | void VcltzqS32(uint32x4_t* r, int32x4_t* v0) { *r = vcltzq_s32(*v0); }
  function VcltzqS64 (line 484) | void VcltzqS64(uint64x2_t* r, int64x2_t* v0) { *r = vcltzq_s64(*v0); }
  function VcltzqF32 (line 485) | void VcltzqF32(uint32x4_t* r, float32x4_t* v0) { *r = vcltzq_f32(*v0); }
  function VcltzqF64 (line 486) | void VcltzqF64(uint64x2_t* r, float64x2_t* v0) { *r = vcltzq_f64(*v0); }
  function VcltzsF32 (line 487) | void VcltzsF32(uint32_t* r, float32_t* v0) { *r = vcltzs_f32(*v0); }
  function VclzS8 (line 488) | void VclzS8(int8x8_t* r, int8x8_t* v0) { *r = vclz_s8(*v0); }
  function VclzS16 (line 489) | void VclzS16(int16x4_t* r, int16x4_t* v0) { *r = vclz_s16(*v0); }
  function VclzS32 (line 490) | void VclzS32(int32x2_t* r, int32x2_t* v0) { *r = vclz_s32(*v0); }
  function VclzU8 (line 491) | void VclzU8(uint8x8_t* r, uint8x8_t* v0) { *r = vclz_u8(*v0); }
  function VclzU16 (line 492) | void VclzU16(uint16x4_t* r, uint16x4_t* v0) { *r = vclz_u16(*v0); }
  function VclzU32 (line 493) | void VclzU32(uint32x2_t* r, uint32x2_t* v0) { *r = vclz_u32(*v0); }
  function VclzqS8 (line 494) | void VclzqS8(int8x16_t* r, int8x16_t* v0) { *r = vclzq_s8(*v0); }
  function VclzqS16 (line 495) | void VclzqS16(int16x8_t* r, int16x8_t* v0) { *r = vclzq_s16(*v0); }
  function VclzqS32 (line 496) | void VclzqS32(int32x4_t* r, int32x4_t* v0) { *r = vclzq_s32(*v0); }
  function VclzqU8 (line 497) | void VclzqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vclzq_u8(*v0); }
  function VclzqU16 (line 498) | void VclzqU16(uint16x8_t* r, uint16x8_t* v0) { *r = vclzq_u16(*v0); }
  function VclzqU32 (line 499) | void VclzqU32(uint32x4_t* r, uint32x4_t* v0) { *r = vclzq_u32(*v0); }
  function VcntS8 (line 500) | void VcntS8(int8x8_t* r, int8x8_t* v0) { *r = vcnt_s8(*v0); }
  function VcntU8 (line 501) | void VcntU8(uint8x8_t* r, uint8x8_t* v0) { *r = vcnt_u8(*v0); }
  function VcntP8 (line 502) | void VcntP8(poly8x8_t* r, poly8x8_t* v0) { *r = vcnt_p8(*v0); }
  function VcntqS8 (line 503) | void VcntqS8(int8x16_t* r, int8x16_t* v0) { *r = vcntq_s8(*v0); }
  function VcntqU8 (line 504) | void VcntqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vcntq_u8(*v0); }
  function VcntqP8 (line 505) | void VcntqP8(poly8x16_t* r, poly8x16_t* v0) { *r = vcntq_p8(*v0); }
  function VcombineS8 (line 506) | void VcombineS8(int8x16_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vcombin...
  function VcombineS16 (line 507) | void VcombineS16(int16x8_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vcom...
  function VcombineS32 (line 508) | void VcombineS32(int32x4_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vcom...
  function VcombineS64 (line 509) | void VcombineS64(int64x2_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vcom...
  function VcombineU8 (line 510) | void VcombineU8(uint8x16_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vcom...
  function VcombineU16 (line 511) | void VcombineU16(uint16x8_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = v...
  function VcombineU32 (line 512) | void VcombineU32(uint32x4_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = v...
  function VcombineU64 (line 513) | void VcombineU64(uint64x2_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = v...
  function VcombineF32 (line 514) | void VcombineF32(float32x4_t* r, float32x2_t* v0, float32x2_t* v1) { *r ...
  function VcombineF64 (line 515) | void VcombineF64(float64x2_t* r, float64x1_t* v0, float64x1_t* v1) { *r ...
  function VcombineP16 (line 516) | void VcombineP16(poly16x8_t* r, poly16x4_t* v0, poly16x4_t* v1) { *r = v...
  function VcombineP64 (line 517) | void VcombineP64(poly64x2_t* r, poly64x1_t* v0, poly64x1_t* v1) { *r = v...
  function VcombineP8 (line 518) | void VcombineP8(poly8x16_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vcom...
  function VcvtF32S32 (line 519) | void VcvtF32S32(float32x2_t* r, int32x2_t* v0) { *r = vcvt_f32_s32(*v0); }
  function VcvtF32U32 (line 520) | void VcvtF32U32(float32x2_t* r, uint32x2_t* v0) { *r = vcvt_f32_u32(*v0); }
  function VcvtF32F64 (line 521) | void VcvtF32F64(float32x2_t* r, float64x2_t* v0) { *r = vcvt_f32_f64(*v0...
  function VcvtF64S64 (line 522) | void VcvtF64S64(float64x1_t* r, int64x1_t* v0) { *r = vcvt_f64_s64(*v0); }
  function VcvtF64U64 (line 523) | void VcvtF64U64(float64x1_t* r, uint64x1_t* v0) { *r = vcvt_f64_u64(*v0); }
  function VcvtF64F32 (line 524) | void VcvtF64F32(float64x2_t* r, float32x2_t* v0) { *r = vcvt_f64_f32(*v0...
  function VcvtHighF32F64 (line 525) | void VcvtHighF32F64(float32x4_t* r, float32x2_t* v0, float64x2_t* v1) { ...
  function VcvtHighF64F32 (line 526) | void VcvtHighF64F32(float64x2_t* r, float32x4_t* v0) { *r = vcvt_high_f6...
  function VcvtS32F32 (line 527) | void VcvtS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvt_s32_f32(*v0); }
  function VcvtS64F64 (line 528) | void VcvtS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvt_s64_f64(*v0); }
  function VcvtU32F32 (line 529) | void VcvtU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvt_u32_f32(*v0); }
  function VcvtU64F64 (line 530) | void VcvtU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvt_u64_f64(*v0); }
  function VcvtaS32F32 (line 531) | void VcvtaS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvta_s32_f32(*v0...
  function VcvtaS64F64 (line 532) | void VcvtaS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvta_s64_f64(*v0...
  function VcvtaU32F32 (line 533) | void VcvtaU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvta_u32_f32(*v...
  function VcvtaU64F64 (line 534) | void VcvtaU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvta_u64_f64(*v...
  function VcvtadS64F64 (line 535) | void VcvtadS64F64(int64_t* r, float64_t* v0) { *r = vcvtad_s64_f64(*v0); }
  function VcvtadU64F64 (line 536) | void VcvtadU64F64(uint64_t* r, float64_t* v0) { *r = vcvtad_u64_f64(*v0); }
  function VcvtaqS32F32 (line 537) | void VcvtaqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtaq_s32_f32(*...
  function VcvtaqS64F64 (line 538) | void VcvtaqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtaq_s64_f64(*...
  function VcvtaqU32F32 (line 539) | void VcvtaqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtaq_u32_f32(...
  function VcvtaqU64F64 (line 540) | void VcvtaqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtaq_u64_f64(...
  function VcvtasS32F32 (line 541) | void VcvtasS32F32(int32_t* r, float32_t* v0) { *r = vcvtas_s32_f32(*v0); }
  function VcvtasU32F32 (line 542) | void VcvtasU32F32(uint32_t* r, float32_t* v0) { *r = vcvtas_u32_f32(*v0); }
  function VcvtdF64S64 (line 543) | void VcvtdF64S64(float64_t* r, int64_t* v0) { *r = vcvtd_f64_s64(*v0); }
  function VcvtdF64U64 (line 544) | void VcvtdF64U64(float64_t* r, uint64_t* v0) { *r = vcvtd_f64_u64(*v0); }
  function VcvtdS64F64 (line 545) | void VcvtdS64F64(int64_t* r, float64_t* v0) { *r = vcvtd_s64_f64(*v0); }
  function VcvtdU64F64 (line 546) | void VcvtdU64F64(uint64_t* r, float64_t* v0) { *r = vcvtd_u64_f64(*v0); }
  function VcvtmS32F32 (line 547) | void VcvtmS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvtm_s32_f32(*v0...
  function VcvtmS64F64 (line 548) | void VcvtmS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvtm_s64_f64(*v0...
  function VcvtmU32F32 (line 549) | void VcvtmU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvtm_u32_f32(*v...
  function VcvtmU64F64 (line 550) | void VcvtmU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvtm_u64_f64(*v...
  function VcvtmdS64F64 (line 551) | void VcvtmdS64F64(int64_t* r, float64_t* v0) { *r = vcvtmd_s64_f64(*v0); }
  function VcvtmdU64F64 (line 552) | void VcvtmdU64F64(uint64_t* r, float64_t* v0) { *r = vcvtmd_u64_f64(*v0); }
  function VcvtmqS32F32 (line 553) | void VcvtmqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtmq_s32_f32(*...
  function VcvtmqS64F64 (line 554) | void VcvtmqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtmq_s64_f64(*...
  function VcvtmqU32F32 (line 555) | void VcvtmqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtmq_u32_f32(...
  function VcvtmqU64F64 (line 556) | void VcvtmqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtmq_u64_f64(...
  function VcvtmsS32F32 (line 557) | void VcvtmsS32F32(int32_t* r, float32_t* v0) { *r = vcvtms_s32_f32(*v0); }
  function VcvtmsU32F32 (line 558) | void VcvtmsU32F32(uint32_t* r, float32_t* v0) { *r = vcvtms_u32_f32(*v0); }
  function VcvtnS32F32 (line 559) | void VcvtnS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvtn_s32_f32(*v0...
  function VcvtnS64F64 (line 560) | void VcvtnS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvtn_s64_f64(*v0...
  function VcvtnU32F32 (line 561) | void VcvtnU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvtn_u32_f32(*v...
  function VcvtnU64F64 (line 562) | void VcvtnU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvtn_u64_f64(*v...
  function VcvtndS64F64 (line 563) | void VcvtndS64F64(int64_t* r, float64_t* v0) { *r = vcvtnd_s64_f64(*v0); }
  function VcvtndU64F64 (line 564) | void VcvtndU64F64(uint64_t* r, float64_t* v0) { *r = vcvtnd_u64_f64(*v0); }
  function VcvtnqS32F32 (line 565) | void VcvtnqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtnq_s32_f32(*...
  function VcvtnqS64F64 (line 566) | void VcvtnqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtnq_s64_f64(*...
  function VcvtnqU32F32 (line 567) | void VcvtnqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtnq_u32_f32(...
  function VcvtnqU64F64 (line 568) | void VcvtnqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtnq_u64_f64(...
  function VcvtnsS32F32 (line 569) | void VcvtnsS32F32(int32_t* r, float32_t* v0) { *r = vcvtns_s32_f32(*v0); }
  function VcvtnsU32F32 (line 570) | void VcvtnsU32F32(uint32_t* r, float32_t* v0) { *r = vcvtns_u32_f32(*v0); }
  function VcvtpS32F32 (line 571) | void VcvtpS32F32(int32x2_t* r, float32x2_t* v0) { *r = vcvtp_s32_f32(*v0...
  function VcvtpS64F64 (line 572) | void VcvtpS64F64(int64x1_t* r, float64x1_t* v0) { *r = vcvtp_s64_f64(*v0...
  function VcvtpU32F32 (line 573) | void VcvtpU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vcvtp_u32_f32(*v...
  function VcvtpU64F64 (line 574) | void VcvtpU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vcvtp_u64_f64(*v...
  function VcvtpdS64F64 (line 575) | void VcvtpdS64F64(int64_t* r, float64_t* v0) { *r = vcvtpd_s64_f64(*v0); }
  function VcvtpdU64F64 (line 576) | void VcvtpdU64F64(uint64_t* r, float64_t* v0) { *r = vcvtpd_u64_f64(*v0); }
  function VcvtpqS32F32 (line 577) | void VcvtpqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtpq_s32_f32(*...
  function VcvtpqS64F64 (line 578) | void VcvtpqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtpq_s64_f64(*...
  function VcvtpqU32F32 (line 579) | void VcvtpqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtpq_u32_f32(...
  function VcvtpqU64F64 (line 580) | void VcvtpqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtpq_u64_f64(...
  function VcvtpsS32F32 (line 581) | void VcvtpsS32F32(int32_t* r, float32_t* v0) { *r = vcvtps_s32_f32(*v0); }
  function VcvtpsU32F32 (line 582) | void VcvtpsU32F32(uint32_t* r, float32_t* v0) { *r = vcvtps_u32_f32(*v0); }
  function VcvtqF32S32 (line 583) | void VcvtqF32S32(float32x4_t* r, int32x4_t* v0) { *r = vcvtq_f32_s32(*v0...
  function VcvtqF32U32 (line 584) | void VcvtqF32U32(float32x4_t* r, uint32x4_t* v0) { *r = vcvtq_f32_u32(*v...
  function VcvtqF64S64 (line 585) | void VcvtqF64S64(float64x2_t* r, int64x2_t* v0) { *r = vcvtq_f64_s64(*v0...
  function VcvtqF64U64 (line 586) | void VcvtqF64U64(float64x2_t* r, uint64x2_t* v0) { *r = vcvtq_f64_u64(*v...
  function VcvtqS32F32 (line 587) | void VcvtqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vcvtq_s32_f32(*v0...
  function VcvtqS64F64 (line 588) | void VcvtqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vcvtq_s64_f64(*v0...
  function VcvtqU32F32 (line 589) | void VcvtqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vcvtq_u32_f32(*v...
  function VcvtqU64F64 (line 590) | void VcvtqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vcvtq_u64_f64(*v...
  function VcvtsF32S32 (line 591) | void VcvtsF32S32(float32_t* r, int32_t* v0) { *r = vcvts_f32_s32(*v0); }
  function VcvtsF32U32 (line 592) | void VcvtsF32U32(float32_t* r, uint32_t* v0) { *r = vcvts_f32_u32(*v0); }
  function VcvtsS32F32 (line 593) | void VcvtsS32F32(int32_t* r, float32_t* v0) { *r = vcvts_s32_f32(*v0); }
  function VcvtsU32F32 (line 594) | void VcvtsU32F32(uint32_t* r, float32_t* v0) { *r = vcvts_u32_f32(*v0); }
  function VcvtxF32F64 (line 595) | void VcvtxF32F64(float32x2_t* r, float64x2_t* v0) { *r = vcvtx_f32_f64(*...
  function VcvtxHighF32F64 (line 596) | void VcvtxHighF32F64(float32x4_t* r, float32x2_t* v0, float64x2_t* v1) {...
  function VcvtxdF32F64 (line 597) | void VcvtxdF32F64(float32_t* r, float64_t* v0) { *r = vcvtxd_f32_f64(*v0...
  function VdivF32 (line 598) | void VdivF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vd...
  function VdivF64 (line 599) | void VdivF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vd...
  function VdivqF32 (line 600) | void VdivqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VdivqF64 (line 601) | void VdivqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VdotS32 (line 602) | void VdotS32(int32x2_t* r, int32x2_t* v0, int8x8_t* v1, int8x8_t* v2) { ...
  function VdotU32 (line 603) | void VdotU32(uint32x2_t* r, uint32x2_t* v0, uint8x8_t* v1, uint8x8_t* v2...
  function VdotqS32 (line 604) | void VdotqS32(int32x4_t* r, int32x4_t* v0, int8x16_t* v1, int8x16_t* v2)...
  function VdotqU32 (line 605) | void VdotqU32(uint32x4_t* r, uint32x4_t* v0, uint8x16_t* v1, uint8x16_t*...
  function VdupNS8 (line 606) | void VdupNS8(int8x8_t* r, int8_t* v0) { *r = vdup_n_s8(*v0); }
  function VdupNS16 (line 607) | void VdupNS16(int16x4_t* r, int16_t* v0) { *r = vdup_n_s16(*v0); }
  function VdupNS32 (line 608) | void VdupNS32(int32x2_t* r, int32_t* v0) { *r = vdup_n_s32(*v0); }
  function VdupNS64 (line 609) | void VdupNS64(int64x1_t* r, int64_t* v0) { *r = vdup_n_s64(*v0); }
  function VdupNU8 (line 610) | void VdupNU8(uint8x8_t* r, uint8_t* v0) { *r = vdup_n_u8(*v0); }
  function VdupNU16 (line 611) | void VdupNU16(uint16x4_t* r, uint16_t* v0) { *r = vdup_n_u16(*v0); }
  function VdupNU32 (line 612) | void VdupNU32(uint32x2_t* r, uint32_t* v0) { *r = vdup_n_u32(*v0); }
  function VdupNU64 (line 613) | void VdupNU64(uint64x1_t* r, uint64_t* v0) { *r = vdup_n_u64(*v0); }
  function VdupNF32 (line 614) | void VdupNF32(float32x2_t* r, float32_t* v0) { *r = vdup_n_f32(*v0); }
  function VdupNF64 (line 615) | void VdupNF64(float64x1_t* r, float64_t* v0) { *r = vdup_n_f64(*v0); }
  function VdupNP16 (line 616) | void VdupNP16(poly16x4_t* r, poly16_t* v0) { *r = vdup_n_p16(*v0); }
  function VdupNP64 (line 617) | void VdupNP64(poly64x1_t* r, poly64_t* v0) { *r = vdup_n_p64(*v0); }
  function VdupNP8 (line 618) | void VdupNP8(poly8x8_t* r, poly8_t* v0) { *r = vdup_n_p8(*v0); }
  function VdupqNS8 (line 619) | void VdupqNS8(int8x16_t* r, int8_t* v0) { *r = vdupq_n_s8(*v0); }
  function VdupqNS16 (line 620) | void VdupqNS16(int16x8_t* r, int16_t* v0) { *r = vdupq_n_s16(*v0); }
  function VdupqNS32 (line 621) | void VdupqNS32(int32x4_t* r, int32_t* v0) { *r = vdupq_n_s32(*v0); }
  function VdupqNS64 (line 622) | void VdupqNS64(int64x2_t* r, int64_t* v0) { *r = vdupq_n_s64(*v0); }
  function VdupqNU8 (line 623) | void VdupqNU8(uint8x16_t* r, uint8_t* v0) { *r = vdupq_n_u8(*v0); }
  function VdupqNU16 (line 624) | void VdupqNU16(uint16x8_t* r, uint16_t* v0) { *r = vdupq_n_u16(*v0); }
  function VdupqNU32 (line 625) | void VdupqNU32(uint32x4_t* r, uint32_t* v0) { *r = vdupq_n_u32(*v0); }
  function VdupqNU64 (line 626) | void VdupqNU64(uint64x2_t* r, uint64_t* v0) { *r = vdupq_n_u64(*v0); }
  function VdupqNF32 (line 627) | void VdupqNF32(float32x4_t* r, float32_t* v0) { *r = vdupq_n_f32(*v0); }
  function VdupqNF64 (line 628) | void VdupqNF64(float64x2_t* r, float64_t* v0) { *r = vdupq_n_f64(*v0); }
  function VdupqNP16 (line 629) | void VdupqNP16(poly16x8_t* r, poly16_t* v0) { *r = vdupq_n_p16(*v0); }
  function VdupqNP64 (line 630) | void VdupqNP64(poly64x2_t* r, poly64_t* v0) { *r = vdupq_n_p64(*v0); }
  function VdupqNP8 (line 631) | void VdupqNP8(poly8x16_t* r, poly8_t* v0) { *r = vdupq_n_p8(*v0); }
  function VeorS8 (line 632) | void VeorS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = veor_s8(*v0,...
  function VeorS16 (line 633) | void VeorS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = veor_s16...
  function VeorS32 (line 634) | void VeorS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = veor_s32...
  function VeorS64 (line 635) | void VeorS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = veor_s64...
  function VeorU8 (line 636) | void VeorU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = veor_u8(*...
  function VeorU16 (line 637) | void VeorU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = veor_...
  function VeorU32 (line 638) | void VeorU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = veor_...
  function VeorU64 (line 639) | void VeorU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = veor_...
  function Veor3QS8 (line 640) | void Veor3QS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2)...
  function Veor3QS16 (line 641) | void Veor3QS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2...
  function Veor3QS32 (line 642) | void Veor3QS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2...
  function Veor3QS64 (line 643) | void Veor3QS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1, int64x2_t* v2...
  function Veor3QU8 (line 644) | void Veor3QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t*...
  function Veor3QU16 (line 645) | void Veor3QU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t...
  function Veor3QU32 (line 646) | void Veor3QU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t...
  function Veor3QU64 (line 647) | void Veor3QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1, uint64x2_t...
  function VeorqS8 (line 648) | void VeorqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = veorq_s8...
  function VeorqS16 (line 649) | void VeorqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = veorq_s...
  function VeorqS32 (line 650) | void VeorqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = veorq_s...
  function VeorqS64 (line 651) | void VeorqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = veorq_s...
  function VeorqU8 (line 652) | void VeorqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = veorq...
  function VeorqU16 (line 653) | void VeorqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = veor...
  function VeorqU32 (line 654) | void VeorqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = veor...
  function VeorqU64 (line 655) | void VeorqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = veor...
  function VfmaF32 (line 656) | void VfmaF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32x2...
  function VfmaF64 (line 657) | void VfmaF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64x1...
  function VfmaNF32 (line 658) | void VfmaNF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32_...
  function VfmaNF64 (line 659) | void VfmaNF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64_...
  function VfmaqF32 (line 660) | void VfmaqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32x...
  function VfmaqF64 (line 661) | void VfmaqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64x...
  function VfmaqNF32 (line 662) | void VfmaqNF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32...
  function VfmaqNF64 (line 663) | void VfmaqNF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64...
  function VfmsF32 (line 664) | void VfmsF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32x2...
  function VfmsF64 (line 665) | void VfmsF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64x1...
  function VfmsNF32 (line 666) | void VfmsNF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32_...
  function VfmsNF64 (line 667) | void VfmsNF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64_...
  function VfmsqF32 (line 668) | void VfmsqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32x...
  function VfmsqF64 (line 669) | void VfmsqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64x...
  function VfmsqNF32 (line 670) | void VfmsqNF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32...
  function VfmsqNF64 (line 671) | void VfmsqNF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64...
  function VgetHighS8 (line 672) | void VgetHighS8(int8x8_t* r, int8x16_t* v0) { *r = vget_high_s8(*v0); }
  function VgetHighS16 (line 673) | void VgetHighS16(int16x4_t* r, int16x8_t* v0) { *r = vget_high_s16(*v0); }
  function VgetHighS32 (line 674) | void VgetHighS32(int32x2_t* r, int32x4_t* v0) { *r = vget_high_s32(*v0); }
  function VgetHighS64 (line 675) | void VgetHighS64(int64x1_t* r, int64x2_t* v0) { *r = vget_high_s64(*v0); }
  function VgetHighU8 (line 676) | void VgetHighU8(uint8x8_t* r, uint8x16_t* v0) { *r = vget_high_u8(*v0); }
  function VgetHighU16 (line 677) | void VgetHighU16(uint16x4_t* r, uint16x8_t* v0) { *r = vget_high_u16(*v0...
  function VgetHighU32 (line 678) | void VgetHighU32(uint32x2_t* r, uint32x4_t* v0) { *r = vget_high_u32(*v0...
  function VgetHighU64 (line 679) | void VgetHighU64(uint64x1_t* r, uint64x2_t* v0) { *r = vget_high_u64(*v0...
  function VgetHighF32 (line 680) | void VgetHighF32(float32x2_t* r, float32x4_t* v0) { *r = vget_high_f32(*...
  function VgetHighF64 (line 681) | void VgetHighF64(float64x1_t* r, float64x2_t* v0) { *r = vget_high_f64(*...
  function VgetHighP16 (line 682) | void VgetHighP16(poly16x4_t* r, poly16x8_t* v0) { *r = vget_high_p16(*v0...
  function VgetHighP64 (line 683) | void VgetHighP64(poly64x1_t* r, poly64x2_t* v0) { *r = vget_high_p64(*v0...
  function VgetHighP8 (line 684) | void VgetHighP8(poly8x8_t* r, poly8x16_t* v0) { *r = vget_high_p8(*v0); }
  function VgetLowS8 (line 685) | void VgetLowS8(int8x8_t* r, int8x16_t* v0) { *r = vget_low_s8(*v0); }
  function VgetLowS16 (line 686) | void VgetLowS16(int16x4_t* r, int16x8_t* v0) { *r = vget_low_s16(*v0); }
  function VgetLowS32 (line 687) | void VgetLowS32(int32x2_t* r, int32x4_t* v0) { *r = vget_low_s32(*v0); }
  function VgetLowS64 (line 688) | void VgetLowS64(int64x1_t* r, int64x2_t* v0) { *r = vget_low_s64(*v0); }
  function VgetLowU8 (line 689) | void VgetLowU8(uint8x8_t* r, uint8x16_t* v0) { *r = vget_low_u8(*v0); }
  function VgetLowU16 (line 690) | void VgetLowU16(uint16x4_t* r, uint16x8_t* v0) { *r = vget_low_u16(*v0); }
  function VgetLowU32 (line 691) | void VgetLowU32(uint32x2_t* r, uint32x4_t* v0) { *r = vget_low_u32(*v0); }
  function VgetLowU64 (line 692) | void VgetLowU64(uint64x1_t* r, uint64x2_t* v0) { *r = vget_low_u64(*v0); }
  function VgetLowF32 (line 693) | void VgetLowF32(float32x2_t* r, float32x4_t* v0) { *r = vget_low_f32(*v0...
  function VgetLowF64 (line 694) | void VgetLowF64(float64x1_t* r, float64x2_t* v0) { *r = vget_low_f64(*v0...
  function VgetLowP16 (line 695) | void VgetLowP16(poly16x4_t* r, poly16x8_t* v0) { *r = vget_low_p16(*v0); }
  function VgetLowP64 (line 696) | void VgetLowP64(poly64x1_t* r, poly64x2_t* v0) { *r = vget_low_p64(*v0); }
  function VgetLowP8 (line 697) | void VgetLowP8(poly8x8_t* r, poly8x16_t* v0) { *r = vget_low_p8(*v0); }
  function VhaddS8 (line 698) | void VhaddS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vhadd_s8(*v...
  function VhaddS16 (line 699) | void VhaddS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vhadd_s...
  function VhaddS32 (line 700) | void VhaddS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vhadd_s...
  function VhaddU8 (line 701) | void VhaddU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vhadd_u8...
  function VhaddU16 (line 702) | void VhaddU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vhad...
  function VhaddU32 (line 703) | void VhaddU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vhad...
  function VhaddqS8 (line 704) | void VhaddqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vhaddq_...
  function VhaddqS16 (line 705) | void VhaddqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vhaddq...
  function VhaddqS32 (line 706) | void VhaddqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vhaddq...
  function VhaddqU8 (line 707) | void VhaddqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vhad...
  function VhaddqU16 (line 708) | void VhaddqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vha...
  function VhaddqU32 (line 709) | void VhaddqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vha...
  function VhsubS8 (line 710) | void VhsubS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vhsub_s8(*v...
  function VhsubS16 (line 711) | void VhsubS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vhsub_s...
  function VhsubS32 (line 712) | void VhsubS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vhsub_s...
  function VhsubU8 (line 713) | void VhsubU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vhsub_u8...
  function VhsubU16 (line 714) | void VhsubU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vhsu...
  function VhsubU32 (line 715) | void VhsubU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vhsu...
  function VhsubqS8 (line 716) | void VhsubqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vhsubq_...
  function VhsubqS16 (line 717) | void VhsubqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vhsubq...
  function VhsubqS32 (line 718) | void VhsubqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vhsubq...
  function VhsubqU8 (line 719) | void VhsubqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vhsu...
  function VhsubqU16 (line 720) | void VhsubqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vhs...
  function VhsubqU32 (line 721) | void VhsubqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vhs...
  function VmaxS8 (line 722) | void VmaxS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vmax_s8(*v0,...
  function VmaxS16 (line 723) | void VmaxS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vmax_s16...
  function VmaxS32 (line 724) | void VmaxS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vmax_s32...
  function VmaxU8 (line 725) | void VmaxU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vmax_u8(*...
  function VmaxU16 (line 726) | void VmaxU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vmax_...
  function VmaxU32 (line 727) | void VmaxU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vmax_...
  function VmaxF32 (line 728) | void VmaxF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vm...
  function VmaxF64 (line 729) | void VmaxF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vm...
  function VmaxnmF32 (line 730) | void VmaxnmF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = ...
  function VmaxnmF64 (line 731) | void VmaxnmF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = ...
  function VmaxnmqF32 (line 732) | void VmaxnmqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r =...
  function VmaxnmqF64 (line 733) | void VmaxnmqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r =...
  function VmaxnmvF32 (line 734) | void VmaxnmvF32(float32_t* r, float32x2_t* v0) { *r = vmaxnmv_f32(*v0); }
  function VmaxnmvqF32 (line 735) | void VmaxnmvqF32(float32_t* r, float32x4_t* v0) { *r = vmaxnmvq_f32(*v0); }
  function VmaxnmvqF64 (line 736) | void VmaxnmvqF64(float64_t* r, float64x2_t* v0) { *r = vmaxnmvq_f64(*v0); }
  function VmaxqS8 (line 737) | void VmaxqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vmaxq_s8...
  function VmaxqS16 (line 738) | void VmaxqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vmaxq_s...
  function VmaxqS32 (line 739) | void VmaxqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vmaxq_s...
  function VmaxqU8 (line 740) | void VmaxqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vmaxq...
  function VmaxqU16 (line 741) | void VmaxqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vmax...
  function VmaxqU32 (line 742) | void VmaxqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vmax...
  function VmaxqF32 (line 743) | void VmaxqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VmaxqF64 (line 744) | void VmaxqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VmaxvS8 (line 745) | void VmaxvS8(int8_t* r, int8x8_t* v0) { *r = vmaxv_s8(*v0); }
  function VmaxvS16 (line 746) | void VmaxvS16(int16_t* r, int16x4_t* v0) { *r = vmaxv_s16(*v0); }
  function VmaxvS32 (line 747) | void VmaxvS32(int32_t* r, int32x2_t* v0) { *r = vmaxv_s32(*v0); }
  function VmaxvU8 (line 748) | void VmaxvU8(uint8_t* r, uint8x8_t* v0) { *r = vmaxv_u8(*v0); }
  function VmaxvU16 (line 749) | void VmaxvU16(uint16_t* r, uint16x4_t* v0) { *r = vmaxv_u16(*v0); }
  function VmaxvU32 (line 750) | void VmaxvU32(uint32_t* r, uint32x2_t* v0) { *r = vmaxv_u32(*v0); }
  function VmaxvF32 (line 751) | void VmaxvF32(float32_t* r, float32x2_t* v0) { *r = vmaxv_f32(*v0); }
  function VmaxvqS8 (line 752) | void VmaxvqS8(int8_t* r, int8x16_t* v0) { *r = vmaxvq_s8(*v0); }
  function VmaxvqS16 (line 753) | void VmaxvqS16(int16_t* r, int16x8_t* v0) { *r = vmaxvq_s16(*v0); }
  function VmaxvqS32 (line 754) | void VmaxvqS32(int32_t* r, int32x4_t* v0) { *r = vmaxvq_s32(*v0); }
  function VmaxvqU8 (line 755) | void VmaxvqU8(uint8_t* r, uint8x16_t* v0) { *r = vmaxvq_u8(*v0); }
  function VmaxvqU16 (line 756) | void VmaxvqU16(uint16_t* r, uint16x8_t* v0) { *r = vmaxvq_u16(*v0); }
  function VmaxvqU32 (line 757) | void VmaxvqU32(uint32_t* r, uint32x4_t* v0) { *r = vmaxvq_u32(*v0); }
  function VmaxvqF32 (line 758) | void VmaxvqF32(float32_t* r, float32x4_t* v0) { *r = vmaxvq_f32(*v0); }
  function VmaxvqF64 (line 759) | void VmaxvqF64(float64_t* r, float64x2_t* v0) { *r = vmaxvq_f64(*v0); }
  function VminS8 (line 760) | void VminS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vmin_s8(*v0,...
  function VminS16 (line 761) | void VminS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vmin_s16...
  function VminS32 (line 762) | void VminS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vmin_s32...
  function VminU8 (line 763) | void VminU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vmin_u8(*...
  function VminU16 (line 764) | void VminU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vmin_...
  function VminU32 (line 765) | void VminU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vmin_...
  function VminF32 (line 766) | void VminF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vm...
  function VminF64 (line 767) | void VminF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vm...
  function VminnmF32 (line 768) | void VminnmF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = ...
  function VminnmF64 (line 769) | void VminnmF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = ...
  function VminnmqF32 (line 770) | void VminnmqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r =...
  function VminnmqF64 (line 771) | void VminnmqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r =...
  function VminnmvF32 (line 772) | void VminnmvF32(float32_t* r, float32x2_t* v0) { *r = vminnmv_f32(*v0); }
  function VminnmvqF32 (line 773) | void VminnmvqF32(float32_t* r, float32x4_t* v0) { *r = vminnmvq_f32(*v0); }
  function VminnmvqF64 (line 774) | void VminnmvqF64(float64_t* r, float64x2_t* v0) { *r = vminnmvq_f64(*v0); }
  function VminqS8 (line 775) | void VminqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vminq_s8...
  function VminqS16 (line 776) | void VminqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vminq_s...
  function VminqS32 (line 777) | void VminqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vminq_s...
  function VminqU8 (line 778) | void VminqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vminq...
  function VminqU16 (line 779) | void VminqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vmin...
  function VminqU32 (line 780) | void VminqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vmin...
  function VminqF32 (line 781) | void VminqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VminqF64 (line 782) | void VminqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VminvS8 (line 783) | void VminvS8(int8_t* r, int8x8_t* v0) { *r = vminv_s8(*v0); }
  function VminvS16 (line 784) | void VminvS16(int16_t* r, int16x4_t* v0) { *r = vminv_s16(*v0); }
  function VminvS32 (line 785) | void VminvS32(int32_t* r, int32x2_t* v0) { *r = vminv_s32(*v0); }
  function VminvU8 (line 786) | void VminvU8(uint8_t* r, uint8x8_t* v0) { *r = vminv_u8(*v0); }
  function VminvU16 (line 787) | void VminvU16(uint16_t* r, uint16x4_t* v0) { *r = vminv_u16(*v0); }
  function VminvU32 (line 788) | void VminvU32(uint32_t* r, uint32x2_t* v0) { *r = vminv_u32(*v0); }
  function VminvF32 (line 789) | void VminvF32(float32_t* r, float32x2_t* v0) { *r = vminv_f32(*v0); }
  function VminvqS8 (line 790) | void VminvqS8(int8_t* r, int8x16_t* v0) { *r = vminvq_s8(*v0); }
  function VminvqS16 (line 791) | void VminvqS16(int16_t* r, int16x8_t* v0) { *r = vminvq_s16(*v0); }
  function VminvqS32 (line 792) | void VminvqS32(int32_t* r, int32x4_t* v0) { *r = vminvq_s32(*v0); }
  function VminvqU8 (line 793) | void VminvqU8(uint8_t* r, uint8x16_t* v0) { *r = vminvq_u8(*v0); }
  function VminvqU16 (line 794) | void VminvqU16(uint16_t* r, uint16x8_t* v0) { *r = vminvq_u16(*v0); }
  function VminvqU32 (line 795) | void VminvqU32(uint32_t* r, uint32x4_t* v0) { *r = vminvq_u32(*v0); }
  function VminvqF32 (line 796) | void VminvqF32(float32_t* r, float32x4_t* v0) { *r = vminvq_f32(*v0); }
  function VminvqF64 (line 797) | void VminvqF64(float64_t* r, float64x2_t* v0) { *r = vminvq_f64(*v0); }
  function VmlaS8 (line 798) | void VmlaS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r ...
  function VmlaS16 (line 799) | void VmlaS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* v2) ...
  function VmlaS32 (line 800) | void VmlaS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* v2) ...
  function VmlaU8 (line 801) | void VmlaU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) {...
  function VmlaU16 (line 802) | void VmlaU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16x4_t* ...
  function VmlaU32 (line 803) | void VmlaU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32x2_t* ...
  function VmlaF32 (line 804) | void VmlaF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32x2...
  function VmlaF64 (line 805) | void VmlaF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64x1...
  function VmlaNS16 (line 806) | void VmlaNS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16_t* v2) {...
  function VmlaNS32 (line 807) | void VmlaNS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32_t* v2) {...
  function VmlaNU16 (line 808) | void VmlaNU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16_t* v...
  function VmlaNU32 (line 809) | void VmlaNU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32_t* v...
  function VmlaNF32 (line 810) | void VmlaNF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32_...
  function VmlalS8 (line 811) | void VmlalS8(int16x8_t* r, int16x8_t* v0, int8x8_t* v1, int8x8_t* v2) { ...
  function VmlalS16 (line 812) | void VmlalS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v2)...
  function VmlalS32 (line 813) | void VmlalS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v2)...
  function VmlalU8 (line 814) | void VmlalU8(uint16x8_t* r, uint16x8_t* v0, uint8x8_t* v1, uint8x8_t* v2...
  function VmlalU16 (line 815) | void VmlalU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16x4_t*...
  function VmlalU32 (line 816) | void VmlalU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32x2_t*...
  function VmlalHighS8 (line 817) | void VmlalHighS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1, int8x16_t* ...
  function VmlalHighS16 (line 818) | void VmlalHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_t*...
  function VmlalHighS32 (line 819) | void VmlalHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_t*...
  function VmlalHighU8 (line 820) | void VmlalHighU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1, uint8x16...
  function VmlalHighU16 (line 821) | void VmlalHighU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16x...
  function VmlalHighU32 (line 822) | void VmlalHighU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32x...
  function VmlalHighNS16 (line 823) | void VmlalHighNS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16_t* ...
  function VmlalHighNS32 (line 824) | void VmlalHighNS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32_t* ...
  function VmlalHighNU16 (line 825) | void VmlalHighNU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16...
  function VmlalHighNU32 (line 826) | void VmlalHighNU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32...
  function VmlalNS16 (line 827) | void VmlalNS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16_t* v2) ...
  function VmlalNS32 (line 828) | void VmlalNS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32_t* v2) ...
  function VmlalNU16 (line 829) | void VmlalNU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16_t* ...
  function VmlalNU32 (line 830) | void VmlalNU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32_t* ...
  function VmlaqS8 (line 831) | void VmlaqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2) ...
  function VmlaqS16 (line 832) | void VmlaqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2)...
  function VmlaqS32 (line 833) | void VmlaqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2)...
  function VmlaqU8 (line 834) | void VmlaqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* ...
  function VmlaqU16 (line 835) | void VmlaqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t*...
  function VmlaqU32 (line 836) | void VmlaqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t*...
  function VmlaqF32 (line 837) | void VmlaqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32x...
  function VmlaqF64 (line 838) | void VmlaqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64x...
  function VmlaqNS16 (line 839) | void VmlaqNS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16_t* v2) ...
  function VmlaqNS32 (line 840) | void VmlaqNS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32_t* v2) ...
  function VmlaqNU16 (line 841) | void VmlaqNU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16_t* ...
  function VmlaqNU32 (line 842) | void VmlaqNU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32_t* ...
  function VmlaqNF32 (line 843) | void VmlaqNF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32...
  function VmlsS8 (line 844) | void VmlsS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r ...
  function VmlsS16 (line 845) | void VmlsS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* v2) ...
  function VmlsS32 (line 846) | void VmlsS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* v2) ...
  function VmlsU8 (line 847) | void VmlsU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1, uint8x8_t* v2) {...
  function VmlsU16 (line 848) | void VmlsU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16x4_t* ...
  function VmlsU32 (line 849) | void VmlsU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32x2_t* ...
  function VmlsF32 (line 850) | void VmlsF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32x2...
  function VmlsF64 (line 851) | void VmlsF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1, float64x1...
  function VmlsNS16 (line 852) | void VmlsNS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16_t* v2) {...
  function VmlsNS32 (line 853) | void VmlsNS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32_t* v2) {...
  function VmlsNU16 (line 854) | void VmlsNU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1, uint16_t* v...
  function VmlsNU32 (line 855) | void VmlsNU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1, uint32_t* v...
  function VmlsNF32 (line 856) | void VmlsNF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1, float32_...
  function VmlslS8 (line 857) | void VmlslS8(int16x8_t* r, int16x8_t* v0, int8x8_t* v1, int8x8_t* v2) { ...
  function VmlslS16 (line 858) | void VmlslS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v2)...
  function VmlslS32 (line 859) | void VmlslS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v2)...
  function VmlslU8 (line 860) | void VmlslU8(uint16x8_t* r, uint16x8_t* v0, uint8x8_t* v1, uint8x8_t* v2...
  function VmlslU16 (line 861) | void VmlslU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16x4_t*...
  function VmlslU32 (line 862) | void VmlslU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32x2_t*...
  function VmlslHighS8 (line 863) | void VmlslHighS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1, int8x16_t* ...
  function VmlslHighS16 (line 864) | void VmlslHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_t*...
  function VmlslHighS32 (line 865) | void VmlslHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_t*...
  function VmlslHighU8 (line 866) | void VmlslHighU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1, uint8x16...
  function VmlslHighU16 (line 867) | void VmlslHighU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16x...
  function VmlslHighU32 (line 868) | void VmlslHighU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32x...
  function VmlslHighNS16 (line 869) | void VmlslHighNS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16_t* ...
  function VmlslHighNS32 (line 870) | void VmlslHighNS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32_t* ...
  function VmlslHighNU16 (line 871) | void VmlslHighNU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1, uint16...
  function VmlslHighNU32 (line 872) | void VmlslHighNU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1, uint32...
  function VmlslNS16 (line 873) | void VmlslNS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16_t* v2) ...
  function VmlslNS32 (line 874) | void VmlslNS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32_t* v2) ...
  function VmlslNU16 (line 875) | void VmlslNU16(uint32x4_t* r, uint32x4_t* v0, uint16x4_t* v1, uint16_t* ...
  function VmlslNU32 (line 876) | void VmlslNU32(uint64x2_t* r, uint64x2_t* v0, uint32x2_t* v1, uint32_t* ...
  function VmlsqS8 (line 877) | void VmlsqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, int8x16_t* v2) ...
  function VmlsqS16 (line 878) | void VmlsqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t* v2)...
  function VmlsqS32 (line 879) | void VmlsqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t* v2)...
  function VmlsqU8 (line 880) | void VmlsqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t* ...
  function VmlsqU16 (line 881) | void VmlsqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16x8_t*...
  function VmlsqU32 (line 882) | void VmlsqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32x4_t*...
  function VmlsqF32 (line 883) | void VmlsqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32x...
  function VmlsqF64 (line 884) | void VmlsqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1, float64x...
  function VmlsqNS16 (line 885) | void VmlsqNS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16_t* v2) ...
  function VmlsqNS32 (line 886) | void VmlsqNS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32_t* v2) ...
  function VmlsqNU16 (line 887) | void VmlsqNU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1, uint16_t* ...
  function VmlsqNU32 (line 888) | void VmlsqNU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1, uint32_t* ...
  function VmlsqNF32 (line 889) | void VmlsqNF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1, float32...
  function VmmlaqS32 (line 890) | void VmmlaqS32(int32x4_t* r, int32x4_t* v0, int8x16_t* v1, int8x16_t* v2...
  function VmmlaqU32 (line 891) | void VmmlaqU32(uint32x4_t* r, uint32x4_t* v0, uint8x16_t* v1, uint8x16_t...
  function VmovNS8 (line 892) | void VmovNS8(int8x8_t* r, int8_t* v0) { *r = vmov_n_s8(*v0); }
  function VmovNS16 (line 893) | void VmovNS16(int16x4_t* r, int16_t* v0) { *r = vmov_n_s16(*v0); }
  function VmovNS32 (line 894) | void VmovNS32(int32x2_t* r, int32_t* v0) { *r = vmov_n_s32(*v0); }
  function VmovNS64 (line 895) | void VmovNS64(int64x1_t* r, int64_t* v0) { *r = vmov_n_s64(*v0); }
  function VmovNU8 (line 896) | void VmovNU8(uint8x8_t* r, uint8_t* v0) { *r = vmov_n_u8(*v0); }
  function VmovNU16 (line 897) | void VmovNU16(uint16x4_t* r, uint16_t* v0) { *r = vmov_n_u16(*v0); }
  function VmovNU32 (line 898) | void VmovNU32(uint32x2_t* r, uint32_t* v0) { *r = vmov_n_u32(*v0); }
  function VmovNU64 (line 899) | void VmovNU64(uint64x1_t* r, uint64_t* v0) { *r = vmov_n_u64(*v0); }
  function VmovNF32 (line 900) | void VmovNF32(float32x2_t* r, float32_t* v0) { *r = vmov_n_f32(*v0); }
  function VmovNF64 (line 901) | void VmovNF64(float64x1_t* r, float64_t* v0) { *r = vmov_n_f64(*v0); }
  function VmovNP16 (line 902) | void VmovNP16(poly16x4_t* r, poly16_t* v0) { *r = vmov_n_p16(*v0); }
  function VmovNP64 (line 903) | void VmovNP64(poly64x1_t* r, poly64_t* v0) { *r = vmov_n_p64(*v0); }
  function VmovNP8 (line 904) | void VmovNP8(poly8x8_t* r, poly8_t* v0) { *r = vmov_n_p8(*v0); }
  function VmovlS8 (line 905) | void VmovlS8(int16x8_t* r, int8x8_t* v0) { *r = vmovl_s8(*v0); }
  function VmovlS16 (line 906) | void VmovlS16(int32x4_t* r, int16x4_t* v0) { *r = vmovl_s16(*v0); }
  function VmovlS32 (line 907) | void VmovlS32(int64x2_t* r, int32x2_t* v0) { *r = vmovl_s32(*v0); }
  function VmovlU8 (line 908) | void VmovlU8(uint16x8_t* r, uint8x8_t* v0) { *r = vmovl_u8(*v0); }
  function VmovlU16 (line 909) | void VmovlU16(uint32x4_t* r, uint16x4_t* v0) { *r = vmovl_u16(*v0); }
  function VmovlU32 (line 910) | void VmovlU32(uint64x2_t* r, uint32x2_t* v0) { *r = vmovl_u32(*v0); }
  function VmovlHighS8 (line 911) | void VmovlHighS8(int16x8_t* r, int8x16_t* v0) { *r = vmovl_high_s8(*v0); }
  function VmovlHighS16 (line 912) | void VmovlHighS16(int32x4_t* r, int16x8_t* v0) { *r = vmovl_high_s16(*v0...
  function VmovlHighS32 (line 913) | void VmovlHighS32(int64x2_t* r, int32x4_t* v0) { *r = vmovl_high_s32(*v0...
  function VmovlHighU8 (line 914) | void VmovlHighU8(uint16x8_t* r, uint8x16_t* v0) { *r = vmovl_high_u8(*v0...
  function VmovlHighU16 (line 915) | void VmovlHighU16(uint32x4_t* r, uint16x8_t* v0) { *r = vmovl_high_u16(*...
  function VmovlHighU32 (line 916) | void VmovlHighU32(uint64x2_t* r, uint32x4_t* v0) { *r = vmovl_high_u32(*...
  function VmovnS16 (line 917) | void VmovnS16(int8x8_t* r, int16x8_t* v0) { *r = vmovn_s16(*v0); }
  function VmovnS32 (line 918) | void VmovnS32(int16x4_t* r, int32x4_t* v0) { *r = vmovn_s32(*v0); }
  function VmovnS64 (line 919) | void VmovnS64(int32x2_t* r, int64x2_t* v0) { *r = vmovn_s64(*v0); }
  function VmovnU16 (line 920) | void VmovnU16(uint8x8_t* r, uint16x8_t* v0) { *r = vmovn_u16(*v0); }
  function VmovnU32 (line 921) | void VmovnU32(uint16x4_t* r, uint32x4_t* v0) { *r = vmovn_u32(*v0); }
  function VmovnU64 (line 922) | void VmovnU64(uint32x2_t* r, uint64x2_t* v0) { *r = vmovn_u64(*v0); }
  function VmovnHighS16 (line 923) | void VmovnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1) { *r = vmov...
  function VmovnHighS32 (line 924) | void VmovnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1) { *r = vmo...
  function VmovnHighS64 (line 925) | void VmovnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1) { *r = vmo...
  function VmovnHighU16 (line 926) | void VmovnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1) { *r = v...
  function VmovnHighU32 (line 927) | void VmovnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1) { *r = ...
  function VmovnHighU64 (line 928) | void VmovnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1) { *r = ...
  function VmovqNS8 (line 929) | void VmovqNS8(int8x16_t* r, int8_t* v0) { *r = vmovq_n_s8(*v0); }
  function VmovqNS16 (line 930) | void VmovqNS16(int16x8_t* r, int16_t* v0) { *r = vmovq_n_s16(*v0); }
  function VmovqNS32 (line 931) | void VmovqNS32(int32x4_t* r, int32_t* v0) { *r = vmovq_n_s32(*v0); }
  function VmovqNS64 (line 932) | void VmovqNS64(int64x2_t* r, int64_t* v0) { *r = vmovq_n_s64(*v0); }
  function VmovqNU8 (line 933) | void VmovqNU8(uint8x16_t* r, uint8_t* v0) { *r = vmovq_n_u8(*v0); }
  function VmovqNU16 (line 934) | void VmovqNU16(uint16x8_t* r, uint16_t* v0) { *r = vmovq_n_u16(*v0); }
  function VmovqNU32 (line 935) | void VmovqNU32(uint32x4_t* r, uint32_t* v0) { *r = vmovq_n_u32(*v0); }
  function VmovqNU64 (line 936) | void VmovqNU64(uint64x2_t* r, uint64_t* v0) { *r = vmovq_n_u64(*v0); }
  function VmovqNF32 (line 937) | void VmovqNF32(float32x4_t* r, float32_t* v0) { *r = vmovq_n_f32(*v0); }
  function VmovqNF64 (line 938) | void VmovqNF64(float64x2_t* r, float64_t* v0) { *r = vmovq_n_f64(*v0); }
  function VmovqNP16 (line 939) | void VmovqNP16(poly16x8_t* r, poly16_t* v0) { *r = vmovq_n_p16(*v0); }
  function VmovqNP64 (line 940) | void VmovqNP64(poly64x2_t* r, poly64_t* v0) { *r = vmovq_n_p64(*v0); }
  function VmovqNP8 (line 941) | void VmovqNP8(poly8x16_t* r, poly8_t* v0) { *r = vmovq_n_p8(*v0); }
  function VmulS8 (line 942) | void VmulS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vmul_s8(*v0,...
  function VmulS16 (line 943) | void VmulS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vmul_s16...
  function VmulS32 (line 944) | void VmulS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vmul_s32...
  function VmulU8 (line 945) | void VmulU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vmul_u8(*...
  function VmulU16 (line 946) | void VmulU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vmul_...
  function VmulU32 (line 947) | void VmulU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vmul_...
  function VmulF32 (line 948) | void VmulF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = vm...
  function VmulF64 (line 949) | void VmulF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = vm...
  function VmulNS16 (line 950) | void VmulNS16(int16x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vmul_n_s1...
  function VmulNS32 (line 951) | void VmulNS32(int32x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vmul_n_s3...
  function VmulNU16 (line 952) | void VmulNU16(uint16x4_t* r, uint16x4_t* v0, uint16_t* v1) { *r = vmul_n...
  function VmulNU32 (line 953) | void VmulNU32(uint32x2_t* r, uint32x2_t* v0, uint32_t* v1) { *r = vmul_n...
  function VmulNF32 (line 954) | void VmulNF32(float32x2_t* r, float32x2_t* v0, float32_t* v1) { *r = vmu...
  function VmulNF64 (line 955) | void VmulNF64(float64x1_t* r, float64x1_t* v0, float64_t* v1) { *r = vmu...
  function VmulP8 (line 956) | void VmulP8(poly8x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vmul_p8(*...
  function VmullS8 (line 957) | void VmullS8(int16x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vmull_s8(*...
  function VmullS16 (line 958) | void VmullS16(int32x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vmull_s...
  function VmullS32 (line 959) | void VmullS32(int64x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vmull_s...
  function VmullU8 (line 960) | void VmullU8(uint16x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vmull_u...
  function VmullU16 (line 961) | void VmullU16(uint32x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vmul...
  function VmullU32 (line 962) | void VmullU32(uint64x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vmul...
  function VmullHighS8 (line 963) | void VmullHighS8(int16x8_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vmul...
  function VmullHighS16 (line 964) | void VmullHighS16(int32x4_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vmu...
  function VmullHighS32 (line 965) | void VmullHighS32(int64x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vmu...
  function VmullHighU8 (line 966) | void VmullHighU8(uint16x8_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = v...
  function VmullHighU16 (line 967) | void VmullHighU16(uint32x4_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = ...
  function VmullHighU32 (line 968) | void VmullHighU32(uint64x2_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = ...
  function VmullHighNS16 (line 969) | void VmullHighNS16(int32x4_t* r, int16x8_t* v0, int16_t* v1) { *r = vmul...
  function VmullHighNS32 (line 970) | void VmullHighNS32(int64x2_t* r, int32x4_t* v0, int32_t* v1) { *r = vmul...
  function VmullHighNU16 (line 971) | void VmullHighNU16(uint32x4_t* r, uint16x8_t* v0, uint16_t* v1) { *r = v...
  function VmullHighNU32 (line 972) | void VmullHighNU32(uint64x2_t* r, uint32x4_t* v0, uint32_t* v1) { *r = v...
  function VmullHighP64 (line 973) | void VmullHighP64(poly128_t* r, poly64x2_t* v0, poly64x2_t* v1) { *r = v...
  function VmullHighP8 (line 974) | void VmullHighP8(poly16x8_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = v...
  function VmullNS16 (line 975) | void VmullNS16(int32x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vmull_n_...
  function VmullNS32 (line 976) | void VmullNS32(int64x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vmull_n_...
  function VmullNU16 (line 977) | void VmullNU16(uint32x4_t* r, uint16x4_t* v0, uint16_t* v1) { *r = vmull...
  function VmullNU32 (line 978) | void VmullNU32(uint64x2_t* r, uint32x2_t* v0, uint32_t* v1) { *r = vmull...
  function VmullP64 (line 979) | void VmullP64(poly128_t* r, poly64_t* v0, poly64_t* v1) { *r = vmull_p64...
  function VmullP8 (line 980) | void VmullP8(poly16x8_t* r, poly8x8_t* v0, poly8x8_t* v1) { *r = vmull_p...
  function VmulqS8 (line 981) | void VmulqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vmulq_s8...
  function VmulqS16 (line 982) | void VmulqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vmulq_s...
  function VmulqS32 (line 983) | void VmulqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vmulq_s...
  function VmulqU8 (line 984) | void VmulqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vmulq...
  function VmulqU16 (line 985) | void VmulqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vmul...
  function VmulqU32 (line 986) | void VmulqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vmul...
  function VmulqF32 (line 987) | void VmulqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = v...
  function VmulqF64 (line 988) | void VmulqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = v...
  function VmulqNS16 (line 989) | void VmulqNS16(int16x8_t* r, int16x8_t* v0, int16_t* v1) { *r = vmulq_n_...
  function VmulqNS32 (line 990) | void VmulqNS32(int32x4_t* r, int32x4_t* v0, int32_t* v1) { *r = vmulq_n_...
  function VmulqNU16 (line 991) | void VmulqNU16(uint16x8_t* r, uint16x8_t* v0, uint16_t* v1) { *r = vmulq...
  function VmulqNU32 (line 992) | void VmulqNU32(uint32x4_t* r, uint32x4_t* v0, uint32_t* v1) { *r = vmulq...
  function VmulqNF32 (line 993) | void VmulqNF32(float32x4_t* r, float32x4_t* v0, float32_t* v1) { *r = vm...
  function VmulqNF64 (line 994) | void VmulqNF64(float64x2_t* r, float64x2_t* v0, float64_t* v1) { *r = vm...
  function VmulqP8 (line 995) | void VmulqP8(poly8x16_t* r, poly8x16_t* v0, poly8x16_t* v1) { *r = vmulq...
  function VmulxF32 (line 996) | void VmulxF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = v...
  function VmulxF64 (line 997) | void VmulxF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = v...
  function VmulxdF64 (line 998) | void VmulxdF64(float64_t* r, float64_t* v0, float64_t* v1) { *r = vmulxd...
  function VmulxqF32 (line 999) | void VmulxqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = ...
  function VmulxqF64 (line 1000) | void VmulxqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = ...
  function VmulxsF32 (line 1001) | void VmulxsF32(float32_t* r, float32_t* v0, float32_t* v1) { *r = vmulxs...
  function VmvnS8 (line 1002) | void VmvnS8(int8x8_t* r, int8x8_t* v0) { *r = vmvn_s8(*v0); }
  function VmvnS16 (line 1003) | void VmvnS16(int16x4_t* r, int16x4_t* v0) { *r = vmvn_s16(*v0); }
  function VmvnS32 (line 1004) | void VmvnS32(int32x2_t* r, int32x2_t* v0) { *r = vmvn_s32(*v0); }
  function VmvnU8 (line 1005) | void VmvnU8(uint8x8_t* r, uint8x8_t* v0) { *r = vmvn_u8(*v0); }
  function VmvnU16 (line 1006) | void VmvnU16(uint16x4_t* r, uint16x4_t* v0) { *r = vmvn_u16(*v0); }
  function VmvnU32 (line 1007) | void VmvnU32(uint32x2_t* r, uint32x2_t* v0) { *r = vmvn_u32(*v0); }
  function VmvnP8 (line 1008) | void VmvnP8(poly8x8_t* r, poly8x8_t* v0) { *r = vmvn_p8(*v0); }
  function VmvnqS8 (line 1009) | void VmvnqS8(int8x16_t* r, int8x16_t* v0) { *r = vmvnq_s8(*v0); }
  function VmvnqS16 (line 1010) | void VmvnqS16(int16x8_t* r, int16x8_t* v0) { *r = vmvnq_s16(*v0); }
  function VmvnqS32 (line 1011) | void VmvnqS32(int32x4_t* r, int32x4_t* v0) { *r = vmvnq_s32(*v0); }
  function VmvnqU8 (line 1012) | void VmvnqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vmvnq_u8(*v0); }
  function VmvnqU16 (line 1013) | void VmvnqU16(uint16x8_t* r, uint16x8_t* v0) { *r = vmvnq_u16(*v0); }
  function VmvnqU32 (line 1014) | void VmvnqU32(uint32x4_t* r, uint32x4_t* v0) { *r = vmvnq_u32(*v0); }
  function VmvnqP8 (line 1015) | void VmvnqP8(poly8x16_t* r, poly8x16_t* v0) { *r = vmvnq_p8(*v0); }
  function VnegS8 (line 1016) | void VnegS8(int8x8_t* r, int8x8_t* v0) { *r = vneg_s8(*v0); }
  function VnegS16 (line 1017) | void VnegS16(int16x4_t* r, int16x4_t* v0) { *r = vneg_s16(*v0); }
  function VnegS32 (line 1018) | void VnegS32(int32x2_t* r, int32x2_t* v0) { *r = vneg_s32(*v0); }
  function VnegS64 (line 1019) | void VnegS64(int64x1_t* r, int64x1_t* v0) { *r = vneg_s64(*v0); }
  function VnegF32 (line 1020) | void VnegF32(float32x2_t* r, float32x2_t* v0) { *r = vneg_f32(*v0); }
  function VnegF64 (line 1021) | void VnegF64(float64x1_t* r, float64x1_t* v0) { *r = vneg_f64(*v0); }
  function VnegdS64 (line 1022) | void VnegdS64(int64_t* r, int64_t* v0) { *r = vnegd_s64(*v0); }
  function VnegqS8 (line 1023) | void VnegqS8(int8x16_t* r, int8x16_t* v0) { *r = vnegq_s8(*v0); }
  function VnegqS16 (line 1024) | void VnegqS16(int16x8_t* r, int16x8_t* v0) { *r = vnegq_s16(*v0); }
  function VnegqS32 (line 1025) | void VnegqS32(int32x4_t* r, int32x4_t* v0) { *r = vnegq_s32(*v0); }
  function VnegqS64 (line 1026) | void VnegqS64(int64x2_t* r, int64x2_t* v0) { *r = vnegq_s64(*v0); }
  function VnegqF32 (line 1027) | void VnegqF32(float32x4_t* r, float32x4_t* v0) { *r = vnegq_f32(*v0); }
  function VnegqF64 (line 1028) | void VnegqF64(float64x2_t* r, float64x2_t* v0) { *r = vnegq_f64(*v0); }
  function VornS8 (line 1029) | void VornS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vorn_s8(*v0,...
  function VornS16 (line 1030) | void VornS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vorn_s16...
  function VornS32 (line 1031) | void VornS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vorn_s32...
  function VornS64 (line 1032) | void VornS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vorn_s64...
  function VornU8 (line 1033) | void VornU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vorn_u8(*...
  function VornU16 (line 1034) | void VornU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vorn_...
  function VornU32 (line 1035) | void VornU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vorn_...
  function VornU64 (line 1036) | void VornU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vorn_...
  function VornqS8 (line 1037) | void VornqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vornq_s8...
  function VornqS16 (line 1038) | void VornqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vornq_s...
  function VornqS32 (line 1039) | void VornqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vornq_s...
  function VornqS64 (line 1040) | void VornqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vornq_s...
  function VornqU8 (line 1041) | void VornqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vornq...
  function VornqU16 (line 1042) | void VornqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vorn...
  function VornqU32 (line 1043) | void VornqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vorn...
  function VornqU64 (line 1044) | void VornqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vorn...
  function VorrS8 (line 1045) | void VorrS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vorr_s8(*v0,...
  function VorrS16 (line 1046) | void VorrS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vorr_s16...
  function VorrS32 (line 1047) | void VorrS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vorr_s32...
  function VorrS64 (line 1048) | void VorrS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vorr_s64...
  function VorrU8 (line 1049) | void VorrU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vorr_u8(*...
  function VorrU16 (line 1050) | void VorrU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vorr_...
  function VorrU32 (line 1051) | void VorrU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vorr_...
  function VorrU64 (line 1052) | void VorrU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vorr_...
  function VorrqS8 (line 1053) | void VorrqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vorrq_s8...
  function VorrqS16 (line 1054) | void VorrqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vorrq_s...
  function VorrqS32 (line 1055) | void VorrqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vorrq_s...
  function VorrqS64 (line 1056) | void VorrqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vorrq_s...
  function VorrqU8 (line 1057) | void VorrqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vorrq...
  function VorrqU16 (line 1058) | void VorrqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vorr...
  function VorrqU32 (line 1059) | void VorrqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vorr...
  function VorrqU64 (line 1060) | void VorrqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vorr...
  function VpadalS8 (line 1061) | void VpadalS8(int16x4_t* r, int16x4_t* v0, int8x8_t* v1) { *r = vpadal_s...
  function VpadalS16 (line 1062) | void VpadalS16(int32x2_t* r, int32x2_t* v0, int16x4_t* v1) { *r = vpadal...
  function VpadalS32 (line 1063) | void VpadalS32(int64x1_t* r, int64x1_t* v0, int32x2_t* v1) { *r = vpadal...
  function VpadalU8 (line 1064) | void VpadalU8(uint16x4_t* r, uint16x4_t* v0, uint8x8_t* v1) { *r = vpada...
  function VpadalU16 (line 1065) | void VpadalU16(uint32x2_t* r, uint32x2_t* v0, uint16x4_t* v1) { *r = vpa...
  function VpadalU32 (line 1066) | void VpadalU32(uint64x1_t* r, uint64x1_t* v0, uint32x2_t* v1) { *r = vpa...
  function VpadalqS8 (line 1067) | void VpadalqS8(int16x8_t* r, int16x8_t* v0, int8x16_t* v1) { *r = vpadal...
  function VpadalqS16 (line 1068) | void VpadalqS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1) { *r = vpada...
  function VpadalqS32 (line 1069) | void VpadalqS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1) { *r = vpada...
  function VpadalqU8 (line 1070) | void VpadalqU8(uint16x8_t* r, uint16x8_t* v0, uint8x16_t* v1) { *r = vpa...
  function VpadalqU16 (line 1071) | void VpadalqU16(uint32x4_t* r, uint32x4_t* v0, uint16x8_t* v1) { *r = vp...
  function VpadalqU32 (line 1072) | void VpadalqU32(uint64x2_t* r, uint64x2_t* v0, uint32x4_t* v1) { *r = vp...
  function VpaddS8 (line 1073) | void VpaddS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vpadd_s8(*v...
  function VpaddS16 (line 1074) | void VpaddS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vpadd_s...
  function VpaddS32 (line 1075) | void VpaddS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vpadd_s...
  function VpaddU8 (line 1076) | void VpaddU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vpadd_u8...
  function VpaddU16 (line 1077) | void VpaddU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vpad...
  function VpaddU32 (line 1078) | void VpaddU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vpad...
  function VpaddF32 (line 1079) | void VpaddF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = v...
  function VpadddS64 (line 1080) | void VpadddS64(int64_t* r, int64x2_t* v0) { *r = vpaddd_s64(*v0); }
  function VpadddU64 (line 1081) | void VpadddU64(uint64_t* r, uint64x2_t* v0) { *r = vpaddd_u64(*v0); }
  function VpadddF64 (line 1082) | void VpadddF64(float64_t* r, float64x2_t* v0) { *r = vpaddd_f64(*v0); }
  function VpaddlS8 (line 1083) | void VpaddlS8(int16x4_t* r, int8x8_t* v0) { *r = vpaddl_s8(*v0); }
  function VpaddlS16 (line 1084) | void VpaddlS16(int32x2_t* r, int16x4_t* v0) { *r = vpaddl_s16(*v0); }
  function VpaddlS32 (line 1085) | void VpaddlS32(int64x1_t* r, int32x2_t* v0) { *r = vpaddl_s32(*v0); }
  function VpaddlU8 (line 1086) | void VpaddlU8(uint16x4_t* r, uint8x8_t* v0) { *r = vpaddl_u8(*v0); }
  function VpaddlU16 (line 1087) | void VpaddlU16(uint32x2_t* r, uint16x4_t* v0) { *r = vpaddl_u16(*v0); }
  function VpaddlU32 (line 1088) | void VpaddlU32(uint64x1_t* r, uint32x2_t* v0) { *r = vpaddl_u32(*v0); }
  function VpaddlqS8 (line 1089) | void VpaddlqS8(int16x8_t* r, int8x16_t* v0) { *r = vpaddlq_s8(*v0); }
  function VpaddlqS16 (line 1090) | void VpaddlqS16(int32x4_t* r, int16x8_t* v0) { *r = vpaddlq_s16(*v0); }
  function VpaddlqS32 (line 1091) | void VpaddlqS32(int64x2_t* r, int32x4_t* v0) { *r = vpaddlq_s32(*v0); }
  function VpaddlqU8 (line 1092) | void VpaddlqU8(uint16x8_t* r, uint8x16_t* v0) { *r = vpaddlq_u8(*v0); }
  function VpaddlqU16 (line 1093) | void VpaddlqU16(uint32x4_t* r, uint16x8_t* v0) { *r = vpaddlq_u16(*v0); }
  function VpaddlqU32 (line 1094) | void VpaddlqU32(uint64x2_t* r, uint32x4_t* v0) { *r = vpaddlq_u32(*v0); }
  function VpaddqS8 (line 1095) | void VpaddqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vpaddq_...
  function VpaddqS16 (line 1096) | void VpaddqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vpaddq...
  function VpaddqS32 (line 1097) | void VpaddqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vpaddq...
  function VpaddqS64 (line 1098) | void VpaddqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vpaddq...
  function VpaddqU8 (line 1099) | void VpaddqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vpad...
  function VpaddqU16 (line 1100) | void VpaddqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vpa...
  function VpaddqU32 (line 1101) | void VpaddqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vpa...
  function VpaddqU64 (line 1102) | void VpaddqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vpa...
  function VpaddqF32 (line 1103) | void VpaddqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = ...
  function VpaddqF64 (line 1104) | void VpaddqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = ...
  function VpaddsF32 (line 1105) | void VpaddsF32(float32_t* r, float32x2_t* v0) { *r = vpadds_f32(*v0); }
  function VpmaxS8 (line 1106) | void VpmaxS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vpmax_s8(*v...
  function VpmaxS16 (line 1107) | void VpmaxS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vpmax_s...
  function VpmaxS32 (line 1108) | void VpmaxS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vpmax_s...
  function VpmaxU8 (line 1109) | void VpmaxU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vpmax_u8...
  function VpmaxU16 (line 1110) | void VpmaxU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vpma...
  function VpmaxU32 (line 1111) | void VpmaxU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vpma...
  function VpmaxF32 (line 1112) | void VpmaxF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = v...
  function VpmaxnmF32 (line 1113) | void VpmaxnmF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r =...
  function VpmaxnmqF32 (line 1114) | void VpmaxnmqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r ...
  function VpmaxnmqF64 (line 1115) | void VpmaxnmqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r ...
  function VpmaxnmqdF64 (line 1116) | void VpmaxnmqdF64(float64_t* r, float64x2_t* v0) { *r = vpmaxnmqd_f64(*v...
  function VpmaxnmsF32 (line 1117) | void VpmaxnmsF32(float32_t* r, float32x2_t* v0) { *r = vpmaxnms_f32(*v0); }
  function VpmaxqS8 (line 1118) | void VpmaxqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vpmaxq_...
  function VpmaxqS16 (line 1119) | void VpmaxqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vpmaxq...
  function VpmaxqS32 (line 1120) | void VpmaxqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vpmaxq...
  function VpmaxqU8 (line 1121) | void VpmaxqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vpma...
  function VpmaxqU16 (line 1122) | void VpmaxqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vpm...
  function VpmaxqU32 (line 1123) | void VpmaxqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vpm...
  function VpmaxqF32 (line 1124) | void VpmaxqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = ...
  function VpmaxqF64 (line 1125) | void VpmaxqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = ...
  function VpmaxqdF64 (line 1126) | void VpmaxqdF64(float64_t* r, float64x2_t* v0) { *r = vpmaxqd_f64(*v0); }
  function VpmaxsF32 (line 1127) | void VpmaxsF32(float32_t* r, float32x2_t* v0) { *r = vpmaxs_f32(*v0); }
  function VpminS8 (line 1128) | void VpminS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vpmin_s8(*v...
  function VpminS16 (line 1129) | void VpminS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vpmin_s...
  function VpminS32 (line 1130) | void VpminS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vpmin_s...
  function VpminU8 (line 1131) | void VpminU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vpmin_u8...
  function VpminU16 (line 1132) | void VpminU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vpmi...
  function VpminU32 (line 1133) | void VpminU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vpmi...
  function VpminF32 (line 1134) | void VpminF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = v...
  function VpminnmF32 (line 1135) | void VpminnmF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r =...
  function VpminnmqF32 (line 1136) | void VpminnmqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r ...
  function VpminnmqF64 (line 1137) | void VpminnmqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r ...
  function VpminnmqdF64 (line 1138) | void VpminnmqdF64(float64_t* r, float64x2_t* v0) { *r = vpminnmqd_f64(*v...
  function VpminnmsF32 (line 1139) | void VpminnmsF32(float32_t* r, float32x2_t* v0) { *r = vpminnms_f32(*v0); }
  function VpminqS8 (line 1140) | void VpminqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vpminq_...
  function VpminqS16 (line 1141) | void VpminqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vpminq...
  function VpminqS32 (line 1142) | void VpminqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vpminq...
  function VpminqU8 (line 1143) | void VpminqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vpmi...
  function VpminqU16 (line 1144) | void VpminqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vpm...
  function VpminqU32 (line 1145) | void VpminqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vpm...
  function VpminqF32 (line 1146) | void VpminqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r = ...
  function VpminqF64 (line 1147) | void VpminqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r = ...
  function VpminqdF64 (line 1148) | void VpminqdF64(float64_t* r, float64x2_t* v0) { *r = vpminqd_f64(*v0); }
  function VpminsF32 (line 1149) | void VpminsF32(float32_t* r, float32x2_t* v0) { *r = vpmins_f32(*v0); }
  function VqabsS8 (line 1150) | void VqabsS8(int8x8_t* r, int8x8_t* v0) { *r = vqabs_s8(*v0); }
  function VqabsS16 (line 1151) | void VqabsS16(int16x4_t* r, int16x4_t* v0) { *r = vqabs_s16(*v0); }
  function VqabsS32 (line 1152) | void VqabsS32(int32x2_t* r, int32x2_t* v0) { *r = vqabs_s32(*v0); }
  function VqabsS64 (line 1153) | void VqabsS64(int64x1_t* r, int64x1_t* v0) { *r = vqabs_s64(*v0); }
  function VqabsbS8 (line 1154) | void VqabsbS8(int8_t* r, int8_t* v0) { *r = vqabsb_s8(*v0); }
  function VqabsdS64 (line 1155) | void VqabsdS64(int64_t* r, int64_t* v0) { *r = vqabsd_s64(*v0); }
  function VqabshS16 (line 1156) | void VqabshS16(int16_t* r, int16_t* v0) { *r = vqabsh_s16(*v0); }
  function VqabsqS8 (line 1157) | void VqabsqS8(int8x16_t* r, int8x16_t* v0) { *r = vqabsq_s8(*v0); }
  function VqabsqS16 (line 1158) | void VqabsqS16(int16x8_t* r, int16x8_t* v0) { *r = vqabsq_s16(*v0); }
  function VqabsqS32 (line 1159) | void VqabsqS32(int32x4_t* r, int32x4_t* v0) { *r = vqabsq_s32(*v0); }
  function VqabsqS64 (line 1160) | void VqabsqS64(int64x2_t* r, int64x2_t* v0) { *r = vqabsq_s64(*v0); }
  function VqabssS32 (line 1161) | void VqabssS32(int32_t* r, int32_t* v0) { *r = vqabss_s32(*v0); }
  function VqaddS8 (line 1162) | void VqaddS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vqadd_s8(*v...
  function VqaddS16 (line 1163) | void VqaddS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqadd_s...
  function VqaddS32 (line 1164) | void VqaddS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqadd_s...
  function VqaddS64 (line 1165) | void VqaddS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vqadd_s...
  function VqaddU8 (line 1166) | void VqaddU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vqadd_u8...
  function VqaddU16 (line 1167) | void VqaddU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vqad...
  function VqaddU32 (line 1168) | void VqaddU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vqad...
  function VqaddU64 (line 1169) | void VqaddU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vqad...
  function VqaddbS8 (line 1170) | void VqaddbS8(int8_t* r, int8_t* v0, int8_t* v1) { *r = vqaddb_s8(*v0, *...
  function VqaddbU8 (line 1171) | void VqaddbU8(uint8_t* r, uint8_t* v0, uint8_t* v1) { *r = vqaddb_u8(*v0...
  function VqadddS64 (line 1172) | void VqadddS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vqaddd_s64(*...
  function VqadddU64 (line 1173) | void VqadddU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vqaddd_u6...
  function VqaddhS16 (line 1174) | void VqaddhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqaddh_s16(*...
  function VqaddhU16 (line 1175) | void VqaddhU16(uint16_t* r, uint16_t* v0, uint16_t* v1) { *r = vqaddh_u1...
  function VqaddqS8 (line 1176) | void VqaddqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vqaddq_...
  function VqaddqS16 (line 1177) | void VqaddqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqaddq...
  function VqaddqS32 (line 1178) | void VqaddqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqaddq...
  function VqaddqS64 (line 1179) | void VqaddqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vqaddq...
  function VqaddqU8 (line 1180) | void VqaddqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vqad...
  function VqaddqU16 (line 1181) | void VqaddqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vqa...
  function VqaddqU32 (line 1182) | void VqaddqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vqa...
  function VqaddqU64 (line 1183) | void VqaddqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vqa...
  function VqaddsS32 (line 1184) | void VqaddsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqadds_s32(*...
  function VqaddsU32 (line 1185) | void VqaddsU32(uint32_t* r, uint32_t* v0, uint32_t* v1) { *r = vqadds_u3...
  function VqdmlalS16 (line 1186) | void VqdmlalS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v...
  function VqdmlalS32 (line 1187) | void VqdmlalS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v...
  function VqdmlalHighS16 (line 1188) | void VqdmlalHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_...
  function VqdmlalHighS32 (line 1189) | void VqdmlalHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_...
  function VqdmlalHighNS16 (line 1190) | void VqdmlalHighNS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16_t...
  function VqdmlalHighNS32 (line 1191) | void VqdmlalHighNS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32_t...
  function VqdmlalNS16 (line 1192) | void VqdmlalNS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16_t* v2...
  function VqdmlalNS32 (line 1193) | void VqdmlalNS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32_t* v2...
  function VqdmlalhS16 (line 1194) | void VqdmlalhS16(int32_t* r, int32_t* v0, int16_t* v1, int16_t* v2) { *r...
  function VqdmlalsS32 (line 1195) | void VqdmlalsS32(int64_t* r, int64_t* v0, int32_t* v1, int32_t* v2) { *r...
  function VqdmlslS16 (line 1196) | void VqdmlslS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16x4_t* v...
  function VqdmlslS32 (line 1197) | void VqdmlslS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32x2_t* v...
  function VqdmlslHighS16 (line 1198) | void VqdmlslHighS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16x8_...
  function VqdmlslHighS32 (line 1199) | void VqdmlslHighS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32x4_...
  function VqdmlslHighNS16 (line 1200) | void VqdmlslHighNS16(int32x4_t* r, int32x4_t* v0, int16x8_t* v1, int16_t...
  function VqdmlslHighNS32 (line 1201) | void VqdmlslHighNS32(int64x2_t* r, int64x2_t* v0, int32x4_t* v1, int32_t...
  function VqdmlslNS16 (line 1202) | void VqdmlslNS16(int32x4_t* r, int32x4_t* v0, int16x4_t* v1, int16_t* v2...
  function VqdmlslNS32 (line 1203) | void VqdmlslNS32(int64x2_t* r, int64x2_t* v0, int32x2_t* v1, int32_t* v2...
  function VqdmlslhS16 (line 1204) | void VqdmlslhS16(int32_t* r, int32_t* v0, int16_t* v1, int16_t* v2) { *r...
  function VqdmlslsS32 (line 1205) | void VqdmlslsS32(int64_t* r, int64_t* v0, int32_t* v1, int32_t* v2) { *r...
  function VqdmulhS16 (line 1206) | void VqdmulhS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqdmu...
  function VqdmulhS32 (line 1207) | void VqdmulhS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqdmu...
  function VqdmulhNS16 (line 1208) | void VqdmulhNS16(int16x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vqdmul...
  function VqdmulhNS32 (line 1209) | void VqdmulhNS32(int32x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vqdmul...
  function VqdmulhhS16 (line 1210) | void VqdmulhhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqdmulhh_s...
  function VqdmulhqS16 (line 1211) | void VqdmulhqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqdm...
  function VqdmulhqS32 (line 1212) | void VqdmulhqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqdm...
  function VqdmulhqNS16 (line 1213) | void VqdmulhqNS16(int16x8_t* r, int16x8_t* v0, int16_t* v1) { *r = vqdmu...
  function VqdmulhqNS32 (line 1214) | void VqdmulhqNS32(int32x4_t* r, int32x4_t* v0, int32_t* v1) { *r = vqdmu...
  function VqdmulhsS32 (line 1215) | void VqdmulhsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqdmulhs_s...
  function VqdmullS16 (line 1216) | void VqdmullS16(int32x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqdmu...
  function VqdmullS32 (line 1217) | void VqdmullS32(int64x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqdmu...
  function VqdmullHighS16 (line 1218) | void VqdmullHighS16(int32x4_t* r, int16x8_t* v0, int16x8_t* v1) { *r = v...
  function VqdmullHighS32 (line 1219) | void VqdmullHighS32(int64x2_t* r, int32x4_t* v0, int32x4_t* v1) { *r = v...
  function VqdmullHighNS16 (line 1220) | void VqdmullHighNS16(int32x4_t* r, int16x8_t* v0, int16_t* v1) { *r = vq...
  function VqdmullHighNS32 (line 1221) | void VqdmullHighNS32(int64x2_t* r, int32x4_t* v0, int32_t* v1) { *r = vq...
  function VqdmullNS16 (line 1222) | void VqdmullNS16(int32x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vqdmul...
  function VqdmullNS32 (line 1223) | void VqdmullNS32(int64x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vqdmul...
  function VqdmullhS16 (line 1224) | void VqdmullhS16(int32_t* r, int16_t* v0, int16_t* v1) { *r = vqdmullh_s...
  function VqdmullsS32 (line 1225) | void VqdmullsS32(int64_t* r, int32_t* v0, int32_t* v1) { *r = vqdmulls_s...
  function VqmovnS16 (line 1226) | void VqmovnS16(int8x8_t* r, int16x8_t* v0) { *r = vqmovn_s16(*v0); }
  function VqmovnS32 (line 1227) | void VqmovnS32(int16x4_t* r, int32x4_t* v0) { *r = vqmovn_s32(*v0); }
  function VqmovnS64 (line 1228) | void VqmovnS64(int32x2_t* r, int64x2_t* v0) { *r = vqmovn_s64(*v0); }
  function VqmovnU16 (line 1229) | void VqmovnU16(uint8x8_t* r, uint16x8_t* v0) { *r = vqmovn_u16(*v0); }
  function VqmovnU32 (line 1230) | void VqmovnU32(uint16x4_t* r, uint32x4_t* v0) { *r = vqmovn_u32(*v0); }
  function VqmovnU64 (line 1231) | void VqmovnU64(uint32x2_t* r, uint64x2_t* v0) { *r = vqmovn_u64(*v0); }
  function VqmovnHighS16 (line 1232) | void VqmovnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1) { *r = vqm...
  function VqmovnHighS32 (line 1233) | void VqmovnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1) { *r = vq...
  function VqmovnHighS64 (line 1234) | void VqmovnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1) { *r = vq...
  function VqmovnHighU16 (line 1235) | void VqmovnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1) { *r = ...
  function VqmovnHighU32 (line 1236) | void VqmovnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1) { *r =...
  function VqmovnHighU64 (line 1237) | void VqmovnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1) { *r =...
  function VqmovndS64 (line 1238) | void VqmovndS64(int32_t* r, int64_t* v0) { *r = vqmovnd_s64(*v0); }
  function VqmovndU64 (line 1239) | void VqmovndU64(uint32_t* r, uint64_t* v0) { *r = vqmovnd_u64(*v0); }
  function VqmovnhS16 (line 1240) | void VqmovnhS16(int8_t* r, int16_t* v0) { *r = vqmovnh_s16(*v0); }
  function VqmovnhU16 (line 1241) | void VqmovnhU16(uint8_t* r, uint16_t* v0) { *r = vqmovnh_u16(*v0); }
  function VqmovnsS32 (line 1242) | void VqmovnsS32(int16_t* r, int32_t* v0) { *r = vqmovns_s32(*v0); }
  function VqmovnsU32 (line 1243) | void VqmovnsU32(uint16_t* r, uint32_t* v0) { *r = vqmovns_u32(*v0); }
  function VqmovunS16 (line 1244) | void VqmovunS16(uint8x8_t* r, int16x8_t* v0) { *r = vqmovun_s16(*v0); }
  function VqmovunS32 (line 1245) | void VqmovunS32(uint16x4_t* r, int32x4_t* v0) { *r = vqmovun_s32(*v0); }
  function VqmovunS64 (line 1246) | void VqmovunS64(uint32x2_t* r, int64x2_t* v0) { *r = vqmovun_s64(*v0); }
  function VqmovunHighS16 (line 1247) | void VqmovunHighS16(uint8x16_t* r, uint8x8_t* v0, int16x8_t* v1) { *r = ...
  function VqmovunHighS32 (line 1248) | void VqmovunHighS32(uint16x8_t* r, uint16x4_t* v0, int32x4_t* v1) { *r =...
  function VqmovunHighS64 (line 1249) | void VqmovunHighS64(uint32x4_t* r, uint32x2_t* v0, int64x2_t* v1) { *r =...
  function VqmovundS64 (line 1250) | void VqmovundS64(uint32_t* r, int64_t* v0) { *r = vqmovund_s64(*v0); }
  function VqmovunhS16 (line 1251) | void VqmovunhS16(uint8_t* r, int16_t* v0) { *r = vqmovunh_s16(*v0); }
  function VqmovunsS32 (line 1252) | void VqmovunsS32(uint16_t* r, int32_t* v0) { *r = vqmovuns_s32(*v0); }
  function VqnegS8 (line 1253) | void VqnegS8(int8x8_t* r, int8x8_t* v0) { *r = vqneg_s8(*v0); }
  function VqnegS16 (line 1254) | void VqnegS16(int16x4_t* r, int16x4_t* v0) { *r = vqneg_s16(*v0); }
  function VqnegS32 (line 1255) | void VqnegS32(int32x2_t* r, int32x2_t* v0) { *r = vqneg_s32(*v0); }
  function VqnegS64 (line 1256) | void VqnegS64(int64x1_t* r, int64x1_t* v0) { *r = vqneg_s64(*v0); }
  function VqnegbS8 (line 1257) | void VqnegbS8(int8_t* r, int8_t* v0) { *r = vqnegb_s8(*v0); }
  function VqnegdS64 (line 1258) | void VqnegdS64(int64_t* r, int64_t* v0) { *r = vqnegd_s64(*v0); }
  function VqneghS16 (line 1259) | void VqneghS16(int16_t* r, int16_t* v0) { *r = vqnegh_s16(*v0); }
  function VqnegqS8 (line 1260) | void VqnegqS8(int8x16_t* r, int8x16_t* v0) { *r = vqnegq_s8(*v0); }
  function VqnegqS16 (line 1261) | void VqnegqS16(int16x8_t* r, int16x8_t* v0) { *r = vqnegq_s16(*v0); }
  function VqnegqS32 (line 1262) | void VqnegqS32(int32x4_t* r, int32x4_t* v0) { *r = vqnegq_s32(*v0); }
  function VqnegqS64 (line 1263) | void VqnegqS64(int64x2_t* r, int64x2_t* v0) { *r = vqnegq_s64(*v0); }
  function VqnegsS32 (line 1264) | void VqnegsS32(int32_t* r, int32_t* v0) { *r = vqnegs_s32(*v0); }
  function VqrdmlahS16 (line 1265) | void VqrdmlahS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* ...
  function VqrdmlahS32 (line 1266) | void VqrdmlahS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* ...
  function VqrdmlahhS16 (line 1267) | void VqrdmlahhS16(int16_t* r, int16_t* v0, int16_t* v1, int16_t* v2) { *...
  function VqrdmlahqS16 (line 1268) | void VqrdmlahqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t*...
  function VqrdmlahqS32 (line 1269) | void VqrdmlahqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t*...
  function VqrdmlahsS32 (line 1270) | void VqrdmlahsS32(int32_t* r, int32_t* v0, int32_t* v1, int32_t* v2) { *...
  function VqrdmlshS16 (line 1271) | void VqrdmlshS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1, int16x4_t* ...
  function VqrdmlshS32 (line 1272) | void VqrdmlshS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1, int32x2_t* ...
  function VqrdmlshhS16 (line 1273) | void VqrdmlshhS16(int16_t* r, int16_t* v0, int16_t* v1, int16_t* v2) { *...
  function VqrdmlshqS16 (line 1274) | void VqrdmlshqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1, int16x8_t*...
  function VqrdmlshqS32 (line 1275) | void VqrdmlshqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1, int32x4_t*...
  function VqrdmlshsS32 (line 1276) | void VqrdmlshsS32(int32_t* r, int32_t* v0, int32_t* v1, int32_t* v2) { *...
  function VqrdmulhS16 (line 1277) | void VqrdmulhS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqrd...
  function VqrdmulhS32 (line 1278) | void VqrdmulhS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqrd...
  function VqrdmulhNS16 (line 1279) | void VqrdmulhNS16(int16x4_t* r, int16x4_t* v0, int16_t* v1) { *r = vqrdm...
  function VqrdmulhNS32 (line 1280) | void VqrdmulhNS32(int32x2_t* r, int32x2_t* v0, int32_t* v1) { *r = vqrdm...
  function VqrdmulhhS16 (line 1281) | void VqrdmulhhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqrdmulhh...
  function VqrdmulhqS16 (line 1282) | void VqrdmulhqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqr...
  function VqrdmulhqS32 (line 1283) | void VqrdmulhqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqr...
  function VqrdmulhqNS16 (line 1284) | void VqrdmulhqNS16(int16x8_t* r, int16x8_t* v0, int16_t* v1) { *r = vqrd...
  function VqrdmulhqNS32 (line 1285) | void VqrdmulhqNS32(int32x4_t* r, int32x4_t* v0, int32_t* v1) { *r = vqrd...
  function VqrdmulhsS32 (line 1286) | void VqrdmulhsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqrdmulhs...
  function VqrshlS8 (line 1287) | void VqrshlS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vqrshl_s8(...
  function VqrshlS16 (line 1288) | void VqrshlS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqrshl...
  function VqrshlS32 (line 1289) | void VqrshlS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqrshl...
  function VqrshlS64 (line 1290) | void VqrshlS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vqrshl...
  function VqrshlU8 (line 1291) | void VqrshlU8(uint8x8_t* r, uint8x8_t* v0, int8x8_t* v1) { *r = vqrshl_u...
  function VqrshlU16 (line 1292) | void VqrshlU16(uint16x4_t* r, uint16x4_t* v0, int16x4_t* v1) { *r = vqrs...
  function VqrshlU32 (line 1293) | void VqrshlU32(uint32x2_t* r, uint32x2_t* v0, int32x2_t* v1) { *r = vqrs...
  function VqrshlU64 (line 1294) | void VqrshlU64(uint64x1_t* r, uint64x1_t* v0, int64x1_t* v1) { *r = vqrs...
  function VqrshlbS8 (line 1295) | void VqrshlbS8(int8_t* r, int8_t* v0, int8_t* v1) { *r = vqrshlb_s8(*v0,...
  function VqrshlbU8 (line 1296) | void VqrshlbU8(uint8_t* r, uint8_t* v0, int8_t* v1) { *r = vqrshlb_u8(*v...
  function VqrshldS64 (line 1297) | void VqrshldS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vqrshld_s64...
  function VqrshldU64 (line 1298) | void VqrshldU64(uint64_t* r, uint64_t* v0, int64_t* v1) { *r = vqrshld_u...
  function VqrshlhS16 (line 1299) | void VqrshlhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqrshlh_s16...
  function VqrshlhU16 (line 1300) | void VqrshlhU16(uint16_t* r, uint16_t* v0, int16_t* v1) { *r = vqrshlh_u...
  function VqrshlqS8 (line 1301) | void VqrshlqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vqrshl...
  function VqrshlqS16 (line 1302) | void VqrshlqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqrsh...
  function VqrshlqS32 (line 1303) | void VqrshlqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqrsh...
  function VqrshlqS64 (line 1304) | void VqrshlqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vqrsh...
  function VqrshlqU8 (line 1305) | void VqrshlqU8(uint8x16_t* r, uint8x16_t* v0, int8x16_t* v1) { *r = vqrs...
  function VqrshlqU16 (line 1306) | void VqrshlqU16(uint16x8_t* r, uint16x8_t* v0, int16x8_t* v1) { *r = vqr...
  function VqrshlqU32 (line 1307) | void VqrshlqU32(uint32x4_t* r, uint32x4_t* v0, int32x4_t* v1) { *r = vqr...
  function VqrshlqU64 (line 1308) | void VqrshlqU64(uint64x2_t* r, uint64x2_t* v0, int64x2_t* v1) { *r = vqr...
  function VqrshlsS32 (line 1309) | void VqrshlsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqrshls_s32...
  function VqrshlsU32 (line 1310) | void VqrshlsU32(uint32_t* r, uint32_t* v0, int32_t* v1) { *r = vqrshls_u...
  function VqshlS8 (line 1311) | void VqshlS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vqshl_s8(*v...
  function VqshlS16 (line 1312) | void VqshlS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqshl_s...
  function VqshlS32 (line 1313) | void VqshlS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqshl_s...
  function VqshlS64 (line 1314) | void VqshlS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vqshl_s...
  function VqshlU8 (line 1315) | void VqshlU8(uint8x8_t* r, uint8x8_t* v0, int8x8_t* v1) { *r = vqshl_u8(...
  function VqshlU16 (line 1316) | void VqshlU16(uint16x4_t* r, uint16x4_t* v0, int16x4_t* v1) { *r = vqshl...
  function VqshlU32 (line 1317) | void VqshlU32(uint32x2_t* r, uint32x2_t* v0, int32x2_t* v1) { *r = vqshl...
  function VqshlU64 (line 1318) | void VqshlU64(uint64x1_t* r, uint64x1_t* v0, int64x1_t* v1) { *r = vqshl...
  function VqshlbS8 (line 1319) | void VqshlbS8(int8_t* r, int8_t* v0, int8_t* v1) { *r = vqshlb_s8(*v0, *...
  function VqshlbU8 (line 1320) | void VqshlbU8(uint8_t* r, uint8_t* v0, int8_t* v1) { *r = vqshlb_u8(*v0,...
  function VqshldS64 (line 1321) | void VqshldS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vqshld_s64(*...
  function VqshldU64 (line 1322) | void VqshldU64(uint64_t* r, uint64_t* v0, int64_t* v1) { *r = vqshld_u64...
  function VqshlhS16 (line 1323) | void VqshlhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqshlh_s16(*...
  function VqshlhU16 (line 1324) | void VqshlhU16(uint16_t* r, uint16_t* v0, int16_t* v1) { *r = vqshlh_u16...
  function VqshlqS8 (line 1325) | void VqshlqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vqshlq_...
  function VqshlqS16 (line 1326) | void VqshlqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqshlq...
  function VqshlqS32 (line 1327) | void VqshlqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqshlq...
  function VqshlqS64 (line 1328) | void VqshlqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vqshlq...
  function VqshlqU8 (line 1329) | void VqshlqU8(uint8x16_t* r, uint8x16_t* v0, int8x16_t* v1) { *r = vqshl...
  function VqshlqU16 (line 1330) | void VqshlqU16(uint16x8_t* r, uint16x8_t* v0, int16x8_t* v1) { *r = vqsh...
  function VqshlqU32 (line 1331) | void VqshlqU32(uint32x4_t* r, uint32x4_t* v0, int32x4_t* v1) { *r = vqsh...
  function VqshlqU64 (line 1332) | void VqshlqU64(uint64x2_t* r, uint64x2_t* v0, int64x2_t* v1) { *r = vqsh...
  function VqshlsS32 (line 1333) | void VqshlsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqshls_s32(*...
  function VqshlsU32 (line 1334) | void VqshlsU32(uint32_t* r, uint32_t* v0, int32_t* v1) { *r = vqshls_u32...
  function VqsubS8 (line 1335) | void VqsubS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vqsub_s8(*v...
  function VqsubS16 (line 1336) | void VqsubS16(int16x4_t* r, int16x4_t* v0, int16x4_t* v1) { *r = vqsub_s...
  function VqsubS32 (line 1337) | void VqsubS32(int32x2_t* r, int32x2_t* v0, int32x2_t* v1) { *r = vqsub_s...
  function VqsubS64 (line 1338) | void VqsubS64(int64x1_t* r, int64x1_t* v0, int64x1_t* v1) { *r = vqsub_s...
  function VqsubU8 (line 1339) | void VqsubU8(uint8x8_t* r, uint8x8_t* v0, uint8x8_t* v1) { *r = vqsub_u8...
  function VqsubU16 (line 1340) | void VqsubU16(uint16x4_t* r, uint16x4_t* v0, uint16x4_t* v1) { *r = vqsu...
  function VqsubU32 (line 1341) | void VqsubU32(uint32x2_t* r, uint32x2_t* v0, uint32x2_t* v1) { *r = vqsu...
  function VqsubU64 (line 1342) | void VqsubU64(uint64x1_t* r, uint64x1_t* v0, uint64x1_t* v1) { *r = vqsu...
  function VqsubbS8 (line 1343) | void VqsubbS8(int8_t* r, int8_t* v0, int8_t* v1) { *r = vqsubb_s8(*v0, *...
  function VqsubbU8 (line 1344) | void VqsubbU8(uint8_t* r, uint8_t* v0, uint8_t* v1) { *r = vqsubb_u8(*v0...
  function VqsubdS64 (line 1345) | void VqsubdS64(int64_t* r, int64_t* v0, int64_t* v1) { *r = vqsubd_s64(*...
  function VqsubdU64 (line 1346) | void VqsubdU64(uint64_t* r, uint64_t* v0, uint64_t* v1) { *r = vqsubd_u6...
  function VqsubhS16 (line 1347) | void VqsubhS16(int16_t* r, int16_t* v0, int16_t* v1) { *r = vqsubh_s16(*...
  function VqsubhU16 (line 1348) | void VqsubhU16(uint16_t* r, uint16_t* v0, uint16_t* v1) { *r = vqsubh_u1...
  function VqsubqS8 (line 1349) | void VqsubqS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1) { *r = vqsubq_...
  function VqsubqS16 (line 1350) | void VqsubqS16(int16x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vqsubq...
  function VqsubqS32 (line 1351) | void VqsubqS32(int32x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vqsubq...
  function VqsubqS64 (line 1352) | void VqsubqS64(int64x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vqsubq...
  function VqsubqU8 (line 1353) | void VqsubqU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vqsu...
  function VqsubqU16 (line 1354) | void VqsubqU16(uint16x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vqs...
  function VqsubqU32 (line 1355) | void VqsubqU32(uint32x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vqs...
  function VqsubqU64 (line 1356) | void VqsubqU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vqs...
  function VqsubsS32 (line 1357) | void VqsubsS32(int32_t* r, int32_t* v0, int32_t* v1) { *r = vqsubs_s32(*...
  function VqsubsU32 (line 1358) | void VqsubsU32(uint32_t* r, uint32_t* v0, uint32_t* v1) { *r = vqsubs_u3...
  function Vqtbl1S8 (line 1359) | void Vqtbl1S8(int8x8_t* r, int8x16_t* v0, uint8x8_t* v1) { *r = vqtbl1_s...
  function Vqtbl1U8 (line 1360) | void Vqtbl1U8(uint8x8_t* r, uint8x16_t* v0, uint8x8_t* v1) { *r = vqtbl1...
  function Vqtbl1P8 (line 1361) | void Vqtbl1P8(poly8x8_t* r, poly8x16_t* v0, uint8x8_t* v1) { *r = vqtbl1...
  function Vqtbl1QS8 (line 1362) | void Vqtbl1QS8(int8x16_t* r, int8x16_t* v0, uint8x16_t* v1) { *r = vqtbl...
  function Vqtbl1QU8 (line 1363) | void Vqtbl1QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1) { *r = vqt...
  function Vqtbl1QP8 (line 1364) | void Vqtbl1QP8(poly8x16_t* r, poly8x16_t* v0, uint8x16_t* v1) { *r = vqt...
  function Vqtbl2S8 (line 1365) | void Vqtbl2S8(int8x8_t* r, int8x16x2_t* v0, uint8x8_t* v1) { *r = vqtbl2...
  function Vqtbl2U8 (line 1366) | void Vqtbl2U8(uint8x8_t* r, uint8x16x2_t* v0, uint8x8_t* v1) { *r = vqtb...
  function Vqtbl2P8 (line 1367) | void Vqtbl2P8(poly8x8_t* r, poly8x16x2_t* v0, uint8x8_t* v1) { *r = vqtb...
  function Vqtbl2QS8 (line 1368) | void Vqtbl2QS8(int8x16_t* r, int8x16x2_t* v0, uint8x16_t* v1) { *r = vqt...
  function Vqtbl2QU8 (line 1369) | void Vqtbl2QU8(uint8x16_t* r, uint8x16x2_t* v0, uint8x16_t* v1) { *r = v...
  function Vqtbl2QP8 (line 1370) | void Vqtbl2QP8(poly8x16_t* r, poly8x16x2_t* v0, uint8x16_t* v1) { *r = v...
  function Vqtbl3S8 (line 1371) | void Vqtbl3S8(int8x8_t* r, int8x16x3_t* v0, uint8x8_t* v1) { *r = vqtbl3...
  function Vqtbl3U8 (line 1372) | void Vqtbl3U8(uint8x8_t* r, uint8x16x3_t* v0, uint8x8_t* v1) { *r = vqtb...
  function Vqtbl3P8 (line 1373) | void Vqtbl3P8(poly8x8_t* r, poly8x16x3_t* v0, uint8x8_t* v1) { *r = vqtb...
  function Vqtbl3QS8 (line 1374) | void Vqtbl3QS8(int8x16_t* r, int8x16x3_t* v0, uint8x16_t* v1) { *r = vqt...
  function Vqtbl3QU8 (line 1375) | void Vqtbl3QU8(uint8x16_t* r, uint8x16x3_t* v0, uint8x16_t* v1) { *r = v...
  function Vqtbl3QP8 (line 1376) | void Vqtbl3QP8(poly8x16_t* r, poly8x16x3_t* v0, uint8x16_t* v1) { *r = v...
  function Vqtbl4S8 (line 1377) | void Vqtbl4S8(int8x8_t* r, int8x16x4_t* v0, uint8x8_t* v1) { *r = vqtbl4...
  function Vqtbl4U8 (line 1378) | void Vqtbl4U8(uint8x8_t* r, uint8x16x4_t* v0, uint8x8_t* v1) { *r = vqtb...
  function Vqtbl4P8 (line 1379) | void Vqtbl4P8(poly8x8_t* r, poly8x16x4_t* v0, uint8x8_t* v1) { *r = vqtb...
  function Vqtbl4QS8 (line 1380) | void Vqtbl4QS8(int8x16_t* r, int8x16x4_t* v0, uint8x16_t* v1) { *r = vqt...
  function Vqtbl4QU8 (line 1381) | void Vqtbl4QU8(uint8x16_t* r, uint8x16x4_t* v0, uint8x16_t* v1) { *r = v...
  function Vqtbl4QP8 (line 1382) | void Vqtbl4QP8(poly8x16_t* r, poly8x16x4_t* v0, uint8x16_t* v1) { *r = v...
  function Vqtbx1S8 (line 1383) | void Vqtbx1S8(int8x8_t* r, int8x8_t* v0, int8x16_t* v1, uint8x8_t* v2) {...
  function Vqtbx1U8 (line 1384) | void Vqtbx1U8(uint8x8_t* r, uint8x8_t* v0, uint8x16_t* v1, uint8x8_t* v2...
  function Vqtbx1P8 (line 1385) | void Vqtbx1P8(poly8x8_t* r, poly8x8_t* v0, poly8x16_t* v1, uint8x8_t* v2...
  function Vqtbx1QS8 (line 1386) | void Vqtbx1QS8(int8x16_t* r, int8x16_t* v0, int8x16_t* v1, uint8x16_t* v...
  function Vqtbx1QU8 (line 1387) | void Vqtbx1QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16_t* v1, uint8x16_t...
  function Vqtbx1QP8 (line 1388) | void Vqtbx1QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16_t* v1, uint8x16_t...
  function Vqtbx2S8 (line 1389) | void Vqtbx2S8(int8x8_t* r, int8x8_t* v0, int8x16x2_t* v1, uint8x8_t* v2)...
  function Vqtbx2U8 (line 1390) | void Vqtbx2U8(uint8x8_t* r, uint8x8_t* v0, uint8x16x2_t* v1, uint8x8_t* ...
  function Vqtbx2P8 (line 1391) | void Vqtbx2P8(poly8x8_t* r, poly8x8_t* v0, poly8x16x2_t* v1, uint8x8_t* ...
  function Vqtbx2QS8 (line 1392) | void Vqtbx2QS8(int8x16_t* r, int8x16_t* v0, int8x16x2_t* v1, uint8x16_t*...
  function Vqtbx2QU8 (line 1393) | void Vqtbx2QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16x2_t* v1, uint8x16...
  function Vqtbx2QP8 (line 1394) | void Vqtbx2QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16x2_t* v1, uint8x16...
  function Vqtbx3S8 (line 1395) | void Vqtbx3S8(int8x8_t* r, int8x8_t* v0, int8x16x3_t* v1, uint8x8_t* v2)...
  function Vqtbx3U8 (line 1396) | void Vqtbx3U8(uint8x8_t* r, uint8x8_t* v0, uint8x16x3_t* v1, uint8x8_t* ...
  function Vqtbx3P8 (line 1397) | void Vqtbx3P8(poly8x8_t* r, poly8x8_t* v0, poly8x16x3_t* v1, uint8x8_t* ...
  function Vqtbx3QS8 (line 1398) | void Vqtbx3QS8(int8x16_t* r, int8x16_t* v0, int8x16x3_t* v1, uint8x16_t*...
  function Vqtbx3QU8 (line 1399) | void Vqtbx3QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16x3_t* v1, uint8x16...
  function Vqtbx3QP8 (line 1400) | void Vqtbx3QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16x3_t* v1, uint8x16...
  function Vqtbx4S8 (line 1401) | void Vqtbx4S8(int8x8_t* r, int8x8_t* v0, int8x16x4_t* v1, uint8x8_t* v2)...
  function Vqtbx4U8 (line 1402) | void Vqtbx4U8(uint8x8_t* r, uint8x8_t* v0, uint8x16x4_t* v1, uint8x8_t* ...
  function Vqtbx4P8 (line 1403) | void Vqtbx4P8(poly8x8_t* r, poly8x8_t* v0, poly8x16x4_t* v1, uint8x8_t* ...
  function Vqtbx4QS8 (line 1404) | void Vqtbx4QS8(int8x16_t* r, int8x16_t* v0, int8x16x4_t* v1, uint8x16_t*...
  function Vqtbx4QU8 (line 1405) | void Vqtbx4QU8(uint8x16_t* r, uint8x16_t* v0, uint8x16x4_t* v1, uint8x16...
  function Vqtbx4QP8 (line 1406) | void Vqtbx4QP8(poly8x16_t* r, poly8x16_t* v0, poly8x16x4_t* v1, uint8x16...
  function VraddhnS16 (line 1407) | void VraddhnS16(int8x8_t* r, int16x8_t* v0, int16x8_t* v1) { *r = vraddh...
  function VraddhnS32 (line 1408) | void VraddhnS32(int16x4_t* r, int32x4_t* v0, int32x4_t* v1) { *r = vradd...
  function VraddhnS64 (line 1409) | void VraddhnS64(int32x2_t* r, int64x2_t* v0, int64x2_t* v1) { *r = vradd...
  function VraddhnU16 (line 1410) | void VraddhnU16(uint8x8_t* r, uint16x8_t* v0, uint16x8_t* v1) { *r = vra...
  function VraddhnU32 (line 1411) | void VraddhnU32(uint16x4_t* r, uint32x4_t* v0, uint32x4_t* v1) { *r = vr...
  function VraddhnU64 (line 1412) | void VraddhnU64(uint32x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vr...
  function VraddhnHighS16 (line 1413) | void VraddhnHighS16(int8x16_t* r, int8x8_t* v0, int16x8_t* v1, int16x8_t...
  function VraddhnHighS32 (line 1414) | void VraddhnHighS32(int16x8_t* r, int16x4_t* v0, int32x4_t* v1, int32x4_...
  function VraddhnHighS64 (line 1415) | void VraddhnHighS64(int32x4_t* r, int32x2_t* v0, int64x2_t* v1, int64x2_...
  function VraddhnHighU16 (line 1416) | void VraddhnHighU16(uint8x16_t* r, uint8x8_t* v0, uint16x8_t* v1, uint16...
  function VraddhnHighU32 (line 1417) | void VraddhnHighU32(uint16x8_t* r, uint16x4_t* v0, uint32x4_t* v1, uint3...
  function VraddhnHighU64 (line 1418) | void VraddhnHighU64(uint32x4_t* r, uint32x2_t* v0, uint64x2_t* v1, uint6...
  function Vrax1QU64 (line 1419) | void Vrax1QU64(uint64x2_t* r, uint64x2_t* v0, uint64x2_t* v1) { *r = vra...
  function VrbitS8 (line 1420) | void VrbitS8(int8x8_t* r, int8x8_t* v0) { *r = vrbit_s8(*v0); }
  function VrbitU8 (line 1421) | void VrbitU8(uint8x8_t* r, uint8x8_t* v0) { *r = vrbit_u8(*v0); }
  function VrbitP8 (line 1422) | void VrbitP8(poly8x8_t* r, poly8x8_t* v0) { *r = vrbit_p8(*v0); }
  function VrbitqS8 (line 1423) | void VrbitqS8(int8x16_t* r, int8x16_t* v0) { *r = vrbitq_s8(*v0); }
  function VrbitqU8 (line 1424) | void VrbitqU8(uint8x16_t* r, uint8x16_t* v0) { *r = vrbitq_u8(*v0); }
  function VrbitqP8 (line 1425) | void VrbitqP8(poly8x16_t* r, poly8x16_t* v0) { *r = vrbitq_p8(*v0); }
  function VrecpeU32 (line 1426) | void VrecpeU32(uint32x2_t* r, uint32x2_t* v0) { *r = vrecpe_u32(*v0); }
  function VrecpeF32 (line 1427) | void VrecpeF32(float32x2_t* r, float32x2_t* v0) { *r = vrecpe_f32(*v0); }
  function VrecpeF64 (line 1428) | void VrecpeF64(float64x1_t* r, float64x1_t* v0) { *r = vrecpe_f64(*v0); }
  function VrecpedF64 (line 1429) | void VrecpedF64(float64_t* r, float64_t* v0) { *r = vrecped_f64(*v0); }
  function VrecpeqU32 (line 1430) | void VrecpeqU32(uint32x4_t* r, uint32x4_t* v0) { *r = vrecpeq_u32(*v0); }
  function VrecpeqF32 (line 1431) | void VrecpeqF32(float32x4_t* r, float32x4_t* v0) { *r = vrecpeq_f32(*v0); }
  function VrecpeqF64 (line 1432) | void VrecpeqF64(float64x2_t* r, float64x2_t* v0) { *r = vrecpeq_f64(*v0); }
  function VrecpesF32 (line 1433) | void VrecpesF32(float32_t* r, float32_t* v0) { *r = vrecpes_f32(*v0); }
  function VrecpsF32 (line 1434) | void VrecpsF32(float32x2_t* r, float32x2_t* v0, float32x2_t* v1) { *r = ...
  function VrecpsF64 (line 1435) | void VrecpsF64(float64x1_t* r, float64x1_t* v0, float64x1_t* v1) { *r = ...
  function VrecpsdF64 (line 1436) | void VrecpsdF64(float64_t* r, float64_t* v0, float64_t* v1) { *r = vrecp...
  function VrecpsqF32 (line 1437) | void VrecpsqF32(float32x4_t* r, float32x4_t* v0, float32x4_t* v1) { *r =...
  function VrecpsqF64 (line 1438) | void VrecpsqF64(float64x2_t* r, float64x2_t* v0, float64x2_t* v1) { *r =...
  function VrecpssF32 (line 1439) | void VrecpssF32(float32_t* r, float32_t* v0, float32_t* v1) { *r = vrecp...
  function VrecpxdF64 (line 1440) | void VrecpxdF64(float64_t* r, float64_t* v0) { *r = vrecpxd_f64(*v0); }
  function VrecpxsF32 (line 1441) | void VrecpxsF32(float32_t* r, float32_t* v0) { *r = vrecpxs_f32(*v0); }
  function VreinterpretF32S8 (line 1442) | void VreinterpretF32S8(float32x2_t* r, int8x8_t* v0) { *r = vreinterpret...
  function VreinterpretF32S16 (line 1443) | void VreinterpretF32S16(float32x2_t* r, int16x4_t* v0) { *r = vreinterpr...
  function VreinterpretF32S32 (line 1444) | void VreinterpretF32S32(float32x2_t* r, int32x2_t* v0) { *r = vreinterpr...
  function VreinterpretF32S64 (line 1445) | void VreinterpretF32S64(float32x2_t* r, int64x1_t* v0) { *r = vreinterpr...
  function VreinterpretF32U8 (line 1446) | void VreinterpretF32U8(float32x2_t* r, uint8x8_t* v0) { *r = vreinterpre...
  function VreinterpretF32U16 (line 1447) | void VreinterpretF32U16(float32x2_t* r, uint16x4_t* v0) { *r = vreinterp...
  function VreinterpretF32U32 (line 1448) | void VreinterpretF32U32(float32x2_t* r, uint32x2_t* v0) { *r = vreinterp...
  function VreinterpretF32U64 (line 1449) | void VreinterpretF32U64(float32x2_t* r, uint64x1_t* v0) { *r = vreinterp...
  function VreinterpretF32F64 (line 1450) | void VreinterpretF32F64(float32x2_t* r, float64x1_t* v0) { *r = vreinter...
  function VreinterpretF32P16 (line 1451) | void VreinterpretF32P16(float32x2_t* r, poly16x4_t* v0) { *r = vreinterp...
  function VreinterpretF32P64 (line 1452) | void VreinterpretF32P64(float32x2_t* r, poly64x1_t* v0) { *r = vreinterp...
  function VreinterpretF32P8 (line 1453) | void VreinterpretF32P8(float32x2_t* r, poly8x8_t* v0) { *r = vreinterpre...
  function VreinterpretF64S8 (line 1454) | void VreinterpretF64S8(float64x1_t* r, int8x8_t* v0) { *r = vreinterpret...
  function VreinterpretF64S16 (line 1455) | void VreinterpretF64S16(float64x1_t* r, int16x4_t* v0) { *r = vreinterpr...
  function VreinterpretF64S32 (line 1456) | void VreinterpretF64S32(float64x1_t* r, int32x2_t* v0) { *r = vreinterpr...
  function VreinterpretF64S64 (line 1457) | void VreinterpretF64S64(float64x1_t* r, int64x1_t* v0) { *r = vreinterpr...
  function VreinterpretF64U8 (line 1458) | void VreinterpretF64U8(float64x1_t* r, uint8x8_t* v0) { *r = vreinterpre...
  function VreinterpretF64U16 (line 1459) | void VreinterpretF64U16(float64x1_t* r, uint16x4_t* v0) { *r = vreinterp...
  function VreinterpretF64U32 (line 1460) | void VreinterpretF64U32(float64x1_t* r, uint32x2_t* v0) { *r = vreinterp...
  function VreinterpretF64U64 (line 1461) | void VreinterpretF64U64(float64x1_t* r, uint64x1_t* v0) { *r = vreinterp...
  function VreinterpretF64F32 (line 1462) | void VreinterpretF64F32(float64x1_t* r, float32x2_t* v0) { *r = vreinter...
  function VreinterpretF64P16 (line 1463) | void VreinterpretF64P16(float64x1_t* r, poly16x4_t* v0) { *r = vreinterp...
  function VreinterpretF64P64 (line 1464) | void VreinterpretF64P64(float64x1_t* r, poly64x1_t* v0) { *r = vreinterp...
  function VreinterpretF64P8 (line 1465) | void VreinterpretF64P8(float64x1_t* r, poly8x8_t* v0) { *r = vreinterpre...
  function VreinterpretP16S8 (line 1466) | void VreinterpretP16S8(poly16x4_t* r, int8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretP16S16 (line 1467) | void VreinterpretP16S16(poly16x4_t* r, int16x4_t* v0) { *r = vreinterpre...
  function VreinterpretP16S32 (line 1468) | void VreinterpretP16S32(poly16x4_t* r, int32x2_t* v0) { *r = vreinterpre...
  function VreinterpretP16S64 (line 1469) | void VreinterpretP16S64(poly16x4_t* r, int64x1_t* v0) { *r = vreinterpre...
  function VreinterpretP16U8 (line 1470) | void VreinterpretP16U8(poly16x4_t* r, uint8x8_t* v0) { *r = vreinterpret...
  function VreinterpretP16U16 (line 1471) | void VreinterpretP16U16(poly16x4_t* r, uint16x4_t* v0) { *r = vreinterpr...
  function VreinterpretP16U32 (line 1472) | void VreinterpretP16U32(poly16x4_t* r, uint32x2_t* v0) { *r = vreinterpr...
  function VreinterpretP16U64 (line 1473) | void VreinterpretP16U64(poly16x4_t* r, uint64x1_t* v0) { *r = vreinterpr...
  function VreinterpretP16F32 (line 1474) | void VreinterpretP16F32(poly16x4_t* r, float32x2_t* v0) { *r = vreinterp...
  function VreinterpretP16F64 (line 1475) | void VreinterpretP16F64(poly16x4_t* r, float64x1_t* v0) { *r = vreinterp...
  function VreinterpretP16P64 (line 1476) | void VreinterpretP16P64(poly16x4_t* r, poly64x1_t* v0) { *r = vreinterpr...
  function VreinterpretP16P8 (line 1477) | void VreinterpretP16P8(poly16x4_t* r, poly8x8_t* v0) { *r = vreinterpret...
  function VreinterpretP64S8 (line 1478) | void VreinterpretP64S8(poly64x1_t* r, int8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretP64S16 (line 1479) | void VreinterpretP64S16(poly64x1_t* r, int16x4_t* v0) { *r = vreinterpre...
  function VreinterpretP64S32 (line 1480) | void VreinterpretP64S32(poly64x1_t* r, int32x2_t* v0) { *r = vreinterpre...
  function VreinterpretP64S64 (line 1481) | void VreinterpretP64S64(poly64x1_t* r, int64x1_t* v0) { *r = vreinterpre...
  function VreinterpretP64U8 (line 1482) | void VreinterpretP64U8(poly64x1_t* r, uint8x8_t* v0) { *r = vreinterpret...
  function VreinterpretP64U16 (line 1483) | void VreinterpretP64U16(poly64x1_t* r, uint16x4_t* v0) { *r = vreinterpr...
  function VreinterpretP64U32 (line 1484) | void VreinterpretP64U32(poly64x1_t* r, uint32x2_t* v0) { *r = vreinterpr...
  function VreinterpretP64U64 (line 1485) | void VreinterpretP64U64(poly64x1_t* r, uint64x1_t* v0) { *r = vreinterpr...
  function VreinterpretP64F32 (line 1486) | void VreinterpretP64F32(poly64x1_t* r, float32x2_t* v0) { *r = vreinterp...
  function VreinterpretP64F64 (line 1487) | void VreinterpretP64F64(poly64x1_t* r, float64x1_t* v0) { *r = vreinterp...
  function VreinterpretP64P16 (line 1488) | void VreinterpretP64P16(poly64x1_t* r, poly16x4_t* v0) { *r = vreinterpr...
  function VreinterpretP64P8 (line 1489) | void VreinterpretP64P8(poly64x1_t* r, poly8x8_t* v0) { *r = vreinterpret...
  function VreinterpretP8S8 (line 1490) | void VreinterpretP8S8(poly8x8_t* r, int8x8_t* v0) { *r = vreinterpret_p8...
  function VreinterpretP8S16 (line 1491) | void VreinterpretP8S16(poly8x8_t* r, int16x4_t* v0) { *r = vreinterpret_...
  function VreinterpretP8S32 (line 1492) | void VreinterpretP8S32(poly8x8_t* r, int32x2_t* v0) { *r = vreinterpret_...
  function VreinterpretP8S64 (line 1493) | void VreinterpretP8S64(poly8x8_t* r, int64x1_t* v0) { *r = vreinterpret_...
  function VreinterpretP8U8 (line 1494) | void VreinterpretP8U8(poly8x8_t* r, uint8x8_t* v0) { *r = vreinterpret_p...
  function VreinterpretP8U16 (line 1495) | void VreinterpretP8U16(poly8x8_t* r, uint16x4_t* v0) { *r = vreinterpret...
  function VreinterpretP8U32 (line 1496) | void VreinterpretP8U32(poly8x8_t* r, uint32x2_t* v0) { *r = vreinterpret...
  function VreinterpretP8U64 (line 1497) | void VreinterpretP8U64(poly8x8_t* r, uint64x1_t* v0) { *r = vreinterpret...
  function VreinterpretP8F32 (line 1498) | void VreinterpretP8F32(poly8x8_t* r, float32x2_t* v0) { *r = vreinterpre...
  function VreinterpretP8F64 (line 1499) | void VreinterpretP8F64(poly8x8_t* r, float64x1_t* v0) { *r = vreinterpre...
  function VreinterpretP8P16 (line 1500) | void VreinterpretP8P16(poly8x8_t* r, poly16x4_t* v0) { *r = vreinterpret...
  function VreinterpretP8P64 (line 1501) | void VreinterpretP8P64(poly8x8_t* r, poly64x1_t* v0) { *r = vreinterpret...
  function VreinterpretS16S8 (line 1502) | void VreinterpretS16S8(int16x4_t* r, int8x8_t* v0) { *r = vreinterpret_s...
  function VreinterpretS16S32 (line 1503) | void VreinterpretS16S32(int16x4_t* r, int32x2_t* v0) { *r = vreinterpret...
  function VreinterpretS16S64 (line 1504) | void VreinterpretS16S64(int16x4_t* r, int64x1_t* v0) { *r = vreinterpret...
  function VreinterpretS16U8 (line 1505) | void VreinterpretS16U8(int16x4_t* r, uint8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretS16U16 (line 1506) | void VreinterpretS16U16(int16x4_t* r, uint16x4_t* v0) { *r = vreinterpre...
  function VreinterpretS16U32 (line 1507) | void VreinterpretS16U32(int16x4_t* r, uint32x2_t* v0) { *r = vreinterpre...
  function VreinterpretS16U64 (line 1508) | void VreinterpretS16U64(int16x4_t* r, uint64x1_t* v0) { *r = vreinterpre...
  function VreinterpretS16F32 (line 1509) | void VreinterpretS16F32(int16x4_t* r, float32x2_t* v0) { *r = vreinterpr...
  function VreinterpretS16F64 (line 1510) | void VreinterpretS16F64(int16x4_t* r, float64x1_t* v0) { *r = vreinterpr...
  function VreinterpretS16P16 (line 1511) | void VreinterpretS16P16(int16x4_t* r, poly16x4_t* v0) { *r = vreinterpre...
  function VreinterpretS16P64 (line 1512) | void VreinterpretS16P64(int16x4_t* r, poly64x1_t* v0) { *r = vreinterpre...
  function VreinterpretS16P8 (line 1513) | void VreinterpretS16P8(int16x4_t* r, poly8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretS32S8 (line 1514) | void VreinterpretS32S8(int32x2_t* r, int8x8_t* v0) { *r = vreinterpret_s...
  function VreinterpretS32S16 (line 1515) | void VreinterpretS32S16(int32x2_t* r, int16x4_t* v0) { *r = vreinterpret...
  function VreinterpretS32S64 (line 1516) | void VreinterpretS32S64(int32x2_t* r, int64x1_t* v0) { *r = vreinterpret...
  function VreinterpretS32U8 (line 1517) | void VreinterpretS32U8(int32x2_t* r, uint8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretS32U16 (line 1518) | void VreinterpretS32U16(int32x2_t* r, uint16x4_t* v0) { *r = vreinterpre...
  function VreinterpretS32U32 (line 1519) | void VreinterpretS32U32(int32x2_t* r, uint32x2_t* v0) { *r = vreinterpre...
  function VreinterpretS32U64 (line 1520) | void VreinterpretS32U64(int32x2_t* r, uint64x1_t* v0) { *r = vreinterpre...
  function VreinterpretS32F32 (line 1521) | void VreinterpretS32F32(int32x2_t* r, float32x2_t* v0) { *r = vreinterpr...
  function VreinterpretS32F64 (line 1522) | void VreinterpretS32F64(int32x2_t* r, float64x1_t* v0) { *r = vreinterpr...
  function VreinterpretS32P16 (line 1523) | void VreinterpretS32P16(int32x2_t* r, poly16x4_t* v0) { *r = vreinterpre...
  function VreinterpretS32P64 (line 1524) | void VreinterpretS32P64(int32x2_t* r, poly64x1_t* v0) { *r = vreinterpre...
  function VreinterpretS32P8 (line 1525) | void VreinterpretS32P8(int32x2_t* r, poly8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretS64S8 (line 1526) | void VreinterpretS64S8(int64x1_t* r, int8x8_t* v0) { *r = vreinterpret_s...
  function VreinterpretS64S16 (line 1527) | void VreinterpretS64S16(int64x1_t* r, int16x4_t* v0) { *r = vreinterpret...
  function VreinterpretS64S32 (line 1528) | void VreinterpretS64S32(int64x1_t* r, int32x2_t* v0) { *r = vreinterpret...
  function VreinterpretS64U8 (line 1529) | void VreinterpretS64U8(int64x1_t* r, uint8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretS64U16 (line 1530) | void VreinterpretS64U16(int64x1_t* r, uint16x4_t* v0) { *r = vreinterpre...
  function VreinterpretS64U32 (line 1531) | void VreinterpretS64U32(int64x1_t* r, uint32x2_t* v0) { *r = vreinterpre...
  function VreinterpretS64U64 (line 1532) | void VreinterpretS64U64(int64x1_t* r, uint64x1_t* v0) { *r = vreinterpre...
  function VreinterpretS64F32 (line 1533) | void VreinterpretS64F32(int64x1_t* r, float32x2_t* v0) { *r = vreinterpr...
  function VreinterpretS64F64 (line 1534) | void VreinterpretS64F64(int64x1_t* r, float64x1_t* v0) { *r = vreinterpr...
  function VreinterpretS64P16 (line 1535) | void VreinterpretS64P16(int64x1_t* r, poly16x4_t* v0) { *r = vreinterpre...
  function VreinterpretS64P64 (line 1536) | void VreinterpretS64P64(int64x1_t* r, poly64x1_t* v0) { *r = vreinterpre...
  function VreinterpretS64P8 (line 1537) | void VreinterpretS64P8(int64x1_t* r, poly8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretS8S16 (line 1538) | void VreinterpretS8S16(int8x8_t* r, int16x4_t* v0) { *r = vreinterpret_s...
  function VreinterpretS8S32 (line 1539) | void VreinterpretS8S32(int8x8_t* r, int32x2_t* v0) { *r = vreinterpret_s...
  function VreinterpretS8S64 (line 1540) | void VreinterpretS8S64(int8x8_t* r, int64x1_t* v0) { *r = vreinterpret_s...
  function VreinterpretS8U8 (line 1541) | void VreinterpretS8U8(int8x8_t* r, uint8x8_t* v0) { *r = vreinterpret_s8...
  function VreinterpretS8U16 (line 1542) | void VreinterpretS8U16(int8x8_t* r, uint16x4_t* v0) { *r = vreinterpret_...
  function VreinterpretS8U32 (line 1543) | void VreinterpretS8U32(int8x8_t* r, uint32x2_t* v0) { *r = vreinterpret_...
  function VreinterpretS8U64 (line 1544) | void VreinterpretS8U64(int8x8_t* r, uint64x1_t* v0) { *r = vreinterpret_...
  function VreinterpretS8F32 (line 1545) | void VreinterpretS8F32(int8x8_t* r, float32x2_t* v0) { *r = vreinterpret...
  function VreinterpretS8F64 (line 1546) | void VreinterpretS8F64(int8x8_t* r, float64x1_t* v0) { *r = vreinterpret...
  function VreinterpretS8P16 (line 1547) | void VreinterpretS8P16(int8x8_t* r, poly16x4_t* v0) { *r = vreinterpret_...
  function VreinterpretS8P64 (line 1548) | void VreinterpretS8P64(int8x8_t* r, poly64x1_t* v0) { *r = vreinterpret_...
  function VreinterpretS8P8 (line 1549) | void VreinterpretS8P8(int8x8_t* r, poly8x8_t* v0) { *r = vreinterpret_s8...
  function VreinterpretU16S8 (line 1550) | void VreinterpretU16S8(uint16x4_t* r, int8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretU16S16 (line 1551) | void VreinterpretU16S16(uint16x4_t* r, int16x4_t* v0) { *r = vreinterpre...
  function VreinterpretU16S32 (line 1552) | void VreinterpretU16S32(uint16x4_t* r, int32x2_t* v0) { *r = vreinterpre...
  function VreinterpretU16S64 (line 1553) | void VreinterpretU16S64(uint16x4_t* r, int64x1_t* v0) { *r = vreinterpre...
  function VreinterpretU16U8 (line 1554) | void VreinterpretU16U8(uint16x4_t* r, uint8x8_t* v0) { *r = vreinterpret...
  function VreinterpretU16U32 (line 1555) | void VreinterpretU16U32(uint16x4_t* r, uint32x2_t* v0) { *r = vreinterpr...
  function VreinterpretU16U64 (line 1556) | void VreinterpretU16U64(uint16x4_t* r, uint64x1_t* v0) { *r = vreinterpr...
  function VreinterpretU16F32 (line 1557) | void VreinterpretU16F32(uint16x4_t* r, float32x2_t* v0) { *r = vreinterp...
  function VreinterpretU16F64 (line 1558) | void VreinterpretU16F64(uint16x4_t* r, float64x1_t* v0) { *r = vreinterp...
  function VreinterpretU16P16 (line 1559) | void VreinterpretU16P16(uint16x4_t* r, poly16x4_t* v0) { *r = vreinterpr...
  function VreinterpretU16P64 (line 1560) | void VreinterpretU16P64(uint16x4_t* r, poly64x1_t* v0) { *r = vreinterpr...
  function VreinterpretU16P8 (line 1561) | void VreinterpretU16P8(uint16x4_t* r, poly8x8_t* v0) { *r = vreinterpret...
  function VreinterpretU32S8 (line 1562) | void VreinterpretU32S8(uint32x2_t* r, int8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretU32S16 (line 1563) | void VreinterpretU32S16(uint32x2_t* r, int16x4_t* v0) { *r = vreinterpre...
  function VreinterpretU32S32 (line 1564) | void VreinterpretU32S32(uint32x2_t* r, int32x2_t* v0) { *r = vreinterpre...
  function VreinterpretU32S64 (line 1565) | void VreinterpretU32S64(uint32x2_t* r, int64x1_t* v0) { *r = vreinterpre...
  function VreinterpretU32U8 (line 1566) | void VreinterpretU32U8(uint32x2_t* r, uint8x8_t* v0) { *r = vreinterpret...
  function VreinterpretU32U16 (line 1567) | void VreinterpretU32U16(uint32x2_t* r, uint16x4_t* v0) { *r = vreinterpr...
  function VreinterpretU32U64 (line 1568) | void VreinterpretU32U64(uint32x2_t* r, uint64x1_t* v0) { *r = vreinterpr...
  function VreinterpretU32F32 (line 1569) | void VreinterpretU32F32(uint32x2_t* r, float32x2_t* v0) { *r = vreinterp...
  function VreinterpretU32F64 (line 1570) | void VreinterpretU32F64(uint32x2_t* r, float64x1_t* v0) { *r = vreinterp...
  function VreinterpretU32P16 (line 1571) | void VreinterpretU32P16(uint32x2_t* r, poly16x4_t* v0) { *r = vreinterpr...
  function VreinterpretU32P64 (line 1572) | void VreinterpretU32P64(uint32x2_t* r, poly64x1_t* v0) { *r = vreinterpr...
  function VreinterpretU32P8 (line 1573) | void VreinterpretU32P8(uint32x2_t* r, poly8x8_t* v0) { *r = vreinterpret...
  function VreinterpretU64S8 (line 1574) | void VreinterpretU64S8(uint64x1_t* r, int8x8_t* v0) { *r = vreinterpret_...
  function VreinterpretU64S16 (line 1575) | void VreinterpretU64S16(uint64x1_t* r, int16x4_t* v0) { *r = vreinterpre...
  function VreinterpretU64S32 (line 1576) | void VreinterpretU64S32(uint64x1_t* r, int32x2_t* v0) { *r = vreinterpre...
  function VreinterpretU64S64 (line 1577) | void VreinterpretU64S64(uint64x1_t* r, int64x1_t* v0) { *r = vreinterpre...
  function VreinterpretU64U8 (line 1578) | void VreinterpretU64U8(uint64x1_t* r, uint8x8_t* v0) { *r = vreinterpret...
  function VreinterpretU64U16 (line 1579) | void VreinterpretU64U16(uint64x1_t* r, uint16x4_t* v0) { *r = vreinterpr...
  function VreinterpretU64U32 (line 1580) | void VreinterpretU64U32(uint64x1_t* r, uint32x2_t* v0) { *r = vreinterpr...
  function VreinterpretU64F32 (line 1581) | void VreinterpretU64F32(uint64x1_t* r, float32x2_t* v0) { *r = vreinterp...
  function VreinterpretU64F64 (line 1582) | void VreinterpretU64F64(uint64x1_t* r, float64x1_t* v0) { *r = vreinterp...
  function VreinterpretU64P16 (line 1583) | void VreinterpretU64P16(uint64x1_t* r, poly16x4_t* v0) { *r = vreinterpr...
  function VreinterpretU64P64 (line 1584) | void VreinterpretU64P64(uint64x1_t* r, poly64x1_t* v0) { *r = vreinterpr...
  function VreinterpretU64P8 (line 1585) | void VreinterpretU64P8(uint64x1_t* r, poly8x8_t* v0) { *r = vreinterpret...
  function VreinterpretU8S8 (line 1586) | void VreinterpretU8S8(uint8x8_t* r, int8x8_t* v0) { *r = vreinterpret_u8...
  function VreinterpretU8S16 (line 1587) | void VreinterpretU8S16(uint8x8_t* r, int16x4_t* v0) { *r = vreinterpret_...
  function VreinterpretU8S32 (line 1588) | void VreinterpretU8S32(uint8x8_t* r, int32x2_t* v0) { *r = vreinterpret_...
  function VreinterpretU8S64 (line 1589) | void VreinterpretU8S64(uint8x8_t* r, int64x1_t* v0) { *r = vreinterpret_...
  function VreinterpretU8U16 (line 1590) | void VreinterpretU8U16(uint8x8_t* r, uint16x4_t* v0) { *r = vreinterpret...
  function VreinterpretU8U32 (line 1591) | void VreinterpretU8U32(uint8x8_t* r, uint32x2_t* v0) { *r = vreinterpret...
  function VreinterpretU8U64 (line 1592) | void VreinterpretU8U64(uint8x8_t* r, uint64x1_t* v0) { *r = vreinterpret...
  function VreinterpretU8F32 (line 1593) | void VreinterpretU8F32(uint8x8_t* r, float32x2_t* v0) { *r = vreinterpre...
  function VreinterpretU8F64 (line 1594) | void VreinterpretU8F64(uint8x8_t* r, float64x1_t* v0) { *r = vreinterpre...
  function VreinterpretU8P16 (line 1595) | void VreinterpretU8P16(uint8x8_t* r, poly16x4_t* v0) { *r = vreinterpret...
  function VreinterpretU8P64 (line 1596) | void VreinterpretU8P64(uint8x8_t* r, poly64x1_t* v0) { *r = vreinterpret...
  function VreinterpretU8P8 (line 1597) | void VreinterpretU8P8(uint8x8_t* r, poly8x8_t* v0) { *r = vreinterpret_u...
  function VreinterpretqF32S8 (line 1598) | void VreinterpretqF32S8(float32x4_t* r, int8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqF32S16 (line 1599) | void VreinterpretqF32S16(float32x4_t* r, int16x8_t* v0) { *r = vreinterp...
  function VreinterpretqF32S32 (line 1600) | void VreinterpretqF32S32(float32x4_t* r, int32x4_t* v0) { *r = vreinterp...
  function VreinterpretqF32S64 (line 1601) | void VreinterpretqF32S64(float32x4_t* r, int64x2_t* v0) { *r = vreinterp...
  function VreinterpretqF32U8 (line 1602) | void VreinterpretqF32U8(float32x4_t* r, uint8x16_t* v0) { *r = vreinterp...
  function VreinterpretqF32U16 (line 1603) | void VreinterpretqF32U16(float32x4_t* r, uint16x8_t* v0) { *r = vreinter...
  function VreinterpretqF32U32 (line 1604) | void VreinterpretqF32U32(float32x4_t* r, uint32x4_t* v0) { *r = vreinter...
  function VreinterpretqF32U64 (line 1605) | void VreinterpretqF32U64(float32x4_t* r, uint64x2_t* v0) { *r = vreinter...
  function VreinterpretqF32F64 (line 1606) | void VreinterpretqF32F64(float32x4_t* r, float64x2_t* v0) { *r = vreinte...
  function VreinterpretqF32P128 (line 1607) | void VreinterpretqF32P128(float32x4_t* r, poly128_t* v0) { *r = vreinter...
  function VreinterpretqF32P16 (line 1608) | void VreinterpretqF32P16(float32x4_t* r, poly16x8_t* v0) { *r = vreinter...
  function VreinterpretqF32P64 (line 1609) | void VreinterpretqF32P64(float32x4_t* r, poly64x2_t* v0) { *r = vreinter...
  function VreinterpretqF32P8 (line 1610) | void VreinterpretqF32P8(float32x4_t* r, poly8x16_t* v0) { *r = vreinterp...
  function VreinterpretqF64S8 (line 1611) | void VreinterpretqF64S8(float64x2_t* r, int8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqF64S16 (line 1612) | void VreinterpretqF64S16(float64x2_t* r, int16x8_t* v0) { *r = vreinterp...
  function VreinterpretqF64S32 (line 1613) | void VreinterpretqF64S32(float64x2_t* r, int32x4_t* v0) { *r = vreinterp...
  function VreinterpretqF64S64 (line 1614) | void VreinterpretqF64S64(float64x2_t* r, int64x2_t* v0) { *r = vreinterp...
  function VreinterpretqF64U8 (line 1615) | void VreinterpretqF64U8(float64x2_t* r, uint8x16_t* v0) { *r = vreinterp...
  function VreinterpretqF64U16 (line 1616) | void VreinterpretqF64U16(float64x2_t* r, uint16x8_t* v0) { *r = vreinter...
  function VreinterpretqF64U32 (line 1617) | void VreinterpretqF64U32(float64x2_t* r, uint32x4_t* v0) { *r = vreinter...
  function VreinterpretqF64U64 (line 1618) | void VreinterpretqF64U64(float64x2_t* r, uint64x2_t* v0) { *r = vreinter...
  function VreinterpretqF64F32 (line 1619) | void VreinterpretqF64F32(float64x2_t* r, float32x4_t* v0) { *r = vreinte...
  function VreinterpretqF64P128 (line 1620) | void VreinterpretqF64P128(float64x2_t* r, poly128_t* v0) { *r = vreinter...
  function VreinterpretqF64P16 (line 1621) | void VreinterpretqF64P16(float64x2_t* r, poly16x8_t* v0) { *r = vreinter...
  function VreinterpretqF64P64 (line 1622) | void VreinterpretqF64P64(float64x2_t* r, poly64x2_t* v0) { *r = vreinter...
  function VreinterpretqF64P8 (line 1623) | void VreinterpretqF64P8(float64x2_t* r, poly8x16_t* v0) { *r = vreinterp...
  function VreinterpretqP128S8 (line 1624) | void VreinterpretqP128S8(poly128_t* r, int8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqP128S16 (line 1625) | void VreinterpretqP128S16(poly128_t* r, int16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqP128S32 (line 1626) | void VreinterpretqP128S32(poly128_t* r, int32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqP128S64 (line 1627) | void VreinterpretqP128S64(poly128_t* r, int64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqP128U8 (line 1628) | void VreinterpretqP128U8(poly128_t* r, uint8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqP128U16 (line 1629) | void VreinterpretqP128U16(poly128_t* r, uint16x8_t* v0) { *r = vreinterp...
  function VreinterpretqP128U32 (line 1630) | void VreinterpretqP128U32(poly128_t* r, uint32x4_t* v0) { *r = vreinterp...
  function VreinterpretqP128U64 (line 1631) | void VreinterpretqP128U64(poly128_t* r, uint64x2_t* v0) { *r = vreinterp...
  function VreinterpretqP128F32 (line 1632) | void VreinterpretqP128F32(poly128_t* r, float32x4_t* v0) { *r = vreinter...
  function VreinterpretqP128F64 (line 1633) | void VreinterpretqP128F64(poly128_t* r, float64x2_t* v0) { *r = vreinter...
  function VreinterpretqP128P16 (line 1634) | void VreinterpretqP128P16(poly128_t* r, poly16x8_t* v0) { *r = vreinterp...
  function VreinterpretqP128P64 (line 1635) | void VreinterpretqP128P64(poly128_t* r, poly64x2_t* v0) { *r = vreinterp...
  function VreinterpretqP128P8 (line 1636) | void VreinterpretqP128P8(poly128_t* r, poly8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqP16S8 (line 1637) | void VreinterpretqP16S8(poly16x8_t* r, int8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqP16S16 (line 1638) | void VreinterpretqP16S16(poly16x8_t* r, int16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqP16S32 (line 1639) | void VreinterpretqP16S32(poly16x8_t* r, int32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqP16S64 (line 1640) | void VreinterpretqP16S64(poly16x8_t* r, int64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqP16U8 (line 1641) | void VreinterpretqP16U8(poly16x8_t* r, uint8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqP16U16 (line 1642) | void VreinterpretqP16U16(poly16x8_t* r, uint16x8_t* v0) { *r = vreinterp...
  function VreinterpretqP16U32 (line 1643) | void VreinterpretqP16U32(poly16x8_t* r, uint32x4_t* v0) { *r = vreinterp...
  function VreinterpretqP16U64 (line 1644) | void VreinterpretqP16U64(poly16x8_t* r, uint64x2_t* v0) { *r = vreinterp...
  function VreinterpretqP16F32 (line 1645) | void VreinterpretqP16F32(poly16x8_t* r, float32x4_t* v0) { *r = vreinter...
  function VreinterpretqP16F64 (line 1646) | void VreinterpretqP16F64(poly16x8_t* r, float64x2_t* v0) { *r = vreinter...
  function VreinterpretqP16P128 (line 1647) | void VreinterpretqP16P128(poly16x8_t* r, poly128_t* v0) { *r = vreinterp...
  function VreinterpretqP16P64 (line 1648) | void VreinterpretqP16P64(poly16x8_t* r, poly64x2_t* v0) { *r = vreinterp...
  function VreinterpretqP16P8 (line 1649) | void VreinterpretqP16P8(poly16x8_t* r, poly8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqP64S8 (line 1650) | void VreinterpretqP64S8(poly64x2_t* r, int8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqP64S16 (line 1651) | void VreinterpretqP64S16(poly64x2_t* r, int16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqP64S32 (line 1652) | void VreinterpretqP64S32(poly64x2_t* r, int32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqP64S64 (line 1653) | void VreinterpretqP64S64(poly64x2_t* r, int64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqP64U8 (line 1654) | void VreinterpretqP64U8(poly64x2_t* r, uint8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqP64U16 (line 1655) | void VreinterpretqP64U16(poly64x2_t* r, uint16x8_t* v0) { *r = vreinterp...
  function VreinterpretqP64U32 (line 1656) | void VreinterpretqP64U32(poly64x2_t* r, uint32x4_t* v0) { *r = vreinterp...
  function VreinterpretqP64U64 (line 1657) | void VreinterpretqP64U64(poly64x2_t* r, uint64x2_t* v0) { *r = vreinterp...
  function VreinterpretqP64F32 (line 1658) | void VreinterpretqP64F32(poly64x2_t* r, float32x4_t* v0) { *r = vreinter...
  function VreinterpretqP64F64 (line 1659) | void VreinterpretqP64F64(poly64x2_t* r, float64x2_t* v0) { *r = vreinter...
  function VreinterpretqP64P128 (line 1660) | void VreinterpretqP64P128(poly64x2_t* r, poly128_t* v0) { *r = vreinterp...
  function VreinterpretqP64P16 (line 1661) | void VreinterpretqP64P16(poly64x2_t* r, poly16x8_t* v0) { *r = vreinterp...
  function VreinterpretqP64P8 (line 1662) | void VreinterpretqP64P8(poly64x2_t* r, poly8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqP8S8 (line 1663) | void VreinterpretqP8S8(poly8x16_t* r, int8x16_t* v0) { *r = vreinterpret...
  function VreinterpretqP8S16 (line 1664) | void VreinterpretqP8S16(poly8x16_t* r, int16x8_t* v0) { *r = vreinterpre...
  function VreinterpretqP8S32 (line 1665) | void VreinterpretqP8S32(poly8x16_t* r, int32x4_t* v0) { *r = vreinterpre...
  function VreinterpretqP8S64 (line 1666) | void VreinterpretqP8S64(poly8x16_t* r, int64x2_t* v0) { *r = vreinterpre...
  function VreinterpretqP8U8 (line 1667) | void VreinterpretqP8U8(poly8x16_t* r, uint8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqP8U16 (line 1668) | void VreinterpretqP8U16(poly8x16_t* r, uint16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqP8U32 (line 1669) | void VreinterpretqP8U32(poly8x16_t* r, uint32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqP8U64 (line 1670) | void VreinterpretqP8U64(poly8x16_t* r, uint64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqP8F32 (line 1671) | void VreinterpretqP8F32(poly8x16_t* r, float32x4_t* v0) { *r = vreinterp...
  function VreinterpretqP8F64 (line 1672) | void VreinterpretqP8F64(poly8x16_t* r, float64x2_t* v0) { *r = vreinterp...
  function VreinterpretqP8P128 (line 1673) | void VreinterpretqP8P128(poly8x16_t* r, poly128_t* v0) { *r = vreinterpr...
  function VreinterpretqP8P16 (line 1674) | void VreinterpretqP8P16(poly8x16_t* r, poly16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqP8P64 (line 1675) | void VreinterpretqP8P64(poly8x16_t* r, poly64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqS16S8 (line 1676) | void VreinterpretqS16S8(int16x8_t* r, int8x16_t* v0) { *r = vreinterpret...
  function VreinterpretqS16S32 (line 1677) | void VreinterpretqS16S32(int16x8_t* r, int32x4_t* v0) { *r = vreinterpre...
  function VreinterpretqS16S64 (line 1678) | void VreinterpretqS16S64(int16x8_t* r, int64x2_t* v0) { *r = vreinterpre...
  function VreinterpretqS16U8 (line 1679) | void VreinterpretqS16U8(int16x8_t* r, uint8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqS16U16 (line 1680) | void VreinterpretqS16U16(int16x8_t* r, uint16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqS16U32 (line 1681) | void VreinterpretqS16U32(int16x8_t* r, uint32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqS16U64 (line 1682) | void VreinterpretqS16U64(int16x8_t* r, uint64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqS16F32 (line 1683) | void VreinterpretqS16F32(int16x8_t* r, float32x4_t* v0) { *r = vreinterp...
  function VreinterpretqS16F64 (line 1684) | void VreinterpretqS16F64(int16x8_t* r, float64x2_t* v0) { *r = vreinterp...
  function VreinterpretqS16P128 (line 1685) | void VreinterpretqS16P128(int16x8_t* r, poly128_t* v0) { *r = vreinterpr...
  function VreinterpretqS16P16 (line 1686) | void VreinterpretqS16P16(int16x8_t* r, poly16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqS16P64 (line 1687) | void VreinterpretqS16P64(int16x8_t* r, poly64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqS16P8 (line 1688) | void VreinterpretqS16P8(int16x8_t* r, poly8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqS32S8 (line 1689) | void VreinterpretqS32S8(int32x4_t* r, int8x16_t* v0) { *r = vreinterpret...
  function VreinterpretqS32S16 (line 1690) | void VreinterpretqS32S16(int32x4_t* r, int16x8_t* v0) { *r = vreinterpre...
  function VreinterpretqS32S64 (line 1691) | void VreinterpretqS32S64(int32x4_t* r, int64x2_t* v0) { *r = vreinterpre...
  function VreinterpretqS32U8 (line 1692) | void VreinterpretqS32U8(int32x4_t* r, uint8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqS32U16 (line 1693) | void VreinterpretqS32U16(int32x4_t* r, uint16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqS32U32 (line 1694) | void VreinterpretqS32U32(int32x4_t* r, uint32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqS32U64 (line 1695) | void VreinterpretqS32U64(int32x4_t* r, uint64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqS32F32 (line 1696) | void VreinterpretqS32F32(int32x4_t* r, float32x4_t* v0) { *r = vreinterp...
  function VreinterpretqS32F64 (line 1697) | void VreinterpretqS32F64(int32x4_t* r, float64x2_t* v0) { *r = vreinterp...
  function VreinterpretqS32P128 (line 1698) | void VreinterpretqS32P128(int32x4_t* r, poly128_t* v0) { *r = vreinterpr...
  function VreinterpretqS32P16 (line 1699) | void VreinterpretqS32P16(int32x4_t* r, poly16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqS32P64 (line 1700) | void VreinterpretqS32P64(int32x4_t* r, poly64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqS32P8 (line 1701) | void VreinterpretqS32P8(int32x4_t* r, poly8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqS64S8 (line 1702) | void VreinterpretqS64S8(int64x2_t* r, int8x16_t* v0) { *r = vreinterpret...
  function VreinterpretqS64S16 (line 1703) | void VreinterpretqS64S16(int64x2_t* r, int16x8_t* v0) { *r = vreinterpre...
  function VreinterpretqS64S32 (line 1704) | void VreinterpretqS64S32(int64x2_t* r, int32x4_t* v0) { *r = vreinterpre...
  function VreinterpretqS64U8 (line 1705) | void VreinterpretqS64U8(int64x2_t* r, uint8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqS64U16 (line 1706) | void VreinterpretqS64U16(int64x2_t* r, uint16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqS64U32 (line 1707) | void VreinterpretqS64U32(int64x2_t* r, uint32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqS64U64 (line 1708) | void VreinterpretqS64U64(int64x2_t* r, uint64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqS64F32 (line 1709) | void VreinterpretqS64F32(int64x2_t* r, float32x4_t* v0) { *r = vreinterp...
  function VreinterpretqS64F64 (line 1710) | void VreinterpretqS64F64(int64x2_t* r, float64x2_t* v0) { *r = vreinterp...
  function VreinterpretqS64P128 (line 1711) | void VreinterpretqS64P128(int64x2_t* r, poly128_t* v0) { *r = vreinterpr...
  function VreinterpretqS64P16 (line 1712) | void VreinterpretqS64P16(int64x2_t* r, poly16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqS64P64 (line 1713) | void VreinterpretqS64P64(int64x2_t* r, poly64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqS64P8 (line 1714) | void VreinterpretqS64P8(int64x2_t* r, poly8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqS8S16 (line 1715) | void VreinterpretqS8S16(int8x16_t* r, int16x8_t* v0) { *r = vreinterpret...
  function VreinterpretqS8S32 (line 1716) | void VreinterpretqS8S32(int8x16_t* r, int32x4_t* v0) { *r = vreinterpret...
  function VreinterpretqS8S64 (line 1717) | void VreinterpretqS8S64(int8x16_t* r, int64x2_t* v0) { *r = vreinterpret...
  function VreinterpretqS8U8 (line 1718) | void VreinterpretqS8U8(int8x16_t* r, uint8x16_t* v0) { *r = vreinterpret...
  function VreinterpretqS8U16 (line 1719) | void VreinterpretqS8U16(int8x16_t* r, uint16x8_t* v0) { *r = vreinterpre...
  function VreinterpretqS8U32 (line 1720) | void VreinterpretqS8U32(int8x16_t* r, uint32x4_t* v0) { *r = vreinterpre...
  function VreinterpretqS8U64 (line 1721) | void VreinterpretqS8U64(int8x16_t* r, uint64x2_t* v0) { *r = vreinterpre...
  function VreinterpretqS8F32 (line 1722) | void VreinterpretqS8F32(int8x16_t* r, float32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqS8F64 (line 1723) | void VreinterpretqS8F64(int8x16_t* r, float64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqS8P128 (line 1724) | void VreinterpretqS8P128(int8x16_t* r, poly128_t* v0) { *r = vreinterpre...
  function VreinterpretqS8P16 (line 1725) | void VreinterpretqS8P16(int8x16_t* r, poly16x8_t* v0) { *r = vreinterpre...
  function VreinterpretqS8P64 (line 1726) | void VreinterpretqS8P64(int8x16_t* r, poly64x2_t* v0) { *r = vreinterpre...
  function VreinterpretqS8P8 (line 1727) | void VreinterpretqS8P8(int8x16_t* r, poly8x16_t* v0) { *r = vreinterpret...
  function VreinterpretqU16S8 (line 1728) | void VreinterpretqU16S8(uint16x8_t* r, int8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqU16S16 (line 1729) | void VreinterpretqU16S16(uint16x8_t* r, int16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqU16S32 (line 1730) | void VreinterpretqU16S32(uint16x8_t* r, int32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqU16S64 (line 1731) | void VreinterpretqU16S64(uint16x8_t* r, int64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqU16U8 (line 1732) | void VreinterpretqU16U8(uint16x8_t* r, uint8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqU16U32 (line 1733) | void VreinterpretqU16U32(uint16x8_t* r, uint32x4_t* v0) { *r = vreinterp...
  function VreinterpretqU16U64 (line 1734) | void VreinterpretqU16U64(uint16x8_t* r, uint64x2_t* v0) { *r = vreinterp...
  function VreinterpretqU16F32 (line 1735) | void VreinterpretqU16F32(uint16x8_t* r, float32x4_t* v0) { *r = vreinter...
  function VreinterpretqU16F64 (line 1736) | void VreinterpretqU16F64(uint16x8_t* r, float64x2_t* v0) { *r = vreinter...
  function VreinterpretqU16P128 (line 1737) | void VreinterpretqU16P128(uint16x8_t* r, poly128_t* v0) { *r = vreinterp...
  function VreinterpretqU16P16 (line 1738) | void VreinterpretqU16P16(uint16x8_t* r, poly16x8_t* v0) { *r = vreinterp...
  function VreinterpretqU16P64 (line 1739) | void VreinterpretqU16P64(uint16x8_t* r, poly64x2_t* v0) { *r = vreinterp...
  function VreinterpretqU16P8 (line 1740) | void VreinterpretqU16P8(uint16x8_t* r, poly8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqU32S8 (line 1741) | void VreinterpretqU32S8(uint32x4_t* r, int8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqU32S16 (line 1742) | void VreinterpretqU32S16(uint32x4_t* r, int16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqU32S32 (line 1743) | void VreinterpretqU32S32(uint32x4_t* r, int32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqU32S64 (line 1744) | void VreinterpretqU32S64(uint32x4_t* r, int64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqU32U8 (line 1745) | void VreinterpretqU32U8(uint32x4_t* r, uint8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqU32U16 (line 1746) | void VreinterpretqU32U16(uint32x4_t* r, uint16x8_t* v0) { *r = vreinterp...
  function VreinterpretqU32U64 (line 1747) | void VreinterpretqU32U64(uint32x4_t* r, uint64x2_t* v0) { *r = vreinterp...
  function VreinterpretqU32F32 (line 1748) | void VreinterpretqU32F32(uint32x4_t* r, float32x4_t* v0) { *r = vreinter...
  function VreinterpretqU32F64 (line 1749) | void VreinterpretqU32F64(uint32x4_t* r, float64x2_t* v0) { *r = vreinter...
  function VreinterpretqU32P128 (line 1750) | void VreinterpretqU32P128(uint32x4_t* r, poly128_t* v0) { *r = vreinterp...
  function VreinterpretqU32P16 (line 1751) | void VreinterpretqU32P16(uint32x4_t* r, poly16x8_t* v0) { *r = vreinterp...
  function VreinterpretqU32P64 (line 1752) | void VreinterpretqU32P64(uint32x4_t* r, poly64x2_t* v0) { *r = vreinterp...
  function VreinterpretqU32P8 (line 1753) | void VreinterpretqU32P8(uint32x4_t* r, poly8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqU64S8 (line 1754) | void VreinterpretqU64S8(uint64x2_t* r, int8x16_t* v0) { *r = vreinterpre...
  function VreinterpretqU64S16 (line 1755) | void VreinterpretqU64S16(uint64x2_t* r, int16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqU64S32 (line 1756) | void VreinterpretqU64S32(uint64x2_t* r, int32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqU64S64 (line 1757) | void VreinterpretqU64S64(uint64x2_t* r, int64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqU64U8 (line 1758) | void VreinterpretqU64U8(uint64x2_t* r, uint8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqU64U16 (line 1759) | void VreinterpretqU64U16(uint64x2_t* r, uint16x8_t* v0) { *r = vreinterp...
  function VreinterpretqU64U32 (line 1760) | void VreinterpretqU64U32(uint64x2_t* r, uint32x4_t* v0) { *r = vreinterp...
  function VreinterpretqU64F32 (line 1761) | void VreinterpretqU64F32(uint64x2_t* r, float32x4_t* v0) { *r = vreinter...
  function VreinterpretqU64F64 (line 1762) | void VreinterpretqU64F64(uint64x2_t* r, float64x2_t* v0) { *r = vreinter...
  function VreinterpretqU64P128 (line 1763) | void VreinterpretqU64P128(uint64x2_t* r, poly128_t* v0) { *r = vreinterp...
  function VreinterpretqU64P16 (line 1764) | void VreinterpretqU64P16(uint64x2_t* r, poly16x8_t* v0) { *r = vreinterp...
  function VreinterpretqU64P64 (line 1765) | void VreinterpretqU64P64(uint64x2_t* r, poly64x2_t* v0) { *r = vreinterp...
  function VreinterpretqU64P8 (line 1766) | void VreinterpretqU64P8(uint64x2_t* r, poly8x16_t* v0) { *r = vreinterpr...
  function VreinterpretqU8S8 (line 1767) | void VreinterpretqU8S8(uint8x16_t* r, int8x16_t* v0) { *r = vreinterpret...
  function VreinterpretqU8S16 (line 1768) | void VreinterpretqU8S16(uint8x16_t* r, int16x8_t* v0) { *r = vreinterpre...
  function VreinterpretqU8S32 (line 1769) | void VreinterpretqU8S32(uint8x16_t* r, int32x4_t* v0) { *r = vreinterpre...
  function VreinterpretqU8S64 (line 1770) | void VreinterpretqU8S64(uint8x16_t* r, int64x2_t* v0) { *r = vreinterpre...
  function VreinterpretqU8U16 (line 1771) | void VreinterpretqU8U16(uint8x16_t* r, uint16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqU8U32 (line 1772) | void VreinterpretqU8U32(uint8x16_t* r, uint32x4_t* v0) { *r = vreinterpr...
  function VreinterpretqU8U64 (line 1773) | void VreinterpretqU8U64(uint8x16_t* r, uint64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqU8F32 (line 1774) | void VreinterpretqU8F32(uint8x16_t* r, float32x4_t* v0) { *r = vreinterp...
  function VreinterpretqU8F64 (line 1775) | void VreinterpretqU8F64(uint8x16_t* r, float64x2_t* v0) { *r = vreinterp...
  function VreinterpretqU8P128 (line 1776) | void VreinterpretqU8P128(uint8x16_t* r, poly128_t* v0) { *r = vreinterpr...
  function VreinterpretqU8P16 (line 1777) | void VreinterpretqU8P16(uint8x16_t* r, poly16x8_t* v0) { *r = vreinterpr...
  function VreinterpretqU8P64 (line 1778) | void VreinterpretqU8P64(uint8x16_t* r, poly64x2_t* v0) { *r = vreinterpr...
  function VreinterpretqU8P8 (line 1779) | void VreinterpretqU8P8(uint8x16_t* r, poly8x16_t* v0) { *r = vreinterpre...
  function Vrev16S8 (line 1780) | void Vrev16S8(int8x8_t* r, int8x8_t* v0) { *r = vrev16_s8(*v0); }
  function Vrev16U8 (line 1781) | void Vrev16U8(uint8x8_t* r, uint8x8_t* v0) { *r = vrev16_u8(*v0); }
  function Vrev16P8 (line 1782) | void Vrev16P8(poly8x8_t* r, poly8x8_t* v0) { *r = vrev16_p8(*v0); }
  function Vrev16QS8 (line 1783) | void Vrev16QS8(int8x16_t* r, int8x16_t* v0) { *r = vrev16q_s8(*v0); }
  function Vrev16QU8 (line 1784) | void Vrev16QU8(uint8x16_t* r, uint8x16_t* v0) { *r = vrev16q_u8(*v0); }
  function Vrev16QP8 (line 1785) | void Vrev16QP8(poly8x16_t* r, poly8x16_t* v0) { *r = vrev16q_p8(*v0); }
  function Vrev32S8 (line 1786) | void Vrev32S8(int8x8_t* r, int8x8_t* v0) { *r = vrev32_s8(*v0); }
  function Vrev32S16 (line 1787) | void Vrev32S16(int16x4_t* r, int16x4_t* v0) { *r = vrev32_s16(*v0); }
  function Vrev32U8 (line 1788) | void Vrev32U8(uint8x8_t* r, uint8x8_t* v0) { *r = vrev32_u8(*v0); }
  function Vrev32U16 (line 1789) | void Vrev32U16(uint16x4_t* r, uint16x4_t* v0) { *r = vrev32_u16(*v0); }
  function Vrev32P16 (line 1790) | void Vrev32P16(poly16x4_t* r, poly16x4_t* v0) { *r = vrev32_p16(*v0); }
  function Vrev32P8 (line 1791) | void Vrev32P8(poly8x8_t* r, poly8x8_t* v0) { *r = vrev32_p8(*v0); }
  function Vrev32QS8 (line 1792) | void Vrev32QS8(int8x16_t* r, int8x16_t* v0) { *r = vrev32q_s8(*v0); }
  function Vrev32QS16 (line 1793) | void Vrev32QS16(int16x8_t* r, int16x8_t* v0) { *r = vrev32q_s16(*v0); }
  function Vrev32QU8 (line 1794) | void Vrev32QU8(uint8x16_t* r, uint8x16_t* v0) { *r = vrev32q_u8(*v0); }
  function Vrev32QU16 (line 1795) | void Vrev32QU16(uint16x8_t* r, uint16x8_t* v0) { *r = vrev32q_u16(*v0); }
  function Vrev32QP16 (line 1796) | void Vrev32QP16(poly16x8_t* r, poly16x8_t* v0) { *r = vrev32q_p16(*v0); }
  function Vrev32QP8 (line 1797) | void Vrev32QP8(poly8x16_t* r, poly8x16_t* v0) { *r = vrev32q_p8(*v0); }
  function Vrev64S8 (line 1798) | void Vrev64S8(int8x8_t* r, int8x8_t* v0) { *r = vrev64_s8(*v0); }
  function Vrev64S16 (line 1799) | void Vrev64S16(int16x4_t* r, int16x4_t* v0) { *r = vrev64_s16(*v0); }
  function Vrev64S32 (line 1800) | void Vrev64S32(int32x2_t* r, int32x2_t* v0) { *r = vrev64_s32(*v0); }
  function Vrev64U8 (line 1801) | void Vrev64U8(uint8x8_t* r, uint8x8_t* v0) { *r = vrev64_u8(*v0); }
  function Vrev64U16 (line 1802) | void Vrev64U16(uint16x4_t* r, uint16x4_t* v0) { *r = vrev64_u16(*v0); }
  function Vrev64U32 (line 1803) | void Vrev64U32(uint32x2_t* r, uint32x2_t* v0) { *r = vrev64_u32(*v0); }
  function Vrev64F32 (line 1804) | void Vrev64F32(float32x2_t* r, float32x2_t* v0) { *r = vrev64_f32(*v0); }
  function Vrev64P16 (line 1805) | void Vrev64P16(poly16x4_t* r, poly16x4_t* v0) { *r = vrev64_p16(*v0); }
  function Vrev64P8 (line 1806) | void Vrev64P8(poly8x8_t* r, poly8x8_t* v0) { *r = vrev64_p8(*v0); }
  function Vrev64QS8 (line 1807) | void Vrev64QS8(int8x16_t* r, int8x16_t* v0) { *r = vrev64q_s8(*v0); }
  function Vrev64QS16 (line 1808) | void Vrev64QS16(int16x8_t* r, int16x8_t* v0) { *r = vrev64q_s16(*v0); }
  function Vrev64QS32 (line 1809) | void Vrev64QS32(int32x4_t* r, int32x4_t* v0) { *r = vrev64q_s32(*v0); }
  function Vrev64QU8 (line 1810) | void Vrev64QU8(uint8x16_t* r, uint8x16_t* v0) { *r = vrev64q_u8(*v0); }
  function Vrev64QU16 (line 1811) | void Vrev64QU16(uint16x8_t* r, uint16x8_t* v0) { *r = vrev64q_u16(*v0); }
  function Vrev64QU32 (line 1812) | void Vrev64QU32(uint32x4_t* r, uint32x4_t* v0) { *r = vrev64q_u32(*v0); }
  function Vrev64QF32 (line 1813) | void Vrev64QF32(float32x4_t* r, float32x4_t* v0) { *r = vrev64q_f32(*v0); }
  function Vrev64QP16 (line 1814) | void Vrev64QP16(poly16x8_t* r, poly16x8_t*
Condensed preview — 76 files, each showing path, character count, and a content snippet. Download the .json file or copy for the full structured content (2,025K chars).
[
  {
    "path": ".gitignore",
    "chars": 38,
    "preview": ".vscode\nraw.h\nintrinsics.json\ndata.xml"
  },
  {
    "path": "LICENSE",
    "chars": 1069,
    "preview": "MIT License\n\nCopyright (c) 2023 Alivan Akbar\n\nPermission is hereby granted, free of charge, to any person obtaining a co"
  },
  {
    "path": "README.md",
    "chars": 4944,
    "preview": "# SIMD Implementation in Golang\n\nThis repository contains an implementation of SIMD (Single Instruction, Multiple Data) "
  },
  {
    "path": "arm/generate.go",
    "chars": 51,
    "preview": "package arm\n\n//go:generate go run ../generator/arm\n"
  },
  {
    "path": "arm/neon/functions.c",
    "chars": 201074,
    "preview": "#include <arm_neon.h>\n\nvoid VabaS8(int8x8_t* r, int8x8_t* v0, int8x8_t* v1, int8x8_t* v2) { *r = vaba_s8(*v0, *v1, *v2);"
  },
  {
    "path": "arm/neon/functions.go",
    "chars": 880821,
    "preview": "package neon\n\nimport (\n\t\"github.com/alivanz/go-simd/arm\"\n)\n\n/*\n#include <arm_neon.h>\n*/\nimport \"C\"\n\n// Signed Absolute d"
  },
  {
    "path": "arm/neon/functions_bypass.go",
    "chars": 656,
    "preview": "package neon\n\n/*\n#include <arm_neon.h>\nvoid vmulS8_bypass(int8x8_t* r, int8x8_t* v0, int8x8_t* v1) { *r = vmul_s8(*v0, *"
  },
  {
    "path": "arm/neon/functions_cgo.go",
    "chars": 187,
    "preview": "package neon\n\n/*\n#cgo CFLAGS: -march=armv8.5-a+crypto+i8mm\n#include <arm_neon.h>\n*/\nimport \"C\"\n\ntype int8x8 = C.int8x8_t"
  },
  {
    "path": "arm/neon/functions_test.go",
    "chars": 2306,
    "preview": "package neon\n\nimport (\n\t\"math/rand\"\n\t\"reflect\"\n\t\"runtime\"\n\t\"testing\"\n\t\"unsafe\"\n\n\t\"github.com/alivanz/go-simd/arm\"\n)\n\nfun"
  },
  {
    "path": "arm/neon/loops.c",
    "chars": 91173,
    "preview": "#include <arm_neon.h>\n\n#define save(dst, src) *dst = src\n#define load(src) (*src)\n#define LOOP1(name, rtype, itype, f, s"
  },
  {
    "path": "arm/neon/loops.go",
    "chars": 500235,
    "preview": "package neon\n\nimport (\n\t\"github.com/alivanz/go-simd/arm\"\n)\n\n/*\n#include <arm_neon.h>\n*/\nimport \"C\"\n\n// Signed Absolute D"
  },
  {
    "path": "arm/neon/loops_test.go",
    "chars": 2389,
    "preview": "package neon\n\nimport (\n\t\"math/rand\"\n\t\"reflect\"\n\t\"testing\"\n\t\"unsafe\"\n\n\t\"github.com/alivanz/go-simd/arm\"\n)\n\nfunc TestVabsS"
  },
  {
    "path": "arm/types.go",
    "chars": 6323,
    "preview": "package arm\n\n/*\n#include <arm_neon.h>\n*/\nimport \"C\"\n\n// typedef float float32_t;\ntype Float32 = C.float32_t\n\n// typedef "
  },
  {
    "path": "example/neon/main.go",
    "chars": 413,
    "preview": "package main\n\nimport (\n\t\"log\"\n\n\t\"github.com/alivanz/go-simd/arm\"\n\t\"github.com/alivanz/go-simd/arm/neon\"\n)\n\nfunc main() {"
  },
  {
    "path": "example/sse2/main.go",
    "chars": 306,
    "preview": "package main\n\nimport (\n\t\"log\"\n\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\nfunc main() {\n\ta := x86.MmSetrEpi8(0, 1, 2, 3, 4, 5,"
  },
  {
    "path": "generator/arm/arm.go",
    "chars": 845,
    "preview": "package main\n\nimport (\n\t\"encoding/json\"\n\t\"os\"\n\n\t\"github.com/alivanz/go-simd/generator/utils\"\n)\n\ntype ArmIntrinsics []Arm"
  },
  {
    "path": "generator/arm/main.go",
    "chars": 10314,
    "preview": "package main\n\nimport (\n\t\"bytes\"\n\t\"fmt\"\n\t\"io\"\n\t\"log\"\n\t\"os\"\n\t\"os/exec\"\n\t\"sort\"\n\t\"strconv\"\n\t\"strings\"\n\n\t\"github.com/alivanz"
  },
  {
    "path": "generator/arm/sort.go",
    "chars": 444,
    "preview": "package main\n\nimport \"strings\"\n\nvar (\n\tsuffixOrder = []string{\n\t\t\"_s8\",\n\t\t\"_s16\",\n\t\t\"_s32\",\n\t\t\"_s64\",\n\t\t\"_u8\",\n\t\t\"_u16\","
  },
  {
    "path": "generator/scanner/scan.go",
    "chars": 2963,
    "preview": "package scanner\n\nimport (\n\t\"bytes\"\n\t\"regexp\"\n\n\t\"github.com/alivanz/go-simd/generator/types\"\n\t\"github.com/alivanz/go-simd"
  },
  {
    "path": "generator/scanner/scan_test.go",
    "chars": 3413,
    "preview": "package scanner\n\nimport (\n\t\"reflect\"\n\t\"regexp\"\n\t\"testing\"\n\n\t\"github.com/alivanz/go-simd/generator/types\"\n)\n\nfunc TestAtt"
  },
  {
    "path": "generator/scanner/util.go",
    "chars": 301,
    "preview": "package scanner\n\nfunc commaSplit(ss ...string) []string {\n\tswitch len(ss) {\n\tcase 0:\n\t\treturn nil\n\tcase 1:\n\t\ts := regWhi"
  },
  {
    "path": "generator/types/function.go",
    "chars": 632,
    "preview": "package types\n\nimport (\n\t\"regexp\"\n\t\"strings\"\n)\n\ntype Function struct {\n\tName      string\n\tArgs      []Type\n\tReturn    *T"
  },
  {
    "path": "generator/types/type.go",
    "chars": 1197,
    "preview": "package types\n\nimport (\n\t\"strings\"\n\n\t\"github.com/iancoleman/strcase\"\n)\n\ntype Type struct {\n\tName       string\n\tFull     "
  },
  {
    "path": "generator/utils/download.go",
    "chars": 449,
    "preview": "package utils\n\nimport (\n\t\"io\"\n\t\"net/http\"\n\t\"os\"\n)\n\nfunc Download(dst, url string) error {\n\tif _, err := os.Stat(dst); !o"
  },
  {
    "path": "generator/utils/filter.go",
    "chars": 194,
    "preview": "package utils\n\nfunc Filter[T any](arr []T, fn func(e T) bool) []T {\n\tout := make([]T, 0, len(arr))\n\tfor _, e := range ar"
  },
  {
    "path": "generator/utils/slice.go",
    "chars": 323,
    "preview": "package utils\n\nfunc Transform[A, B any](arr []A, fn func(i int, e A) B) []B {\n\tif arr == nil {\n\t\treturn nil\n\t}\n\tout := m"
  },
  {
    "path": "generator/writer/cgo.go",
    "chars": 320,
    "preview": "package writer\n\nimport (\n\t\"fmt\"\n\t\"strings\"\n)\n\nfunc Cflags(flags []string) string {\n\treturn fmt.Sprintf(\"#cgo CFLAGS: %s\""
  },
  {
    "path": "generator/writer/function.go",
    "chars": 2406,
    "preview": "package writer\n\nimport (\n\t\"fmt\"\n\t\"io\"\n\t\"strings\"\n\n\t\"github.com/alivanz/go-simd/generator/types\"\n\t\"github.com/alivanz/go-"
  },
  {
    "path": "generator/writer/package.go",
    "chars": 1097,
    "preview": "package writer\n\nimport (\n\t\"fmt\"\n\t\"io\"\n\t\"strings\"\n\n\t\"github.com/alivanz/go-simd/generator/types\"\n)\n\nfunc Package(w io.Wri"
  },
  {
    "path": "generator/writer/package_test.go",
    "chars": 745,
    "preview": "package writer\n\nimport (\n\t\"bytes\"\n\t\"io\"\n\t\"strings\"\n\t\"testing\"\n)\n\nfunc TestPackage(t *testing.T) {\n\tvar buf bytes.Buffer\n"
  },
  {
    "path": "generator/writer/type.go",
    "chars": 353,
    "preview": "package writer\n\nimport (\n\t\"fmt\"\n\t\"io\"\n\n\t\"github.com/alivanz/go-simd/generator/types\"\n)\n\nfunc DeclareType(w io.Writer, t "
  },
  {
    "path": "generator/writer/writer.go",
    "chars": 450,
    "preview": "package writer\n\nimport (\n\t\"io\"\n\t\"os\"\n\t\"path/filepath\"\n)\n\nfunc WriteToFile(dst string, fn func(w io.Writer) error) error "
  },
  {
    "path": "generator/x86/info.go",
    "chars": 1312,
    "preview": "package main\n\nimport (\n\t\"bytes\"\n\t\"io/ioutil\"\n\t\"regexp\"\n\n\t\"github.com/alivanz/go-simd/generator/utils\"\n)\n\ntype Intrinsic "
  },
  {
    "path": "generator/x86/main.go",
    "chars": 4058,
    "preview": "package main\n\nimport (\n\t\"bytes\"\n\t\"fmt\"\n\t\"io\"\n\t\"log\"\n\t\"os\"\n\t\"os/exec\"\n\t\"regexp\"\n\t\"strings\"\n\n\t\"github.com/alivanz/go-simd/"
  },
  {
    "path": "go.mod",
    "chars": 89,
    "preview": "module github.com/alivanz/go-simd\n\ngo 1.20\n\nrequire github.com/iancoleman/strcase v0.2.0\n"
  },
  {
    "path": "go.sum",
    "chars": 177,
    "preview": "github.com/iancoleman/strcase v0.2.0 h1:05I4QRnGpI0m37iZQRuskXh+w77mr6Z41lwQzuHLwW0=\ngithub.com/iancoleman/strcase v0.2."
  },
  {
    "path": "x86/aes/functions.c",
    "chars": 492,
    "preview": "#include <immintrin.h>\n\nvoid MmAesencSi128(__m128i* r, __m128i* v0, __m128i* v1) { *r = _mm_aesenc_si128(*v0, *v1); }\nvo"
  },
  {
    "path": "x86/aes/functions.go",
    "chars": 1380,
    "preview": "package aes\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -maes\n#include <immintrin.h>\n*/\nimport \"C\"\n\n/"
  },
  {
    "path": "x86/avx/functions.c",
    "chars": 11922,
    "preview": "#include <immintrin.h>\n\nvoid Mm256AddPd(__m256d* r, __m256d* v0, __m256d* v1) { *r = _mm256_add_pd(*v0, *v1); }\nvoid Mm2"
  },
  {
    "path": "x86/avx/functions.go",
    "chars": 35711,
    "preview": "package avx\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mavx\n#include <immintrin.h>\n*/\nimport \"C\"\n\n/"
  },
  {
    "path": "x86/avx2/functions.c",
    "chars": 12360,
    "preview": "#include <immintrin.h>\n\nvoid Mm256AbsEpi8(__m256i* r, __m256i* v0) { *r = _mm256_abs_epi8(*v0); }\nvoid Mm256AbsEpi16(__m"
  },
  {
    "path": "x86/avx2/functions.go",
    "chars": 32766,
    "preview": "package avx2\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mavx2\n#include <immintrin.h>\n*/\nimport \"C\"\n"
  },
  {
    "path": "x86/bmi/functions.c",
    "chars": 1412,
    "preview": "#include <immintrin.h>\n\nvoid AndnU32(unsigned int* r, unsigned int* v0, unsigned int* v1) { *r = __andn_u32(*v0, *v1); }"
  },
  {
    "path": "x86/bmi/functions.go",
    "chars": 2544,
    "preview": "package bmi\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mbmi\n#include <immintrin.h>\n*/\nimport \"C\"\n\n/"
  },
  {
    "path": "x86/bmi2/functions.c",
    "chars": 654,
    "preview": "#include <immintrin.h>\n\nvoid BzhiU32(unsigned int* r, unsigned int* v0, unsigned int* v1) { *r = _bzhi_u32(*v0, *v1); }\n"
  },
  {
    "path": "x86/bmi2/functions.go",
    "chars": 1743,
    "preview": "package bmi2\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mbmi2\n#include <immintrin.h>\n*/\nimport \"C\"\n"
  },
  {
    "path": "x86/crc32/functions.c",
    "chars": 455,
    "preview": "#include <immintrin.h>\n\nvoid MmCrc32U8(unsigned int* r, unsigned int* v0, unsigned char* v1) { *r = _mm_crc32_u8(*v0, *v"
  },
  {
    "path": "x86/crc32/functions.go",
    "chars": 1124,
    "preview": "package crc32\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mcrc32\n#include <immintrin.h>\n*/\nimport \"C"
  },
  {
    "path": "x86/f16c/functions.c",
    "chars": 232,
    "preview": "#include <immintrin.h>\n\nvoid CvtshSs(float* r, unsigned short* v0) { *r = _cvtsh_ss(*v0); }\nvoid MmCvtphPs(__m128* r, __"
  },
  {
    "path": "x86/f16c/functions.go",
    "chars": 890,
    "preview": "package f16c\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mf16c\n#include <immintrin.h>\n*/\nimport \"C\"\n"
  },
  {
    "path": "x86/fma/functions.c",
    "chars": 3432,
    "preview": "#include <immintrin.h>\n\nvoid MmFmaddPs(__m128* r, __m128* v0, __m128* v1, __m128* v2) { *r = _mm_fmadd_ps(*v0, *v1, *v2)"
  },
  {
    "path": "x86/fma/functions.go",
    "chars": 11022,
    "preview": "package fma\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mfma\n#include <immintrin.h>\n*/\nimport \"C\"\n\n/"
  },
  {
    "path": "x86/fsgsbase/functions.c",
    "chars": 564,
    "preview": "#include <immintrin.h>\n\nvoid ReadfsbaseU32(unsigned int* r) { *r = _readfsbase_u32(); }\nvoid ReadfsbaseU64(unsigned long"
  },
  {
    "path": "x86/fsgsbase/functions.go",
    "chars": 1504,
    "preview": "package fsgsbase\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mfsgsbase\n#include <immintrin.h>\n*/\nimp"
  },
  {
    "path": "x86/generate.go",
    "chars": 51,
    "preview": "package x86\n\n//go:generate go run ../generator/x86\n"
  },
  {
    "path": "x86/lzcnt/functions.c",
    "chars": 259,
    "preview": "#include <immintrin.h>\n\nvoid Lzcnt32(unsigned int* r, unsigned int* v0) { *r = __lzcnt32(*v0); }\nvoid LzcntU32(unsigned "
  },
  {
    "path": "x86/lzcnt/functions.go",
    "chars": 625,
    "preview": "package lzcnt\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mlzcnt\n#include <immintrin.h>\n*/\nimport \"C"
  },
  {
    "path": "x86/mmx/functions.c",
    "chars": 5575,
    "preview": "#include <immintrin.h>\n\nvoid MmEmpty() { _mm_empty(); }\nvoid MmCvtsi32Si64(__m64* r, int* v0) { *r = _mm_cvtsi32_si64(*v"
  },
  {
    "path": "x86/mmx/functions.go",
    "chars": 14516,
    "preview": "package mmx\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mmmx\n#include <immintrin.h>\n*/\nimport \"C\"\n\n/"
  },
  {
    "path": "x86/mmx_sse/functions.c",
    "chars": 1673,
    "preview": "#include <immintrin.h>\n\nvoid MmCvtpsPi32(__m64* r, __m128* v0) { *r = _mm_cvtps_pi32(*v0); }\nvoid MmCvtPs2Pi(__m64* r, _"
  },
  {
    "path": "x86/mmx_sse/functions.go",
    "chars": 5930,
    "preview": "package mmx_sse\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mmmx -msse\n#include <immintrin.h>\n*/\nimp"
  },
  {
    "path": "x86/mmx_sse2/functions.c",
    "chars": 476,
    "preview": "#include <immintrin.h>\n\nvoid MmCvtpdPi32(__m64* r, __m128d* v0) { *r = _mm_cvtpd_pi32(*v0); }\nvoid MmCvttpdPi32(__m64* r"
  },
  {
    "path": "x86/mmx_sse2/functions.go",
    "chars": 1437,
    "preview": "package mmx_sse2\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mmmx -msse2\n#include <immintrin.h>\n*/\ni"
  },
  {
    "path": "x86/mmx_ssse3/functions.c",
    "chars": 1214,
    "preview": "#include <immintrin.h>\n\nvoid MmAbsPi8(__m64* r, __m64* v0) { *r = _mm_abs_pi8(*v0); }\nvoid MmAbsPi16(__m64* r, __m64* v0"
  },
  {
    "path": "x86/mmx_ssse3/functions.go",
    "chars": 4106,
    "preview": "package mmx_ssse3\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mmmx -mssse3\n#include <immintrin.h>\n*/"
  },
  {
    "path": "x86/popcnt/functions.c",
    "chars": 182,
    "preview": "#include <immintrin.h>\n\nvoid MmPopcntU32(int* r, unsigned int* v0) { *r = _mm_popcnt_u32(*v0); }\nvoid MmPopcntU64(long l"
  },
  {
    "path": "x86/popcnt/functions.go",
    "chars": 534,
    "preview": "package popcnt\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mpopcnt\n#include <immintrin.h>\n*/\nimport "
  },
  {
    "path": "x86/sse/functions.c",
    "chars": 6462,
    "preview": "#include <immintrin.h>\n\nvoid MmAddSs(__m128* r, __m128* v0, __m128* v1) { *r = _mm_add_ss(*v0, *v1); }\nvoid MmAddPs(__m1"
  },
  {
    "path": "x86/sse/functions.go",
    "chars": 22677,
    "preview": "package sse\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -msse\n#include <immintrin.h>\n*/\nimport \"C\"\n\n/"
  },
  {
    "path": "x86/sse2/functions.c",
    "chars": 15783,
    "preview": "#include <immintrin.h>\n\nvoid MmAddSd(__m128d* r, __m128d* v0, __m128d* v1) { *r = _mm_add_sd(*v0, *v1); }\nvoid MmAddPd(_"
  },
  {
    "path": "x86/sse2/functions.go",
    "chars": 45280,
    "preview": "package sse2\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -msse2\n#include <immintrin.h>\n*/\nimport \"C\"\n"
  },
  {
    "path": "x86/sse3/functions.c",
    "chars": 809,
    "preview": "#include <immintrin.h>\n\nvoid MmAddsubPs(__m128* r, __m128* v0, __m128* v1) { *r = _mm_addsub_ps(*v0, *v1); }\nvoid MmHadd"
  },
  {
    "path": "x86/sse3/functions.go",
    "chars": 2568,
    "preview": "package sse3\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -msse3\n#include <immintrin.h>\n*/\nimport \"C\"\n"
  },
  {
    "path": "x86/ssse3/functions.c",
    "chars": 1328,
    "preview": "#include <immintrin.h>\n\nvoid MmAbsEpi8(__m128i* r, __m128i* v0) { *r = _mm_abs_epi8(*v0); }\nvoid MmAbsEpi16(__m128i* r, "
  },
  {
    "path": "x86/ssse3/functions.go",
    "chars": 4225,
    "preview": "package ssse3\n\nimport (\n\t\"github.com/alivanz/go-simd/x86\"\n)\n\n/*\n#cgo CFLAGS: -mssse3\n#include <immintrin.h>\n*/\nimport \"C"
  },
  {
    "path": "x86/types.go",
    "chars": 1113,
    "preview": "package x86\n\n/*\n#include <immintrin.h>\n*/\nimport \"C\"\n\n// typedef longlong __m64 __attribute__((__vector_size__(8), __ali"
  }
]

About this extraction

This page contains the full source code of the alivanz/go-simd GitHub repository, extracted and formatted as plain text for AI agents and large language models (LLMs). The extraction includes 76 files (1.9 MB), approximately 634.7k tokens, and a symbol index with 7261 extracted functions, classes, methods, constants, and types. Use this with OpenClaw, Claude, ChatGPT, Cursor, Windsurf, or any other AI tool that accepts text input. You can copy the full output to your clipboard or download it as a .txt file.

Extracted by GitExtract — free GitHub repo to text converter for AI. Built by Nikandr Surkov.

Copied to clipboard!