tensor.qlinear_concat
Last updated
Last updated
Concatenate a list of tensors after dequantizing them with their respective scales and zero_points and returns the quantized result.
tensors
( Span<Tensor<i8>>,
) - Array of the quantized input tensors.
scales
( Span<Tensor<T>>,
) - Array of the scales of the quantized input tensors.
zero_points
( Span<Tensor<T>>,
) - Arrayof the zero_points of the quantized input tensors.
y_scale
(@Tensor<T>
) - Scale for output.
y_zero_point
(@Tensor<T>
) - Zero point for output.
axis
(usize
) - Axis to concat on.
Panic if tensor length is not greater than 1.
Panics if dimension is not greater than axis.
u32 tensor, not supported. fp8x23wide tensor, not supported. fp16x16wide tensor, not supported.
A new Tensor<i8>
concatenated quantized tensor of the dequantized input tensors.