tensor.qlinear_concat
qlinear_concat(tensors: Span<Tensor<i8>>, scales: Span<Tensor<T>>, zero_points: Span<Tensor<T>>, y_scale: @Tensor<T>, y_zero_point: @Tensor<T>, axis: usize) -> Tensor::<i8>;Concatenate a list of tensors after dequantizing them with their respective scales and zero_points and returns the quantized result.
Args
tensors(Span<Tensor<i8>>,) - Array of the quantized input tensors.scales(Span<Tensor<T>>,) - Array of the scales of the quantized input tensors.zero_points(Span<Tensor<T>>,) - Arrayof the zero_points of the quantized input tensors.y_scale(@Tensor<T>) - Scale for output.y_zero_point(@Tensor<T>) - Zero point for output.axis(usize) - Axis to concat on.
Panics
Panic if tensor length is not greater than 1.
Panics if dimension is not greater than axis.
Type Constraints
u32 tensor, not supported. fp8x23wide tensor, not supported. fp16x16wide tensor, not supported.
Returns
A new Tensor<i8> concatenated quantized tensor of the dequantized input tensors.
Example
Last updated
Was this helpful?