tensor.qlinear_concat

  qlinear_concat(tensors: Span<Tensor<i8>>, scales: Span<Tensor<T>>, zero_points: Span<Tensor<T>>, y_scale: @Tensor<T>, y_zero_point: @Tensor<T>, axis: usize) -> Tensor::<i8>;

Concatenates a list of quantized tensors. Each input tensor is dequantized with its own scale and zero point, the dequantized tensors are concatenated along axis, and the result is quantized with y_scale and y_zero_point.

Args

  • tensors(Span<Tensor<i8>>) - Array of the quantized input tensors.

  • scales(Span<Tensor<T>>) - Array of the scales of the quantized input tensors.

  • zero_points(Span<Tensor<T>>) - Array of the zero points of the quantized input tensors.

  • y_scale(@Tensor<T>) - Scale for output.

  • y_zero_point(@Tensor<T>) - Zero point for output.

  • axis(usize) - Axis to concat on.

Panics

  • Panics if the number of input tensors is not greater than 1.

  • Panics if the number of dimensions of the input tensors is not greater than axis.

Type Constraints

Not supported for u32, fp8x23wide, or fp16x16wide tensors.

Returns

A new Tensor<i8> holding the quantized concatenation of the dequantized input tensors.

Example
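
The library's own Cairo example is not reproduced here. As a stand-in, the following is a minimal Python reference model of the arithmetic described above (dequantize, concatenate, requantize). It is a sketch under stated assumptions, not the library's implementation: the name qlinear_concat_ref and its helpers are hypothetical, each scale and zero point is treated as a single scalar per tensor, rounding is round-half-to-even, and the output is clamped to the i8 range [-128, 127]. Tensors are modelled as nested Python lists.

```python
def dequantize(q, scale, zero_point):
    """Map quantized values to real values: (q - zero_point) * scale."""
    if isinstance(q, list):
        return [dequantize(v, scale, zero_point) for v in q]
    return (q - zero_point) * scale


def quantize(x, y_scale, y_zero_point):
    """Map real values to i8: round(x / y_scale) + y_zero_point, then clamp."""
    if isinstance(x, list):
        return [quantize(v, y_scale, y_zero_point) for v in x]
    q = round(x / y_scale) + y_zero_point  # Python rounds halves to even
    return max(-128, min(127, q))          # assumed i8 saturation range


def concat(tensors, axis):
    """Concatenate nested lists along the given axis."""
    if axis == 0:
        return [item for t in tensors for item in t]
    # Pair up the i-th sub-tensor of every input and recurse one level down.
    return [concat(list(parts), axis - 1) for parts in zip(*tensors)]


def qlinear_concat_ref(tensors, scales, zero_points, y_scale, y_zero_point, axis):
    """Hypothetical reference model: dequantize, concatenate, requantize."""
    # Mirrors the documented panic: more than one input tensor is required.
    assert len(tensors) > 1, "need more than one input tensor"
    real = [dequantize(t, s, z) for t, s, z in zip(tensors, scales, zero_points)]
    return quantize(concat(real, axis), y_scale, y_zero_point)


a = [[5, 5], [5, 5]]  # scale 2.0, zero point 1 -> real value 8.0
b = [[1, 1], [1, 1]]  # scale 4.0, zero point 0 -> real value 4.0

print(qlinear_concat_ref([a, b], [2.0, 4.0], [1, 0], 4.0, 0, axis=0))
# [[2, 2], [2, 2], [1, 1], [1, 1]]
print(qlinear_concat_ref([a, b], [2.0, 4.0], [1, 0], 4.0, 0, axis=1))
# [[2, 2, 1, 1], [2, 2, 1, 1]]
```

Because the inputs carry different scales and zero points, equal stored values in two inputs generally do not produce equal output values; the output is expressed entirely in terms of y_scale and y_zero_point.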
