Block or Report
Block or report hwu36
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
214 contributions in the last year
Less
More
Contribution activity
February 2023
Created 5 commits in 1 repository
Created a pull request in NVIDIA/cutlass that received 2 comments
Opened 2 other pull requests in 1 repository
NVIDIA/cutlass
2
merged
Reviewed 12 pull requests in 1 repository
NVIDIA/cutlass
12 pull requests
- Fix typos 2
- Fix typos
- fMHA: Sync FW with xFormers
- Add fixed_channel and few_channel mode to int8 in generator
- min -> std::min
- Changes to iterators to support s8 gemm with f16 outputs
- Fix some typos
- Extend DualGemm: support batched mode + decouple B0/B1 layouts
- Fix type bug in conv2d/gemm with broadcast
- xFormer updates to fMHA FW
- Add acc2smem back to epilogue/threadblock/epilogue.h
- Re-enable alignments for int32_t accumulators
Answered 1 discussion in 1 repository
NVIDIA/cutlass
NVIDIA/cutlass
-
Vectorized epilogue in v3 API
This contribution was made on Feb 13



