Synthical
Articles about
Performance
Edge-First Language Model Inference: Models, Metrics, and Tradeoffs
Yesterday by
Siyoung Jang
and
Roberto Morabito
Distributed, Parallel, and Cluster Computing
,
Artificial Intelligence
Performance of Confidential Computing GPUs
Yesterday by
Antonio Martínez Ibarra
and
others
Performance
Optimizing Asynchronous Federated Learning: A Delicate Trade-Off Between Model-Parameter Staleness and Update Frequency
Yesterday by
Abdelkrim Alahyane
and
others
Machine Learning
,
Performance
Towards Stream-Based Monitoring for EVM Networks
Yesterday by
Emanuel Onica
and
others
Performance
,
Distributed, Parallel, and Cluster Computing
A Methodology to Evaluate Strategies Predicting Rankings on Unseen Domains
Yesterday by
Sébastien Piérard
and
others
Performance
,
Computer Vision and Pattern Recognition
Bridging the Gap: Physical PCI Device Integration Into SystemC-TLM Virtual Platforms
Yesterday by
Nils Bosbach
and
others
Software Engineering
,
Hardware Architecture
BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems
Yesterday by
Yuxin Wang
and
others
Distributed, Parallel, and Cluster Computing
,
Performance
Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification
2 days ago by
Arshia Kermani
and
others
Machine Learning
,
Artificial Intelligence
Extracting Practical, Actionable Energy Insights from Supercomputer Telemetry and Logs
2 days ago by
Melanie Cornelius
and
others
Distributed, Parallel, and Cluster Computing
,
Performance
Task-parallelism in SWIFT for heterogeneous compute architectures
2 days ago by
Abouzied M. A. Nasar
and
others
Performance
,
Instrumentation and Methods for Astrophysics
Heterogeneous Memory Pool Tuning
3 days ago by
Filip Vaverka
and
others
Performance
Towards Efficient Multi-Scale Deformable Attention on NPU
3 days ago by
Chenghuan Huang
and
others
Performance
,
Computer Vision and Pattern Recognition
Bayesian Hierarchical Models for Quantitative Estimates for Performance metrics applied to Saddle Search Algorithms
3 days ago by
Rohit Goswami
Chemical Physics
,
Performance
Net-Zero: A Comparative Study on Neural Network Design for Climate-Economic PDEs Under Uncertainty
3 days ago by
Carlos Rodriguez-Pardo
and
others
Machine Learning
,
Artificial Intelligence
eBPF-Based Instrumentation for Generalisable Diagnosis of Performance Degradation
3 days ago by
Diogo Landau
and
others
Distributed, Parallel, and Cluster Computing
,
Performance
Effects of the Auto-Correlation of Delays on the Age of Information: A Gaussian Process Framework
4 days ago by
Atsushi Inoie
and
Yoshiaki Inoue
Performance
Unleashing Automated Congestion Control Customization in the Wild
4 days ago by
Amit Cohen
and
others
Networking and Internet Architecture
,
Artificial Intelligence
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training
6 days ago by
Jintao Zhang
and
others
Machine Learning
,
Artificial Intelligence
msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML
6 days ago by
Zhaolan Huang
and
Emmanuel Baccelli
Machine Learning
,
Performance
Improving Assembly Code Performance with Large Language Models via Reinforcement Learning
6 days ago by
Anjiang Wei
and
others
Computation and Language
,
Artificial Intelligence
Characterizing GPU Energy Usage in Exascale-Ready Portable Science Applications
6 days ago by
William Godoy
and
others
Performance
,
Computational Engineering, Finance, and Science
MAS-Attention: Memory-Aware Stream Processing for Attention Acceleration on Resource-Constrained Edge Devices
7 days ago by
Mohammadali Shakerdargah
and
others
Distributed, Parallel, and Cluster Computing
,
Artificial Intelligence
Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities
15 May 2025 by
Hiari Pizzini Cavagna
and
others
Performance
,
Artificial Intelligence
An Integrated UVM-TLM Co-Simulation Framework for RISC-V Functional Verification and Performance Evaluation
15 May 2025 by
Ruizhi Qiu
and
Yang Liu
Hardware Architecture
,
Performance
On the Partitioning of GPU Power among Multi-Instances
14 May 2025 by
Tirth Vamja
and
others
Distributed, Parallel, and Cluster Computing
,
Performance
Statistical Modeling and Uncertainty Estimation of LLM Inference Systems
14 May 2025 by
Kaustabha Ray
and
others
Performance
Geometric lower bounds for the steady-state occupancy of processing networks with limited connectivity
13 May 2025 by
Diego Goldsztajn
and
Andres Ferragut
Probability
,
Performance
Revisiting 16-bit Neural Network Training: A Practical Approach for Resource-Limited Learning
13 May 2025 by
Juyoung Yun
and
others
Machine Learning
,
Artificial Intelligence
Comparing Parallel Functional Array Languages: Programming and Performance
13 May 2025 by
David Van Balen
and
others
Programming Languages
,
Distributed, Parallel, and Cluster Computing
USEFUSE: Uniform Stride for Enhanced Performance in Fused Layer Architecture of Deep Neural Networks
13 May 2025 by
Muhammad Sohail Ibrahim
and
others
Machine Learning
,
Hardware Architecture