Type | Year | Title | Cited | Download |
---|---|---|---|---|
Conference
|
2025 | Dynamic Layer-Specific Overlapping for Efficient LLM Inference on Resource-Constrained Systems Misun Yu International Symposium on Code Generation and Optimization (CGO) 2025, pp.1-3 | ||
Conference
|
2024 | ML2Tuner: Efficient Code Tuning via Multi-Level Machine Learning Models JooHyoung Cha Conference on Neural Information Processing Systems (NeurIPS) 2024 : Workshop, pp.1-12 | ||
Journal
|
2024 | NEST-C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators Jeman Park ETRI Journal, v.46, no.5, pp.851-864 | 2 |