The project made various contributions along four research avenues. (1) We proposed microarchitecture enhancements to improve single-thread performance, including Probabilistic Branch Support [MICRO 2018], Precise Runahead Execution [HPCA 2020], Forward Slice Core [PACT 2020] and Vector Runahead [ISCA 2021, MICRO 2023]. (2) We proposed time-proportional performance analysis: TIP [MICRO 2021] and TEA [ISCA 2023], the RPPM performance model for multicore CPUs [ISPASS 2019], the MDM model for GPUs [MICRO 2020], and scale-model simulation [CAL 2021, ISPASS 2022, HPCA 2024]. The HSM slowdown model enables fair and QoS-aware resource management in multitasking GPUs [ASPLOS 2020]. (3) We proposed write-rationing garbage collection to manage hybrid memories for CPUs [PLDI 2018, SIGMETRICS 2019]. We propose PAE address mapping to balance memory utilization in GPUs [ISCA 2018], adaptive memory-side last-level caching to trade off bandwidth versus capacity in GPUs [ISCA 2019, MICRO 2020], resource management in heterogeneous system-on-chips [HPCA 2022], caching in multi-chip GPU systems [ISCA 2023]. (4) We extended the multicore simulator Sniper to the ARM instruction-set architecture [ISPASS 2019], we developed an emulation platform for evaluating future hybrid memory systems on existing commodity hardware [ISPASS 2019], we developed rigorous benchmarking methodology for Python workloads [IISWC 2020], representative GPU workloads [IISWC 2021], GPU simulation methodology [ISPASS 2023].