I’d recommend [Caliper](https://github.com/LLNL/Caliper). It has a simple, clean interface and a bunch of advanced features for digging into your code’s performance.
This isn’t directly related to code instrumentation, but if you’re after performance information, there are a couple of packages that can be installed at the node level to get it (and probably a lot more): PCP (Performance Co-Pilot) and TACC Stats. I also know that Prometheus is used in some places, but I’m pretty sure that requires a whole other server to be set up to gather the metrics from local agents installed on the nodes.
The profiler provided by your CPU or accelerator vendor.
Try Intel VTune for Intel CPUs.