Man page - rocm-smi(1)
Packages contains this manual
Manual
ROCM-SMI
NAMEDESCRIPTION
options:
Display Options:
Topology:
Pages information:
Hardware-related information:
Software-related/controlled information:
Set options:
Reset options:
Auto-response options:
Output options:
SEE ALSO
NAME
rocm-smi - rocm-smi - a tool to monitor AMD accelerators and GPUs
DESCRIPTION
usage: rocm-smi [-h] [-V] [-d DEVICE [DEVICE ...]] [--alldevices] [--showhw] [-a] [-i] [-v] [-e [EVENT ...]]
[--showdriverversion] [--showtempgraph] [--showfwinfo [BLOCK ...]] [--showmclkrange] [--showmemvendor] [--showsclkrange] [--showproductname] [--showserial] [--showuniqueid] [--showvoltagerange] [--showbus] [--showpagesinfo] [--showpendingpages] [--showretiredpages] [--showunreservablepages] [-f] [-P] [-t] [-u] [--showmemuse] [--showvoltage] [-b] [-c] [-g] [-l] [-M] [-m] [-o] [-p] [-S] [-s] [--showmeminfo TYPE [TYPE ...]] [--showpids [VERBOSE]] [--showpidgpus [SHOWPIDGPUS ...]] [--showreplaycount] [--showrasinfo [SHOWRASINFO ...]] [--showvc] [--showxgmierr] [--showtopo] [--showtopoaccess] [--showtopoweight] [--showtopohops] [--showtopotype] [--showtoponuma] [--showenergycounter] [--shownodesbw] [--showcomputepartition] [--showmemorypartition] [-r] [--resetfans] [--resetprofile] [--resetpoweroverdrive] [--resetxgmierr] [--resetperfdeterminism] [--resetcomputepartition] [--resetmemorypartition] [--setclock TYPE LEVEL] [--setsclk LEVEL [LEVEL ...]] [--setmclk LEVEL [LEVEL ...]] [--setpcie LEVEL [LEVEL ...]] [--setslevel SCLKLEVEL SCLK SVOLT] [--setmlevel MCLKLEVEL MCLK MVOLT] [--setvc POINT SCLK SVOLT] [--setsrange SCLKMIN SCLKMAX] [--setextremum min|max sclk|mclk CLK] [--setmrange MCLKMIN MCLKMAX] [--setfan LEVEL] [--setperflevel LEVEL] [--setoverdrive %] [--setmemoverdrive %] [--setpoweroverdrive WATTS] [--setprofile SETPROFILE] [--setperfdeterminism SCLK] [--setcomputepartition {CPX,SPX,DPX,TPX,QPX,cpx,spx,dpx,tpx,qpx}] [--setmemorypartition {NPS1,NPS2,NPS4,NPS8,nps1,nps2,nps4,nps8}] [--rasenable BLOCK ERRTYPE] [--rasdisable BLOCK ERRTYPE] [--rasinject BLOCK] [--gpureset] [--load FILE | --save FILE] [--autorespond RESPONSE] [--loglevel LEVEL] [--json] [--csv]
AMD ROCm System Management Interface | ROCM-SMI version: 2.2.0
options:
-h , --help
show this help message and exit
--gpureset
Reset specified GPU (One GPU must be specified)
--load FILE
Load Clock, Fan, Performance and Profile settings from FILE
--save FILE
Save Clock, Fan, Performance and Profile settings to FILE
-V , --version
Show version information
-d DEVICE [DEVICE ...], --device DEVICE [DEVICE ...]
Execute command on specified device
Display Options:
|
--alldevices |
--showhw
Show Hardware details
-a , --showallinfo
Show Temperature, Fan and Clock values
Topology:
-i , --showid
Show DEVICE IDs
-v , --showvbios
Show VBIOS version
-e [EVENT ...], --showevents [EVENT ...]
Show event list
--showdriverversion
Show kernel driver version
--showtempgraph
Show Temperature Graph
--showfwinfo [BLOCK ...]
Show FW information
--showmclkrange
Show mclk range
--showmemvendor
Show GPU memory vendor
--showsclkrange
Show sclk range
--showproductname
Show product details
--showserial
Show GPU’s Serial Number
--showuniqueid
Show GPU’s Unique ID
--showvoltagerange
Show voltage range
--showbus
Show PCI bus number
Pages information:
--showpagesinfo
Show retired, pending and unreservable pages
--showpendingpages
Show pending retired pages
--showretiredpages
Show retired pages
--showunreservablepages
Show unreservable pages
Hardware-related information:
-f , --showfan
Show current fan speed
-P , --showpower
Show current average or instant socket graphics package power consumption
-t , --showtemp
Show current temperature
-u , --showuse
Show current GPU use
--showmemuse
Show current GPU memory used
--showvoltage
Show current GPU voltage
Software-related/controlled information:
-b , --showbw
Show estimated PCIe use
-c , --showclocks
Show current clock frequencies
-g , --showgpuclocks
Show current GPU clock frequencies
-l , --showprofile
Show Compute Profile attributes
-M , --showmaxpower
Show maximum graphics package power this GPU will consume
-m , --showmemoverdrive
Show current GPU Memory Clock OverDrive level
-o , --showoverdrive
Show current GPU Clock OverDrive level
-p , --showperflevel
Show current DPM Performance Level
-S , --showclkvolt
Show supported GPU and Memory Clocks and Voltages
-s , --showclkfrq
Show supported GPU and Memory Clock
--showmeminfo TYPE [TYPE ...]
Show Memory usage information for given block(s) TYPE
--showpids [VERBOSE]
Show current running KFD PIDs (pass details to VERBOSE for detailed information)
--showpidgpus [SHOWPIDGPUS ...]
Show GPUs used by specified KFD PIDs (all if no arg given)
--showreplaycount
Show PCIe Replay Count
--showrasinfo [SHOWRASINFO ...]
Show RAS enablement information and error counts for the specified block(s) (all if no arg given)
--showvc
Show voltage curve
--showxgmierr
Show XGMI error information since last read
--showtopo
Show hardware topology information
--showtopoaccess
Shows the link accessibility between GPUs
--showtopoweight
Shows the relative weight between GPUs
--showtopohops
Shows the number of hops between GPUs
--showtopotype
Shows the link type between GPUs
--showtoponuma
Shows the numa nodes
--showenergycounter
Energy accumulator that stores amount of energy consumed
--shownodesbw
Shows the numa nodes
--showcomputepartition
Shows current compute partitioning
--showmemorypartition
Shows current memory partition
Set options:
--setclock TYPE LEVEL
Set Clock Frequency Level(s) for specified clock (requires manual Perf level)
--setsclk LEVEL [LEVEL ...]
Set GPU Clock Frequency Level(s) (requires manual Perf level)
--setmclk LEVEL [LEVEL ...]
Set GPU Memory Clock Frequency Level(s) (requires manual Perf level)
--setpcie LEVEL [LEVEL ...]
Set PCIE Clock Frequency Level(s) (requires manual Perf level)
--setslevel SCLKLEVEL SCLK SVOLT
Change GPU Clock frequency (MHz) and Voltage (mV) for a specific Level
--setmlevel MCLKLEVEL MCLK MVOLT
Change GPU Memory clock frequency (MHz) and Voltage for (mV) a specific Level
--setvc POINT SCLK SVOLT
Change SCLK Voltage Curve (MHz mV) for a specific point
--setsrange SCLKMIN SCLKMAX
Set min and max SCLK speed
--setextremum min|max sclk|mclk CLK
Set min/max of SCLK/MCLK speed
--setmrange MCLKMIN MCLKMAX
Set min and max MCLK speed
--setfan LEVEL
Set GPU Fan Speed (Level or %)
--setperflevel LEVEL
Set Performance Level
--setoverdrive %
Set GPU OverDrive level (requires manual|high Perf level)
--setmemoverdrive %
Set GPU Memory Overclock OverDrive level (requires manual|high Perf level)
--setpoweroverdrive WATTS
Set the maximum GPU power using Power OverDrive in Watts
--setprofile SETPROFILE
Specify Power Profile level (#) or a quoted string of CUSTOM Profile attributes "# # # #..." (requires manual Perf level)
--setperfdeterminism SCLK
Set clock frequency limit to get minimal performance variation
--setcomputepartition {CPX,SPX,DPX,TPX,QPX,cpx,spx,dpx,tpx,qpx}
Set compute partition
--setmemorypartition {NPS1,NPS2,NPS4,NPS8,nps1,nps2,nps4,nps8}
Set memory partition
--rasenable BLOCK ERRTYPE
Enable RAS for specified block and error type
--rasdisable BLOCK ERRTYPE
Disable RAS for specified block and error type
--rasinject BLOCK
Inject RAS poison for specified block (ONLY WORKS ON UNSECURED BOARDS)
Reset options:
-r , --resetclocks
Reset clocks and OverDrive to default
--resetfans
Reset fans to automatic (driver) control
--resetprofile
Reset Power Profile back to default
--resetpoweroverdrive
Set the maximum GPU power back to the device default state
--resetxgmierr
Reset XGMI error count
--resetperfdeterminism
Disable performance determinism
--resetcomputepartition
Resets to boot compute partition state
--resetmemorypartition
Resets to boot memory partition state
Auto-response options:
--autorespond RESPONSE
Response to automatically provide for all prompts (NOT RECOMMENDED)
Output options:
--loglevel LEVEL
How much output will be printed for what program is doing, one of debug/info/warning/error/critical
|
--json |
Print output in JSON format |
|||
|
--csv |
Print output in CSV format |
SEE ALSO
The full documentation for rocm-smi is maintained as a Texinfo manual. If the info and rocm-smi programs are properly installed at your site, the command
info rocm-smi
should give you access to the complete manual.