1.. SPDX-License-Identifier: GPL-2.0 2 3===================================================== 4sysfs - _The_ filesystem for exporting kernel objects 5===================================================== 6 7Patrick Mochel <mochel@osdl.org> 8 9Mike Murphy <mamurph@cs.clemson.edu> 10 11:Revised: 16 August 2011 12:Original: 10 January 2003 13 14 15What it is: 16~~~~~~~~~~~ 17 18sysfs is a ram-based filesystem initially based on ramfs. It provides 19a means to export kernel data structures, their attributes, and the 20linkages between them to userspace. 21 22sysfs is tied inherently to the kobject infrastructure. Please read 23Documentation/kobject.txt for more information concerning the kobject 24interface. 25 26 27Using sysfs 28~~~~~~~~~~~ 29 30sysfs is always compiled in if CONFIG_SYSFS is defined. You can access 31it by doing:: 32 33 mount -t sysfs sysfs /sys 34 35 36Directory Creation 37~~~~~~~~~~~~~~~~~~ 38 39For every kobject that is registered with the system, a directory is 40created for it in sysfs. That directory is created as a subdirectory 41of the kobject's parent, expressing internal object hierarchies to 42userspace. Top-level directories in sysfs represent the common 43ancestors of object hierarchies; i.e. the subsystems the objects 44belong to. 45 46Sysfs internally stores a pointer to the kobject that implements a 47directory in the kernfs_node object associated with the directory. In 48the past this kobject pointer has been used by sysfs to do reference 49counting directly on the kobject whenever the file is opened or closed. 50With the current sysfs implementation the kobject reference count is 51only modified directly by the function sysfs_schedule_callback(). 52 53 54Attributes 55~~~~~~~~~~ 56 57Attributes can be exported for kobjects in the form of regular files in 58the filesystem. Sysfs forwards file I/O operations to methods defined 59for the attributes, providing a means to read and write kernel 60attributes. 61 62Attributes should be ASCII text files, preferably with only one value 63per file. It is noted that it may not be efficient to contain only one 64value per file, so it is socially acceptable to express an array of 65values of the same type. 66 67Mixing types, expressing multiple lines of data, and doing fancy 68formatting of data is heavily frowned upon. Doing these things may get 69you publicly humiliated and your code rewritten without notice. 70 71 72An attribute definition is simply:: 73 74 struct attribute { 75 char * name; 76 struct module *owner; 77 umode_t mode; 78 }; 79 80 81 int sysfs_create_file(struct kobject * kobj, const struct attribute * attr); 82 void sysfs_remove_file(struct kobject * kobj, const struct attribute * attr); 83 84 85A bare attribute contains no means to read or write the value of the 86attribute. Subsystems are encouraged to define their own attribute 87structure and wrapper functions for adding and removing attributes for 88a specific object type. 89 90For example, the driver model defines struct device_attribute like:: 91 92 struct device_attribute { 93 struct attribute attr; 94 ssize_t (*show)(struct device *dev, struct device_attribute *attr, 95 char *buf); 96 ssize_t (*store)(struct device *dev, struct device_attribute *attr, 97 const char *buf, size_t count); 98 }; 99 100 int device_create_file(struct device *, const struct device_attribute *); 101 void device_remove_file(struct device *, const struct device_attribute *); 102 103It also defines this helper for defining device attributes:: 104 105 #define DEVICE_ATTR(_name, _mode, _show, _store) \ 106 struct device_attribute dev_attr_##_name = __ATTR(_name, _mode, _show, _store) 107 108For example, declaring:: 109 110 static DEVICE_ATTR(foo, S_IWUSR | S_IRUGO, show_foo, store_foo); 111 112is equivalent to doing:: 113 114 static struct device_attribute dev_attr_foo = { 115 .attr = { 116 .name = "foo", 117 .mode = S_IWUSR | S_IRUGO, 118 }, 119 .show = show_foo, 120 .store = store_foo, 121 }; 122 123Note as stated in include/linux/kernel.h "OTHER_WRITABLE? Generally 124considered a bad idea." so trying to set a sysfs file writable for 125everyone will fail reverting to RO mode for "Others". 126 127For the common cases sysfs.h provides convenience macros to make 128defining attributes easier as well as making code more concise and 129readable. The above case could be shortened to: 130 131static struct device_attribute dev_attr_foo = __ATTR_RW(foo); 132 133the list of helpers available to define your wrapper function is: 134 135__ATTR_RO(name): 136 assumes default name_show and mode 0444 137__ATTR_WO(name): 138 assumes a name_store only and is restricted to mode 139 0200 that is root write access only. 140__ATTR_RO_MODE(name, mode): 141 fore more restrictive RO access currently 142 only use case is the EFI System Resource Table 143 (see drivers/firmware/efi/esrt.c) 144__ATTR_RW(name): 145 assumes default name_show, name_store and setting 146 mode to 0644. 147__ATTR_NULL: 148 which sets the name to NULL and is used as end of list 149 indicator (see: kernel/workqueue.c) 150 151Subsystem-Specific Callbacks 152~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 154When a subsystem defines a new attribute type, it must implement a 155set of sysfs operations for forwarding read and write calls to the 156show and store methods of the attribute owners:: 157 158 struct sysfs_ops { 159 ssize_t (*show)(struct kobject *, struct attribute *, char *); 160 ssize_t (*store)(struct kobject *, struct attribute *, const char *, size_t); 161 }; 162 163[ Subsystems should have already defined a struct kobj_type as a 164descriptor for this type, which is where the sysfs_ops pointer is 165stored. See the kobject documentation for more information. ] 166 167When a file is read or written, sysfs calls the appropriate method 168for the type. The method then translates the generic struct kobject 169and struct attribute pointers to the appropriate pointer types, and 170calls the associated methods. 171 172 173To illustrate:: 174 175 #define to_dev(obj) container_of(obj, struct device, kobj) 176 #define to_dev_attr(_attr) container_of(_attr, struct device_attribute, attr) 177 178 static ssize_t dev_attr_show(struct kobject *kobj, struct attribute *attr, 179 char *buf) 180 { 181 struct device_attribute *dev_attr = to_dev_attr(attr); 182 struct device *dev = to_dev(kobj); 183 ssize_t ret = -EIO; 184 185 if (dev_attr->show) 186 ret = dev_attr->show(dev, dev_attr, buf); 187 if (ret >= (ssize_t)PAGE_SIZE) { 188 printk("dev_attr_show: %pS returned bad count\n", 189 dev_attr->show); 190 } 191 return ret; 192 } 193 194 195 196Reading/Writing Attribute Data 197~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 199To read or write attributes, show() or store() methods must be 200specified when declaring the attribute. The method types should be as 201simple as those defined for device attributes:: 202 203 ssize_t (*show)(struct device *dev, struct device_attribute *attr, char *buf); 204 ssize_t (*store)(struct device *dev, struct device_attribute *attr, 205 const char *buf, size_t count); 206 207IOW, they should take only an object, an attribute, and a buffer as parameters. 208 209 210sysfs allocates a buffer of size (PAGE_SIZE) and passes it to the 211method. Sysfs will call the method exactly once for each read or 212write. This forces the following behavior on the method 213implementations: 214 215- On read(2), the show() method should fill the entire buffer. 216 Recall that an attribute should only be exporting one value, or an 217 array of similar values, so this shouldn't be that expensive. 218 219 This allows userspace to do partial reads and forward seeks 220 arbitrarily over the entire file at will. If userspace seeks back to 221 zero or does a pread(2) with an offset of '0' the show() method will 222 be called again, rearmed, to fill the buffer. 223 224- On write(2), sysfs expects the entire buffer to be passed during the 225 first write. Sysfs then passes the entire buffer to the store() method. 226 A terminating null is added after the data on stores. This makes 227 functions like sysfs_streq() safe to use. 228 229 When writing sysfs files, userspace processes should first read the 230 entire file, modify the values it wishes to change, then write the 231 entire buffer back. 232 233 Attribute method implementations should operate on an identical 234 buffer when reading and writing values. 235 236Other notes: 237 238- Writing causes the show() method to be rearmed regardless of current 239 file position. 240 241- The buffer will always be PAGE_SIZE bytes in length. On i386, this 242 is 4096. 243 244- show() methods should return the number of bytes printed into the 245 buffer. This is the return value of scnprintf(). 246 247- show() must not use snprintf() when formatting the value to be 248 returned to user space. If you can guarantee that an overflow 249 will never happen you can use sprintf() otherwise you must use 250 scnprintf(). 251 252- store() should return the number of bytes used from the buffer. If the 253 entire buffer has been used, just return the count argument. 254 255- show() or store() can always return errors. If a bad value comes 256 through, be sure to return an error. 257 258- The object passed to the methods will be pinned in memory via sysfs 259 referencing counting its embedded object. However, the physical 260 entity (e.g. device) the object represents may not be present. Be 261 sure to have a way to check this, if necessary. 262 263 264A very simple (and naive) implementation of a device attribute is:: 265 266 static ssize_t show_name(struct device *dev, struct device_attribute *attr, 267 char *buf) 268 { 269 return scnprintf(buf, PAGE_SIZE, "%s\n", dev->name); 270 } 271 272 static ssize_t store_name(struct device *dev, struct device_attribute *attr, 273 const char *buf, size_t count) 274 { 275 snprintf(dev->name, sizeof(dev->name), "%.*s", 276 (int)min(count, sizeof(dev->name) - 1), buf); 277 return count; 278 } 279 280 static DEVICE_ATTR(name, S_IRUGO, show_name, store_name); 281 282 283(Note that the real implementation doesn't allow userspace to set the 284name for a device.) 285 286 287Top Level Directory Layout 288~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 290The sysfs directory arrangement exposes the relationship of kernel 291data structures. 292 293The top level sysfs directory looks like:: 294 295 block/ 296 bus/ 297 class/ 298 dev/ 299 devices/ 300 firmware/ 301 net/ 302 fs/ 303 304devices/ contains a filesystem representation of the device tree. It maps 305directly to the internal kernel device tree, which is a hierarchy of 306struct device. 307 308bus/ contains flat directory layout of the various bus types in the 309kernel. Each bus's directory contains two subdirectories:: 310 311 devices/ 312 drivers/ 313 314devices/ contains symlinks for each device discovered in the system 315that point to the device's directory under root/. 316 317drivers/ contains a directory for each device driver that is loaded 318for devices on that particular bus (this assumes that drivers do not 319span multiple bus types). 320 321fs/ contains a directory for some filesystems. Currently each 322filesystem wanting to export attributes must create its own hierarchy 323below fs/ (see ./fuse.txt for an example). 324 325dev/ contains two directories char/ and block/. Inside these two 326directories there are symlinks named <major>:<minor>. These symlinks 327point to the sysfs directory for the given device. /sys/dev provides a 328quick way to lookup the sysfs interface for a device from the result of 329a stat(2) operation. 330 331More information can driver-model specific features can be found in 332Documentation/driver-api/driver-model/. 333 334 335TODO: Finish this section. 336 337 338Current Interfaces 339~~~~~~~~~~~~~~~~~~ 340 341The following interface layers currently exist in sysfs: 342 343 344devices (include/linux/device.h) 345-------------------------------- 346Structure:: 347 348 struct device_attribute { 349 struct attribute attr; 350 ssize_t (*show)(struct device *dev, struct device_attribute *attr, 351 char *buf); 352 ssize_t (*store)(struct device *dev, struct device_attribute *attr, 353 const char *buf, size_t count); 354 }; 355 356Declaring:: 357 358 DEVICE_ATTR(_name, _mode, _show, _store); 359 360Creation/Removal:: 361 362 int device_create_file(struct device *dev, const struct device_attribute * attr); 363 void device_remove_file(struct device *dev, const struct device_attribute * attr); 364 365 366bus drivers (include/linux/device.h) 367------------------------------------ 368Structure:: 369 370 struct bus_attribute { 371 struct attribute attr; 372 ssize_t (*show)(struct bus_type *, char * buf); 373 ssize_t (*store)(struct bus_type *, const char * buf, size_t count); 374 }; 375 376Declaring:: 377 378 static BUS_ATTR_RW(name); 379 static BUS_ATTR_RO(name); 380 static BUS_ATTR_WO(name); 381 382Creation/Removal:: 383 384 int bus_create_file(struct bus_type *, struct bus_attribute *); 385 void bus_remove_file(struct bus_type *, struct bus_attribute *); 386 387 388device drivers (include/linux/device.h) 389--------------------------------------- 390 391Structure:: 392 393 struct driver_attribute { 394 struct attribute attr; 395 ssize_t (*show)(struct device_driver *, char * buf); 396 ssize_t (*store)(struct device_driver *, const char * buf, 397 size_t count); 398 }; 399 400Declaring:: 401 402 DRIVER_ATTR_RO(_name) 403 DRIVER_ATTR_RW(_name) 404 405Creation/Removal:: 406 407 int driver_create_file(struct device_driver *, const struct driver_attribute *); 408 void driver_remove_file(struct device_driver *, const struct driver_attribute *); 409 410 411Documentation 412~~~~~~~~~~~~~ 413 414The sysfs directory structure and the attributes in each directory define an 415ABI between the kernel and user space. As for any ABI, it is important that 416this ABI is stable and properly documented. All new sysfs attributes must be 417documented in Documentation/ABI. See also Documentation/ABI/README for more 418information. 419