1.. SPDX-License-Identifier: GPL-2.0
2
3=====================================================
4sysfs - _The_ filesystem for exporting kernel objects
5=====================================================
6
7Patrick Mochel	<mochel@osdl.org>
8
9Mike Murphy <mamurph@cs.clemson.edu>
10
11:Revised:    16 August 2011
12:Original:   10 January 2003
13
14
15What it is:
16~~~~~~~~~~~
17
18sysfs is a ram-based filesystem initially based on ramfs. It provides
19a means to export kernel data structures, their attributes, and the
20linkages between them to userspace.
21
22sysfs is tied inherently to the kobject infrastructure. Please read
23Documentation/core-api/kobject.rst for more information concerning the kobject
24interface.
25
26
27Using sysfs
28~~~~~~~~~~~
29
30sysfs is always compiled in if CONFIG_SYSFS is defined. You can access
31it by doing::
32
33    mount -t sysfs sysfs /sys
34
35
36Directory Creation
37~~~~~~~~~~~~~~~~~~
38
39For every kobject that is registered with the system, a directory is
40created for it in sysfs. That directory is created as a subdirectory
41of the kobject's parent, expressing internal object hierarchies to
42userspace. Top-level directories in sysfs represent the common
43ancestors of object hierarchies; i.e. the subsystems the objects
44belong to.
45
46Sysfs internally stores a pointer to the kobject that implements a
47directory in the kernfs_node object associated with the directory. In
48the past this kobject pointer has been used by sysfs to do reference
49counting directly on the kobject whenever the file is opened or closed.
50With the current sysfs implementation the kobject reference count is
51only modified directly by the function sysfs_schedule_callback().
52
53
54Attributes
55~~~~~~~~~~
56
57Attributes can be exported for kobjects in the form of regular files in
58the filesystem. Sysfs forwards file I/O operations to methods defined
59for the attributes, providing a means to read and write kernel
60attributes.
61
62Attributes should be ASCII text files, preferably with only one value
63per file. It is noted that it may not be efficient to contain only one
64value per file, so it is socially acceptable to express an array of
65values of the same type.
66
67Mixing types, expressing multiple lines of data, and doing fancy
68formatting of data is heavily frowned upon. Doing these things may get
69you publicly humiliated and your code rewritten without notice.
70
71
72An attribute definition is simply::
73
74    struct attribute {
75	    char                    * name;
76	    struct module		*owner;
77	    umode_t                 mode;
78    };
79
80
81    int sysfs_create_file(struct kobject * kobj, const struct attribute * attr);
82    void sysfs_remove_file(struct kobject * kobj, const struct attribute * attr);
83
84
85A bare attribute contains no means to read or write the value of the
86attribute. Subsystems are encouraged to define their own attribute
87structure and wrapper functions for adding and removing attributes for
88a specific object type.
89
90For example, the driver model defines struct device_attribute like::
91
92    struct device_attribute {
93	    struct attribute	attr;
94	    ssize_t (*show)(struct device *dev, struct device_attribute *attr,
95			    char *buf);
96	    ssize_t (*store)(struct device *dev, struct device_attribute *attr,
97			    const char *buf, size_t count);
98    };
99
100    int device_create_file(struct device *, const struct device_attribute *);
101    void device_remove_file(struct device *, const struct device_attribute *);
102
103It also defines this helper for defining device attributes::
104
105    #define DEVICE_ATTR(_name, _mode, _show, _store) \
106    struct device_attribute dev_attr_##_name = __ATTR(_name, _mode, _show, _store)
107
108For example, declaring::
109
110    static DEVICE_ATTR(foo, S_IWUSR | S_IRUGO, show_foo, store_foo);
111
112is equivalent to doing::
113
114    static struct device_attribute dev_attr_foo = {
115	    .attr = {
116		    .name = "foo",
117		    .mode = S_IWUSR | S_IRUGO,
118	    },
119	    .show = show_foo,
120	    .store = store_foo,
121    };
122
123Note as stated in include/linux/kernel.h "OTHER_WRITABLE?  Generally
124considered a bad idea." so trying to set a sysfs file writable for
125everyone will fail reverting to RO mode for "Others".
126
127For the common cases sysfs.h provides convenience macros to make
128defining attributes easier as well as making code more concise and
129readable. The above case could be shortened to:
130
131static struct device_attribute dev_attr_foo = __ATTR_RW(foo);
132
133the list of helpers available to define your wrapper function is:
134
135__ATTR_RO(name):
136		 assumes default name_show and mode 0444
137__ATTR_WO(name):
138		 assumes a name_store only and is restricted to mode
139                 0200 that is root write access only.
140__ATTR_RO_MODE(name, mode):
141	         fore more restrictive RO access currently
142                 only use case is the EFI System Resource Table
143                 (see drivers/firmware/efi/esrt.c)
144__ATTR_RW(name):
145	         assumes default name_show, name_store and setting
146                 mode to 0644.
147__ATTR_NULL:
148	         which sets the name to NULL and is used as end of list
149                 indicator (see: kernel/workqueue.c)
150
151Subsystem-Specific Callbacks
152~~~~~~~~~~~~~~~~~~~~~~~~~~~~
153
154When a subsystem defines a new attribute type, it must implement a
155set of sysfs operations for forwarding read and write calls to the
156show and store methods of the attribute owners::
157
158    struct sysfs_ops {
159	    ssize_t (*show)(struct kobject *, struct attribute *, char *);
160	    ssize_t (*store)(struct kobject *, struct attribute *, const char *, size_t);
161    };
162
163[ Subsystems should have already defined a struct kobj_type as a
164descriptor for this type, which is where the sysfs_ops pointer is
165stored. See the kobject documentation for more information. ]
166
167When a file is read or written, sysfs calls the appropriate method
168for the type. The method then translates the generic struct kobject
169and struct attribute pointers to the appropriate pointer types, and
170calls the associated methods.
171
172
173To illustrate::
174
175    #define to_dev(obj) container_of(obj, struct device, kobj)
176    #define to_dev_attr(_attr) container_of(_attr, struct device_attribute, attr)
177
178    static ssize_t dev_attr_show(struct kobject *kobj, struct attribute *attr,
179				char *buf)
180    {
181	    struct device_attribute *dev_attr = to_dev_attr(attr);
182	    struct device *dev = to_dev(kobj);
183	    ssize_t ret = -EIO;
184
185	    if (dev_attr->show)
186		    ret = dev_attr->show(dev, dev_attr, buf);
187	    if (ret >= (ssize_t)PAGE_SIZE) {
188		    printk("dev_attr_show: %pS returned bad count\n",
189				    dev_attr->show);
190	    }
191	    return ret;
192    }
193
194
195
196Reading/Writing Attribute Data
197~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
198
199To read or write attributes, show() or store() methods must be
200specified when declaring the attribute. The method types should be as
201simple as those defined for device attributes::
202
203    ssize_t (*show)(struct device *dev, struct device_attribute *attr, char *buf);
204    ssize_t (*store)(struct device *dev, struct device_attribute *attr,
205		    const char *buf, size_t count);
206
207IOW, they should take only an object, an attribute, and a buffer as parameters.
208
209
210sysfs allocates a buffer of size (PAGE_SIZE) and passes it to the
211method. Sysfs will call the method exactly once for each read or
212write. This forces the following behavior on the method
213implementations:
214
215- On read(2), the show() method should fill the entire buffer.
216  Recall that an attribute should only be exporting one value, or an
217  array of similar values, so this shouldn't be that expensive.
218
219  This allows userspace to do partial reads and forward seeks
220  arbitrarily over the entire file at will. If userspace seeks back to
221  zero or does a pread(2) with an offset of '0' the show() method will
222  be called again, rearmed, to fill the buffer.
223
224- On write(2), sysfs expects the entire buffer to be passed during the
225  first write. Sysfs then passes the entire buffer to the store() method.
226  A terminating null is added after the data on stores. This makes
227  functions like sysfs_streq() safe to use.
228
229  When writing sysfs files, userspace processes should first read the
230  entire file, modify the values it wishes to change, then write the
231  entire buffer back.
232
233  Attribute method implementations should operate on an identical
234  buffer when reading and writing values.
235
236Other notes:
237
238- Writing causes the show() method to be rearmed regardless of current
239  file position.
240
241- The buffer will always be PAGE_SIZE bytes in length. On i386, this
242  is 4096.
243
244- show() methods should return the number of bytes printed into the
245  buffer. This is the return value of scnprintf().
246
247- show() must not use snprintf() when formatting the value to be
248  returned to user space. If you can guarantee that an overflow
249  will never happen you can use sprintf() otherwise you must use
250  scnprintf().
251
252- store() should return the number of bytes used from the buffer. If the
253  entire buffer has been used, just return the count argument.
254
255- show() or store() can always return errors. If a bad value comes
256  through, be sure to return an error.
257
258- The object passed to the methods will be pinned in memory via sysfs
259  referencing counting its embedded object. However, the physical
260  entity (e.g. device) the object represents may not be present. Be
261  sure to have a way to check this, if necessary.
262
263
264A very simple (and naive) implementation of a device attribute is::
265
266    static ssize_t show_name(struct device *dev, struct device_attribute *attr,
267			    char *buf)
268    {
269	    return scnprintf(buf, PAGE_SIZE, "%s\n", dev->name);
270    }
271
272    static ssize_t store_name(struct device *dev, struct device_attribute *attr,
273			    const char *buf, size_t count)
274    {
275	    snprintf(dev->name, sizeof(dev->name), "%.*s",
276		    (int)min(count, sizeof(dev->name) - 1), buf);
277	    return count;
278    }
279
280    static DEVICE_ATTR(name, S_IRUGO, show_name, store_name);
281
282
283(Note that the real implementation doesn't allow userspace to set the
284name for a device.)
285
286
287Top Level Directory Layout
288~~~~~~~~~~~~~~~~~~~~~~~~~~
289
290The sysfs directory arrangement exposes the relationship of kernel
291data structures.
292
293The top level sysfs directory looks like::
294
295    block/
296    bus/
297    class/
298    dev/
299    devices/
300    firmware/
301    net/
302    fs/
303
304devices/ contains a filesystem representation of the device tree. It maps
305directly to the internal kernel device tree, which is a hierarchy of
306struct device.
307
308bus/ contains flat directory layout of the various bus types in the
309kernel. Each bus's directory contains two subdirectories::
310
311	devices/
312	drivers/
313
314devices/ contains symlinks for each device discovered in the system
315that point to the device's directory under root/.
316
317drivers/ contains a directory for each device driver that is loaded
318for devices on that particular bus (this assumes that drivers do not
319span multiple bus types).
320
321fs/ contains a directory for some filesystems.  Currently each
322filesystem wanting to export attributes must create its own hierarchy
323below fs/ (see ./fuse.txt for an example).
324
325dev/ contains two directories char/ and block/. Inside these two
326directories there are symlinks named <major>:<minor>.  These symlinks
327point to the sysfs directory for the given device.  /sys/dev provides a
328quick way to lookup the sysfs interface for a device from the result of
329a stat(2) operation.
330
331More information can driver-model specific features can be found in
332Documentation/driver-api/driver-model/.
333
334
335TODO: Finish this section.
336
337
338Current Interfaces
339~~~~~~~~~~~~~~~~~~
340
341The following interface layers currently exist in sysfs:
342
343
344devices (include/linux/device.h)
345--------------------------------
346Structure::
347
348    struct device_attribute {
349	    struct attribute	attr;
350	    ssize_t (*show)(struct device *dev, struct device_attribute *attr,
351			    char *buf);
352	    ssize_t (*store)(struct device *dev, struct device_attribute *attr,
353			    const char *buf, size_t count);
354    };
355
356Declaring::
357
358    DEVICE_ATTR(_name, _mode, _show, _store);
359
360Creation/Removal::
361
362    int device_create_file(struct device *dev, const struct device_attribute * attr);
363    void device_remove_file(struct device *dev, const struct device_attribute * attr);
364
365
366bus drivers (include/linux/device.h)
367------------------------------------
368Structure::
369
370    struct bus_attribute {
371	    struct attribute        attr;
372	    ssize_t (*show)(struct bus_type *, char * buf);
373	    ssize_t (*store)(struct bus_type *, const char * buf, size_t count);
374    };
375
376Declaring::
377
378    static BUS_ATTR_RW(name);
379    static BUS_ATTR_RO(name);
380    static BUS_ATTR_WO(name);
381
382Creation/Removal::
383
384    int bus_create_file(struct bus_type *, struct bus_attribute *);
385    void bus_remove_file(struct bus_type *, struct bus_attribute *);
386
387
388device drivers (include/linux/device.h)
389---------------------------------------
390
391Structure::
392
393    struct driver_attribute {
394	    struct attribute        attr;
395	    ssize_t (*show)(struct device_driver *, char * buf);
396	    ssize_t (*store)(struct device_driver *, const char * buf,
397			    size_t count);
398    };
399
400Declaring::
401
402    DRIVER_ATTR_RO(_name)
403    DRIVER_ATTR_RW(_name)
404
405Creation/Removal::
406
407    int driver_create_file(struct device_driver *, const struct driver_attribute *);
408    void driver_remove_file(struct device_driver *, const struct driver_attribute *);
409
410
411Documentation
412~~~~~~~~~~~~~
413
414The sysfs directory structure and the attributes in each directory define an
415ABI between the kernel and user space. As for any ABI, it is important that
416this ABI is stable and properly documented. All new sysfs attributes must be
417documented in Documentation/ABI. See also Documentation/ABI/README for more
418information.
419