3月 09, 2020

Linux Kernel in a Nutshell

by Greg Kroah-Hartman

Chapter 1, Introduction

This book will go into how to build and install a custom kernel, and provide some hints on how to enable specific options that you will probably wish to use for different situations.

Chapter 2, Requirements for Building and Using the Kernel

Compiler

Be warned that getting the most recent gcc version is not always a good idea. Some of the newest gcc releases don’t build the kernel properly.


$ gcc --version

Linker

An additional set of tools known as binutils is needed to do the linking and assembling of source files.


$ ld -v

make

make is a tool that walks the kernel source tree to determine which files need to be compiled, and then calls the compiler and other build tools to do the work in building the kernel.


$ make --version

After building, following files are generated under the kernel source folder:


Module.symvers
System.map
vmlinux

Tools to Use the Kernel

There are a small number of program for which the kernel version is important to them. If the kernel is upgraded, some of these packages may also need to be upgraded in order for the system to work properly.

util-linux
module-init-tools
e2fsprogs
jfsutils
quota-tools
nfs-utils
udev
procps
pcmciautils.

Chapter 3, Retrieving the Kernel Source

The distribution packages have the advantage of being built to be compatible with the compiler and other tools provided by the distribution.If you can create your own environment with the latest kernel, compiler, and other tools, you will be able to build exactly what you want.

Kernel development release cycle:

While the development of the new features was happening, the 2.6.17.1, 2.6.17.2, and other stable kernel versions were released, containing bug fixes and security updates.

Open Source

Setup the Build Environment


sudo apt-get gcc make perl 
sudo apt-get install build-essential libncurses-dev bison flex libssl-dev libelf-dev

Obtaining the source from www.kernel.org


$ mkdir linux; cd linux
$ wget https://cdn.kernel.org/pub/linux/kernel/v5.x/linux-5.5.8.tar.xz
$ unxz linux-5.5.8.tar.xz
$ tar xf linux-5.5.8.tar

Ubuntu

Setup the Build Environment

deb-src


sudo apt-get update


sudo apt-get build-dep linux linux-image-$(uname -r)
sudo apt-get install libncurses-dev flex bison openssl libssl-dev dkms libelf-dev libudev-dev libpci-dev libiberty-dev autoconf

Obtaining the source for an Ubuntu release

the kernel that is installed on your system


apt-get source linux-image-$(uname -r)

the most up to date sources for the Ubuntu release you are running


git clone git://kernel.ubuntu.com/ubuntu/ubuntu-release codename.git

Chapter 4, Configuring and Building

Independent Build

Modifying the configuration


cp -v /boot/config-$(uname -r) linux/linux-5.5.8/.config

.config

make config

step through every configuration option and ask you

make defconfig
make menuconfig
gconfig, xconfig

Building the kernel


make

Building Faster on Multiprocessor Machines

-j

Building Only a Portion of the Kernel

a specific directory


$ make drivers/usb/serial
$ make M=drivers/usb/serial
$ make

a specific file


$ make drivers/usb/serial/visor.ko

Source in One Place, Output in Another

Different Architectures

cross-compiling

architecture with the ARCH=
compiler with the CC=
cross-compile toolchain with the CROSS_COMPILE=


$ make ARCH=x86_64 defconfig
$ make ARCH=arm CROSS_COMPILE=/usr/local/bin/arm-linux-

ccache


$ make CC="ccache gcc"
$ make CC="ccache distcc"

ccache is a software development tool that caches the output of C/C++ compilation so that the next time, the same compilation can be avoided and the results can be taken from the cache.

Ubuntu

Modifying the configuration

first version number

debian/changelog


chmod a+x debian/rules
chmod a+x debian/scripts/*
chmod a+x debian/scripts/misc/*
fakeroot debian/rules clean
fakeroot debian/rules editconfigs

menuconfig

Building the kernel


fakeroot debian/rules clean
# quicker build:
fakeroot debian/rules binary-headers binary-generic binary-perarch
# if you need linux-tools or lowlatency kernel, run instead:
fakeroot debian/rules binary

.deb binary package files

Testing the new kernel


sudo dpkg -i linux*4.8.0-17.19*.deb
sudo reboot

Chapter 5, Installing and Booting from a Kernel

Using a Installation Scripts

If you have built any modules,you must install modules first:


# make modules_install

This will install all the modules that you have built and place them in the proper location in the filesystem for the new kernel to properly find. Modules are placed in the /lib/modules/kernel_version directory, where kernel_version is the kernel
version of the new kernel you have just built.


/lib/modules/5.5.8

Almost all distributions come with a script called installkernel that can be used by the kernel build system to automatically install a built kernel into the proper location and modify the bootloader so that nothing extra needs to be done by the developer
installkernel installs a new kernel image onto the system from the Linux source tree. It is called by the Linux kernel makefiles when make install is invoked there.
This will kick off the following process:

The kernel build system will verify that the kernel has been successfully built properly.
The build system will install the static kernel (vmlinuz-x.x.x) into the /boot directory and name this executable file based on the kernel version of the built kernel.
Any needed initial ramdisk images (initrd.img-x.x.x) will be automatically created, using the modules that have just been installed during the modules_install phase.

Ubuntu


$ sudo update-initramfs -c -k 5.5.8

Redhat


$ sudo mkinitrd /boot/initrd.img  $(uname -r)

initrd


$ lsinitramfs initrd.img-5.13.0-1007-intel

initrd

file

pure ramdisk
cpio archive


$ file /boot/initrd.img-5.3.0-40-generic 
/boot/initrd.img-5.3.0-40-generic: ASCII cpio archive (SVR4 with no CRC)

gzipped cpio archive


initrd-file: gzip compressed data, was "build.initramfs", from Unix

cramfs image

    
Linux Compressed ROM File System data

initrd

framebuffer

bootsplash

where modules can be placed that will automatically get loaded on boot up

initrd

The initrd ramdisk contains the modules required for mounting the root partition.
This initrd resides on the same partition on which kernel image is present.
So the kernel loads the initrd in memory, accesses the modules and mounts the root partition in read-only mode.

The bootloader program will be properly notified that a new kernel is present, and it will be added to the appropriate menu so the user can select it the next time the machine is booted.


$ sudo update-grub

update-grub

After this is finished, the kernel is successfully installed, and you can safely reboot and try out your new kernel image. Note that this installation does not overwrite any older kernel images, so if there is a problem with your new kernel image, the old kernel can be selected at boot time.

The following files are generated:


/boot/vmlinuz-5.5.8
/boot/initrd.img-5.5.8
/boot/config-5.5.8
/boot/System.map-5.5.8

And, /boot/grub/grub.cfg is modified.

Installing by Hands

If your distribution does not have a installkernel command, or you wish to just do the work by hands:

The modules must be installed


# make modules_install

The static kernel image must be copied into the /boot directory.


# make kernelversion
5.5.8


# make kernelversion
# cp arch/i386/boot/bzImage /boot/bzImage-KERNEL_VERSION
# cp System.map /boot/System.map-KERNEL_VERSION

Modify the bootloader so it knows about the new kernel.

The system initialization

The computer system undergoes several phases of boot strap processes from the power-on event until it offers the fully functional operating system (OS) to the user.

The typical boot strap process is like a four-stage rocket.
Each stage rocket hands over the system control to the next stage one.

Stage 1: the UEFI

boot manager

Stage 2: the boot loader

boot loader

kernel image

initrd

Stage 3: the mini-Debian system

kernel

the kernel converts initrd into a “normal” RAM disk and frees the memory used by initrd
if the root device is not /dev/ram0, the old (deprecated) change_root procedure is followed.
if the root device is /dev/ram0, the initrd image is then mounted as root
/sbin/init is executed (this can be any valid executable, including shell scripts; it is run with uid 0 and can do basically everything init can do).

an optional preparatory stage

initrd

initramfs

/init

a shell script program
a binary systemd program

Stage 4: the normal Debian system

kernel

/init

mounts the “real” root file system
places the root file system at the root directory using the pivot_root system call

The root filesystem is switched

init execs the /sbin/init on the new root filesystem, performing the usual boot sequence
the initrd file system is removed

GRUB

GRUB stands for GRand Unified Bootloader.
When a computer is turned on, BIOS finds the configured primary bootable device (usually the computer's hard disk) and loads and executes the initial bootstrap program from the master boot record (MBR). The MBR is the first sector of the hard disk, with zero as its offset (sectors counting starts at zero).

boot.img
diskboot.img
core.img

/boot/grub

normal.mod

All the file mentioned in the above are installed by grub-install.
Grub can be configured to automatically load a specified OS after a user-defined timeout. If the timeout is set to zero seconds, pressing and holding ⇧ Shift while the computer is booting makes it possible to access the boot menu.
In the operating system selection menu GRUB accepts a couple of commands:

By pressing e
By pressing c

Once boot options have been selected, GRUB loads the selected kernel into memory and passes control to the kernel.

GRUB is configured by the file /boot/grub/grub.cfg.
On a modern Ubuntu, to prevent from editing this file incorrectly, you edit a few settings in /etc/default/grub, and then run update-grub to rebuild it.
Look at some usable variables in /etc/default/grub:


GRUB_DEFAULT=saved
GRUB_TIMEOUT=2
GRUB_CMDLINE_LINUX_DEFAULT=”panic=5″

GRUB_DEFAULT=saved


$ grep menuentry /boot/grub/grub.cfg


sudo grub-set-default "Ubuntu, with Linux 5.3.0-40-generic"

GRUB_TIMEOUT=2
GRUB_CMDLINE_LINUX_DEFAULT=”panic=5″

Chapter 6, Upgrading a Kernel

Download the New Source

Which Patch Applies to Which Release?

Stable kernel patches apply to the base kernel version.
Base kernel release patches only apply to the previous base kernel version.

Finding the Patch

There are 2 patches needed to go from the 2.6.17.9 to the 2.6.17.11 release:

patch-2.6.17.9-10.bz2
patch-2.6.17.10-11.bz2

Applying the Patch

Decompress the patch


bzip2 -dv patch-2.6.17.9-10.bz2

Apply the patch files to the kernel directory:


cd linux-2.6.17.9
patch -p1 < ../patch-2.6.17.9-10

It is a good idea to look at the Makefile of the kernel to see the kernel version patched:


$ head -n 5 Makefile
VERSION = 2
PATCHLEVEL = 6
SUBLEVEL = 17
EXTRAVERSION = .10
NAME=Crazed Snow-Weasel

Reconfigure the Kernel

Once you have a working configuration, the only thing that is necessary is to update it with any new options that have been added to the kernel since the last release. To do this, the make oldconfig and make silentoldconfig options should be used.

make oldconfig

takes the current kernel configuration

asks the user what the new configuration value should be set to

make silentoldconfig

Chapter 7, Customizing a Kernel

To decide which drivers and configuration options are needed for your machine to work properly.

Using a Distribution Kernel

Most distribution kernels are built to include the configuration within the /proc filesystem.


$ cp /proc/config.gz ~/linux/
$ cd ~/linux
$ gzip -dv config.gz

The disadvantage of this kernel image is that you will have built almost every kernel module and driver that is present in the kernel source tree.
A virtual filesystem called sysfs provides a glimpse into how the different portions of the kernel are hooked together.
sysfs should always be mounted at the /sys location in your filesystem.

Example: Determining the network driver

find out net device


$ ls /sys/class/net/
eno1  lo  wlp2s0

find which driver is controlling the device


$ ls -l /sys/class/net/eno1/device/driver/module/drivers
total 0
lrwxrwxrwx 1 root root 0  三  12 15:32 pci:e1000e -> ../../../bus/pci/drivers/e1000e

find the kernel configuration option that controls the driver


$ find -type f -name Makefile | xargs grep e1000e
./drivers/net/ethernet/intel/Makefile:obj-$(CONFIG_E1000E) += e1000e/
./drivers/net/ethernet/intel/e1000e/Makefile:obj-$(CONFIG_E1000E) += e1000e.o
./drivers/net/ethernet/intel/e1000e/Makefile:e1000e-objs := 82571.o ich8lan.o 80003es2lan.o \

select the option to enable this module

Use a script that will do all of that work,


#!/bin/bash
#
# find_all_modules.sh
#
for i in `find /sys/ -name modalias -exec cat {} \;`; do
  /sbin/modprobe --config /dev/null --show-depends $i ;
done | rev | cut -f 1 -d '/' | rev | sort -u

Determining the Correct Module from Scratch

The easiest way to figure out which driver controls a new device is to build all of the different drivers of that type in the kernel source tree as modules, and let the udev startup process match the driver to the device.

Find the driver for a device


$ lspci | grep Ethernet
00:19.0 Ethernet controller: Intel Corporation 82577LM Gigabit Network Connection (rev 05)

PCI bus ID


$ ls /sys/bus/pci/devices/ | grep 00:19.0
0000:00:19.0
$ cat /sys/bus/pci/devices/0000:00:19.0/vendor
0x8086
$ cat /sys/bus/pci/devices/0000:00:19.0/device
0x10ea

Search include/linux/pci_ids.h for our vendor and product number


$ grep -i 0x8086 include/linux/pci_ids.h | grep VENDOR
#define PCI_VENDOR_ID_INTEL  0x8086
$ grep -i 0x10ea include/linux/pci_ids.h | grep DEVICE

look for driver source files referring to this vendor definition


$ grep -Rl PCI_VENDOR_ID_INTEL drivers/net
drivers/net/wireless/intel/ipw2x00/ipw2100.c
drivers/net/wireless/intel/ipw2x00/ipw2200.c
drivers/net/wireless/intel/iwlwifi/pcie/drv.c
drivers/net/wireless/intel/iwlegacy/common.h
drivers/net/can/pch_can.c
drivers/net/can/c_can/c_can_pci.c
drivers/net/ethernet/oki-semi/pch_gbe/pch_gbe_main.c
drivers/net/ethernet/broadcom/tg3.c
drivers/net/ethernet/dec/tulip/tulip_core.c
drivers/net/ethernet/intel/i40e/i40e_common.c
drivers/net/ethernet/intel/ixgbe/ixgbe_main.c
drivers/net/ethernet/intel/ixgb/ixgb_main.c
drivers/net/ethernet/intel/e1000/e1000_main.c
drivers/net/ethernet/intel/e1000/e1000.h
drivers/net/ethernet/intel/e100.c
drivers/net/ethernet/intel/i40evf/i40e_common.c

struct pci_device_id


$ lsusb | grep Mouse
Bus 002 Device 003: ID 046d:c077 Logitech, Inc. M105 Optical Mouse

Chapter 8, Kernel Configuration Recipes

Chapter 9, Kernel Boot Command-Line Parameter Reference

Chapter 10, Kernel Build Command-Line Reference

Chapter 11, Kernel Configuration Option Reference

Appendix A, Helpful Utilities

Inline assembly for x86 in Linux

GNU assembler syntax in brief

Register naming

Source and destination ordering

In any instruction, source comes first and destination follows. This differs from Intel syntax, where source comes after destination.
Transfers the contents of eax to ebx:


mov %eax, %ebx

Size of operand

The instructions are suffixed by b, w, or l, depending on whether the operand is a byte, word, or long.
This is not mandatory; GCC tries provide the appropriate suffix by reading the operands.
But specifying the suffixes manually improves the code readability and eliminates the possibility of the compilers guessing incorrectly.


movb %al, %bl
movw %ax, %bx
movl %eax, %ebx

Immediate operand

An immediate operand is specified by using $.
Move the value of 0xffff into eax register:


movl $0xffff, %eax

Indirect memory reference

Any indirect references to memory are done by using ( ).
Transfer the byte in the memory pointed by esi into al register


movb (%esi), %al

Inline assembly

GCC provides the special construct "asm" for inline assembly

Basic inline


asm("assembly code");

Examples,


asm("movl %ecx %eax"); /* moves the contents of ecx to eax */

Extended asm

In extended assembly, we can also specify the operands. It has the following format:


asm ( assembler template
    : output operands               (optional)
    : input operands                (optional)
    : list of clobbered registers       (optional)
    );

where:

assembler template
output operands
input operands
clobbered registers

Each operand is described by an operand-constraint string followed by the C expression in parentheses.
Commas separate the operands within each group.

Examples,

there are no output operands but there are input operands



         asm ("stosl"
             : /* no output registers */
             : "c" (count), "a" (fill_value), "D" (dest)
             : "%ecx", "%edi" 
             );

there are output operands and input operands


    int x = 10, y;

    asm ("movl %1, %%eax;
         "movl %%eax, %0;"
        :"=r"(y)    /* %0. y is output operand */
        :"r"(x)     /* %1. x is input operand */
        :"%eax");   /* %eax is clobbered register */

y is the output operand, referred to by %0
x is the input operand, referred to by %1
"r" and "=r" are constraints on the operands

"r"

"=r"

The clobbered register %eax after the third colon tells GCC that the value of %eax is to be modified inside "asm", so GCC won’t use this register to store any other value.
operands have a single % as prefix.


        movl %edx, %eax     /* x is moved to %eax */
        movl %eax, %edx     /* y is allocated in edx and updated */

Kernel Modules

Obtaining information

Modules are stored in:


$ ls /usr/lib/modules/$(uname -r)

To show information about a module


$ modinfo snd_hda_intel
filename:       /lib/modules/5.4.0-80-generic/kernel/sound/pci/hda/snd-hda-intel.ko
description:    Intel HDA driver
license:        GPL
srcversion:     2F60277DAE563209FA7BA4A
alias:          pci:v00001D17d00003288sv*sd*bc*sc*i*
...
alias:          pci:v00008086d00001C20sv*sd*bc*sc*i*
depends:        snd-hda-core,snd-hda-codec,snd-pcm,snd,snd-intel-dspcfg
...
parm:           index:Index value for Intel HD audio interface. (array of int)
...

To display the configuration of a particular module


$ modprobe -c | grep snd_hda_intel

List the dependencies of a module


$ modprobe --show-depends snd_hda_intel
insmod /lib/modules/5.4.0-80-generic/kernel/sound/soundcore.ko 
insmod /lib/modules/5.4.0-80-generic/kernel/sound/core/snd.ko 
insmod /lib/modules/5.4.0-80-generic/kernel/sound/core/snd-timer.ko 
insmod /lib/modules/5.4.0-80-generic/kernel/sound/core/snd-pcm.ko 
insmod /lib/modules/5.4.0-80-generic/kernel/sound/core/snd-hwdep.ko 
insmod /lib/modules/5.4.0-80-generic/kernel/sound/hda/snd-hda-core.ko 
insmod /lib/modules/5.4.0-80-generic/kernel/sound/pci/hda/snd-hda-codec.ko 
insmod /lib/modules/5.4.0-80-generic/kernel/sound/hda/snd-intel-dspcfg.ko 
insmod /lib/modules/5.4.0-80-generic/kernel/sound/pci/hda/snd-hda-intel.ko

Automatic module loading with systemd

All necessary modules loading is handled automatically by udev, so if you do not need to use any out-of-tree kernel modules, there is no need to put modules in any configuration file because that should be loaded at boot .

Kernel modules can be explicitly listed in files under /etc/modules-load.d/ for systemd to load them during boot.
systemd-modules-load.service reads files from /etc/modules-load.d/ which contain kernel modules to load during boot in a static list.
Each configuration file under /etc/modules-load.d/:

named in the style


		/etc/modules-load.d/*.conf

simply contain a list of kernel modules names to load, separated by newlines
Empty lines and lines whose first non-whitespace character is # or ; are ignored.

Setting module options

Manually set parameters at load time using modprobe


$ sudo modprobe module_name name=value

Using files in /etc/modprobe.d/

/etc/modprobe.d/

udev

modprobe


options module_name name=value

initramfs

mkinitcpio.conf

modconf

initramfs

Using kernel command line


module_name.name=value

Aliasing

Aliases are alternate names for a module.

Create an alias, /etc/modprobe.d/myalias.conf:


alias mymod really_long_module_name

It means you can use "modprobe my-mod" instead of "modprobe really_long_modulename".

Blacklisting

Blacklisting is a mechanism to prevent the kernel module from loading.

To blacklist a module :

Using files in /etc/modprobe.d/
Using kernel command line

Linux Kernel in a Nutshell

Linux Kernel in a Nutshell

Chapter 1, Introduction

Chapter 2, Requirements for Building and Using the Kernel

Compiler

Linker

make

Tools to Use the Kernel

Chapter 3, Retrieving the Kernel Source

Open Source

Ubuntu

Chapter 4, Configuring and Building

Independent Build

Ubuntu

Chapter 5, Installing and Booting from a Kernel

Using a Installation Scripts

Installing by Hands

The system initialization

GRUB

Chapter 6, Upgrading a Kernel

Download the New Source

Which Patch Applies to Which Release?

Finding the Patch

Applying the Patch

Reconfigure the Kernel

Chapter 7, Customizing a Kernel

Using a Distribution Kernel

Example: Determining the network driver

Determining the Correct Module from Scratch

Find the driver for a device

Chapter 8, Kernel Configuration Recipes

Chapter 9, Kernel Boot Command-Line Parameter Reference

Chapter 10, Kernel Build Command-Line Reference

Chapter 11, Kernel Configuration Option Reference

Appendix A, Helpful Utilities

Inline assembly for x86 in Linux

GNU assembler syntax in brief

Register naming

Source and destination ordering

Size of operand

Immediate operand

Indirect memory reference

Inline assembly

Basic inline

Extended asm

Kernel Modules

Obtaining information

Automatic module loading with systemd

Setting module options

Aliasing

Blacklisting

留言

熱門文章

A Tutorial on the Device Tree

Linux Modem Manager