Skip to main content

What happens when you type ls -l in a Linux shell

ls -l is a relatively simple and straightforward command. But what actually takes place behind the screen will get you to appreciate how little work is required from you to acquire the output you desire.
ls is a shell command that lists files and directories within a directory. With the -l option, ls will list out files and directories in long list format. You will encounter something like this:
Displaying file type, permissions, owner, size, date of creation, and file name.
To understand how shell interpret a user’s input, it helps to get a basic understanding of how an operating system is organized.
An operating system can be broken down into 3 main chunks: Hardware holds the main memory, controls the CPU, disks, network interfaces, etc. Following hardware is kernel, which is the core of the operating system. It acts as an interface between hardware and any running program. User interface sits in the user process level. This is what you see on your screen.
The kernel is also responsible for determining which processes are allowed to use the CPU as well as memory and system calls management. We can’t and shouldn’t be trusted to access kernel directly, because we can screw things up royally on accident.
But programs in the user’s space can interact with the kernel via the kernel feature, system calls(syscalls). System calls allow applications to communicate with the kernel. An application can call functions in a library that rely on the system call interface to instruct the kernel to execute on behalf of the application.
Computers understand binary language made up of 0’s and 1’s. Instead of figuring out the binary code for a command(good luck), user’s input is translated to machine readable language via a command line interpreter. Shell is a type of command line interpreter — an interface to the kernel that allows user to execute commands in the operating system.

What happens when you type ls -l and hit enter — in plain English:

Everything is a file in Linux and other UNIX-like operating systems. The command ls is a file containing the program to execute the ls command. So when you type in a command, shell will attempt to locate the command(file) by searching through its disk and directories in your PATH. If the program is found, shell will clone itself and run the corresponding program in the clone/new process. The cloned process will then be replaced with the program it wants to run, and the original process will wait until the new process completes its execution before moving forward. The output can be written in standard output, which is what you see in your terminal. It can also be piped, or redirected, into a file or even to another command. When we type ls and hit enter, we are typing our command from the standard input. Similarly, you can invoke the command by having it piped from a file or a command. After the command is executed, control will be returned to the original process and you will be greeted with the login shell, once again.

What happens when you type ls -l and hit enter — in C

Steps to processing a user command:
  • Get user input
  • Check for expansions and alias.
  • Check builtin.
  • Check PATH.
  • If file exists: fork and execute program in the child process.
  • After child process terminates, parent process will print prompt to user.
Get user input:
Upon entering shell, you will encounter the prompt(prompt string 1, or PS1). After you input ls -l to the command line, shell reads your input by using getline() (man 3 getline). Getline reads from the standard input file stream, STDIN, and stores the user input into a buffer as a string.
The buffer(containing the input "ls -l") is then broken down into tokens and stored in an array: {"ls", "-l", "NULL"}. args[0] = "ls", args[1] = "-l", args[2] = "NULL"
Check for expansions and alias:
Prior to searching for the ls command, shell will check for special characters that need to be expanded(i.e., *, $, etc). For example, if the user input is ls *.c, this is when *.c will be replaced with all .c files within current directory. Shell also checks for aliases(an alias in shell is similar to a keyboard shortcut). If ls is an alias, shell will replace ls with its corresponding value here. Say we set up an alias called apple: alias apple="cat". After entering apple main.c, our command will be processed as cat main.c.
Check builtin:
If ls is not an alias, shell will then check whether the command ls is a built-in command. A built-in command is a command that is built into and executed in shell itself. Examples: cd, echo, alias, help, read, type. Additional bash builtins.
Check PATH:
If ls is not a builtin, shell will look for ls in the environment variable, PATH. First, a copy of PATH will be tokenized(separated by :) with each representing the path to a directory. A copy is necessary here because we don’t actually want to alter PATH. We want to butcher a copy of PATH into tokens in order to check against the entered command. Now shell will append/ls — slash + our tokenized input — to the end of each directory listed in PATH to check whether or not the file exists.
The tokens are delimited by a colon, so here, the user input will be checked against /usr/local/sbin, /usr/local/bin, usr/sbin, usr/bin, etc.
/ls will be appended to the end of each token: /usr/local/sbin/ls , /usr/local/bin/ls, etc.
After searching through each token, an error will occur if ls is not an existing file.
Fork and execute program in the child process:
Upon locating the ls file in PATH, shell will open and run the file ls by calling fork(). fork() is a system call for creating a clone of the current process. All user processes on a Linux system stem off as a result of fork(). Both processes will run the instructions following fork(). So to distinguish from the two processes,fork() returns the child’s process id to the parent, 0 in the child process, and -1 to indicate an error.
  • Shell(calling process/parent process)will call fork() to create a copy of itself(child process). The clone/child process will have its own system process ID. Running a program in a separate process protects the parent/current process should the program cause any problems upon execution.
  • The child process then calls execve() to run the user’s command,ls. execve() will replace the current(child) process with the program it calls( ls, in this case).
  • Parent process waits until the child competes its execution via wait().
Flow of processes and syscalls
Upon completion, the child process will terminate and control will be returned back to the parent process. You will be greeted with the prompt once ls is executed and will be able to enter the next command :)
When you log on to your system, shell login maintains an environment in which you can run programs. An environment contains environment variables that provide information about the system behavior. Inputting env on the command line will print a list of environment variables to standard output. Inputting echo $PATH will print out the specified environment variable — in this case, PATH.
When you type in a command, shell will search in a list of directories to see whether that program/file exists. This list of directories is stored in the environment variable PATH, and it’s set when you log on. More on — Environment Variable, PATH.

Comments

Popular posts from this blog

OWASP Top 10 Threats and Mitigations Exam - Single Select

Last updated 4 Aug 11 Course Title: OWASP Top 10 Threats and Mitigation Exam Questions - Single Select 1) Which of the following consequences is most likely to occur due to an injection attack? Spoofing Cross-site request forgery Denial of service   Correct Insecure direct object references 2) Your application is created using a language that does not support a clear distinction between code and data. Which vulnerability is most likely to occur in your application? Injection   Correct Insecure direct object references Failure to restrict URL access Insufficient transport layer protection 3) Which of the following scenarios is most likely to cause an injection attack? Unvalidated input is embedded in an instruction stream.   Correct Unvalidated input can be distinguished from valid instructions. A Web application does not validate a client’s access to a resource. A Web action performs an operation on behalf of the user without checkin...

CKA Simulator Kubernetes 1.22

  https://killer.sh Pre Setup Once you've gained access to your terminal it might be wise to spend ~1 minute to setup your environment. You could set these: alias k = kubectl                         # will already be pre-configured export do = "--dry-run=client -o yaml"     # k get pod x $do export now = "--force --grace-period 0"   # k delete pod x $now Vim To make vim use 2 spaces for a tab edit ~/.vimrc to contain: set tabstop=2 set expandtab set shiftwidth=2 More setup suggestions are in the tips section .     Question 1 | Contexts Task weight: 1%   You have access to multiple clusters from your main terminal through kubectl contexts. Write all those context names into /opt/course/1/contexts . Next write a command to display the current context into /opt/course/1/context_default_kubectl.sh , the command should use kubectl . Finally write a second command doing the same thing into ...