I wonder, is there any program that can take a bash script as input and print out all the bash commands it would run? I mean a program that would unroll loops and expand environment variables without performing any destructive action or calling any external binaries. A dry run of sorts.

  • RandomLegend [He/Him]@lemmy.dbzer0.com
    2 years ago

    Found this over on Stack Overflow:

    You could try running the script under Kornshell. When you execute a script with ksh -D, it reads the commands and checks them for syntax, but doesn’t execute them. Combine that with set -xv, and you’ll print out the commands that will be executed.

    You can also use set -n for the same effect. Kornshell and BASH are fairly compatible with each other. If it’s a pure Bourne shell script, both Kornshell and BASH will execute it pretty much the same.

    You can also run ksh -u which will cause unset shell variables to cause the script to fail. However, that wouldn’t have caught the catless cat of a nonexistent file. In that case, the shell variable was set. It was set to null.

    Of course, you could run the script under a restricted shell too, but that’s probably not going to uninstall the package.

    That’s the best you can probably do.
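A minimal sketch of the `-n` idea using bash itself, assuming a throwaway demo script in a temp directory (the file names `demo.sh` and `marker` are made up for illustration). It shows that `-n` only parses, and that adding `-v` echoes the source lines as they are read rather than the expanded commands:

```shell
#!/usr/bin/env bash
# Hypothetical demo script in a temp dir; all names here are illustrative.
workdir=$(mktemp -d)
demo="$workdir/demo.sh"
marker="$workdir/marker"

# The heredoc delimiter is unquoted, so $marker is expanded now,
# while \$i stays a literal $i inside the demo script.
cat > "$demo" <<EOF
touch "$marker"
for i in 1 2 3; do
  echo "iteration \$i"
done
EOF

# -n (noexec) parses the script without running any command.
bash -n "$demo"
syntax_status=$?

# -v echoes each input line as it is read. Combined with -n nothing runs,
# so the loop is not unrolled and \$i is never expanded.
listing=$(bash -nv "$demo" 2>&1)
printf '%s\n' "$listing"
```

As far as I can tell, `set -x` adds nothing here, because the trace is produced when a command is executed and `-n` prevents execution. So this is a syntax check plus a listing, not the expanded dry run OP asked for.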

    • heartlessevil@lemmy.one
      2 years ago

      I agree that’s probably the best you can do, but if it is just printing the statements it sees and not actually running them, the behavior when the script really runs could be very different. For example:

      touch a_file
      if test -f a_file; then 
        rm -rf / 
      fi
      

      To do what OP is asking for would require running inside a sandbox.
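One rough way to see this without a real sandbox is to run the same shape of script in a scratch directory with the destructive command shadowed by a logging function. This is only a sketch under assumptions: the stub `rm` function and the path `./some_dir` are made up, and a stub like this does nothing against `/bin/rm`, `command rm`, or any other binary that deletes files, so it is no substitute for a VM or container.

```shell
#!/usr/bin/env bash
scratch=$(mktemp -d)

# Everything runs in a subshell, so the cd and the stub do not leak out.
dry_run_log=$(
  cd "$scratch" || exit 1

  # Hypothetical stub: shadows rm with a function that only logs its arguments.
  rm() { printf '[dry-run] rm %s\n' "$*"; }

  # The branch is only taken because the earlier touch really ran.
  touch a_file
  if test -f a_file; then
    rm -rf ./some_dir
  fi
)
printf '%s\n' "$dry_run_log"
```

The log line only appears because `touch` was actually executed first, which is the point above: the commands a script runs depend on state that earlier commands create.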

      • RandomLegend [He/Him]@lemmy.dbzer0.com
        2 years ago

        Yeah, I think a sandbox would be the best solution.

        Depending on what script OP is trying to run it would be best to just “rebuild” the potentially affected part of your system inside a VM and see what happens.

    • tvcvt@lemmy.ml
      2 years ago

      This is great—I’ve somehow never noticed set -n before. Very helpful.

  • bionicjoey@lemmy.ca
    2 years ago

    It would depend. Bash allows for command substitution, so it’s possible that there are commands in a script where you would only know what they would do by running other lines in the script.
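A small sketch of the command substitution point, assuming a made-up helper called `pick_tool` and a path that is known not to exist. The name of the command on the last line is not written anywhere in the script; it only exists once the substitution has run:

```shell
#!/usr/bin/env bash
scratch=$(mktemp -d)

# Hypothetical helper: decides which command to use based on file system state.
pick_tool() {
  if [ -e "$1" ]; then echo "ls"; else echo "echo"; fi
}

# The command name is only known after pick_tool has actually been executed.
tool=$(pick_tool "$scratch/does_not_exist")
tool_output=$("$tool" "chosen at run time")
printf '%s\n' "$tool_output"
```

A static reader of this script sees `"$tool"` and cannot say whether that is `ls` or `echo` without evaluating `pick_tool` against the real file system.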

    Edit: also, this is treading dangerously close to the Halting Problem. Imagine for a moment that you succeeded in creating such a program, written in Bash. Now imagine you gave this program its own source code as input. What would you expect it to tell you?

    • wolf
      2 years ago

      This is not close to the halting problem, it is harder than the halting problem. ;-)

      • bionicjoey@lemmy.ca
        2 years ago

        Hmm, it depends on what exactly you are trying to achieve. If you want to know “what is the full set of commands that might be called by this program, not including their arguments or environment variables”, then it could be done. Bash has a well-defined rule for which token in a command line is the command (normally the first token that isn’t a variable assignment). That approach would still miss commands that run everything after them as another command (such as time or sudo), as well as anything passed to eval.
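The "first token that isn't an assignment" rule can be sketched roughly like this. The function name `first_command_word` is made up, and the sketch is deliberately naive: it splits on whitespace only, ignores quoting, pipelines and compound commands, and treats any word shaped like `NAME=...` as an assignment.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the first word of a simple command line
# that does not look like a VAR=value assignment.
first_command_word() {
  # Subshell so that disabling globbing does not leak into the caller.
  (
    set -f
    for word in $1; do
      case $word in
        [A-Za-z_]*=*) continue ;;            # looks like an assignment, skip it
        *) printf '%s\n' "$word"; exit 0 ;;  # first remaining word is the command
      esac
    done
    exit 1
  )
}

plain=$(first_command_word 'LANG=C sort file.txt')
wrapped=$(first_command_word 'sudo rm -rf /tmp/x')
dynamic=$(first_command_word 'eval "$cmd"')
printf '%s\n' "$plain" "$wrapped" "$dynamic"
```

The second and third calls show the blind spots mentioned above: the sketch reports `sudo` and `eval`, not the command that ends up running underneath them.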

        • wolf
          2 years ago

          Of course, if you narrow the problem scope you might be able to solve it. OP was talking about unrolling loops etc., which I interpreted as knowing the exact number of times each loop executes. AFAIK this would lead to a state explosion for nearly all non-trivial cases, and we already know that whether a program (incl. its loops) terminates is not decidable in general.
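To make the loop-unrolling problem concrete, here is a minimal sketch with a made-up function `count_iterations`. The loop body is identical in both calls, but the number of iterations (and so the "unrolled" command list) is decided entirely by the input:

```shell
#!/usr/bin/env bash
# Hypothetical helper: counts how many times the loop body runs
# for whatever arrives on standard input.
count_iterations() {
  n=0
  while IFS= read -r _line; do
    n=$((n + 1))
  done
  printf '%s\n' "$n"
}

three=$(printf 'a\nb\nc\n' | count_iterations)
one=$(printf 'only\n' | count_iterations)
printf '%s %s\n' "$three" "$one"
```

Unrolling this ahead of time would need the input to be known in advance, and a loop that waits on a condition that never becomes true has no finite unrolling at all.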