How do you containerize stuff you install from source in a way that you can completely remove later?

j4k3@lemmy.world · 1 year ago

How do you containerize stuff you install from source in a way that you can completely remove later?

DryTomatoes@lemmy.world · edit-2 1 year ago

I did Linux From Scratch recently and they have a brilliant solution. Here’s the full text but it’s a long read so I’ll briefly explain it. https://www.linuxfromscratch.org/hints/downloads/files/more_control_and_pkg_man.txt

Basically you make a new user with the name of the package you want to install. Login to that user then compile and install the package.

Now when you search for files owned by the user with the same name as the package you will find every file that package installed.

You can document that somewhere or just use the find command when you are ready to remove all files related to the package.

I didn’t actually do this for my own LFS build so I have no further experience on the matter. I think it will eventually lead to dependency hell when two packages want to install the same file.

I guess flatpaks are better about keeping libraries separate but I’m not sure if they leave random files all over your hard drive the way apt remove/apt purge does. (Getting really annoyed about all the crud left in my home dir)

FarraigePlaisteach@kbin.social · 1 year ago

That’s clever. It should work on any system, shouldn’t it?

j4k3@lemmy.world · 1 year ago

Thanks for the read. This is what I was thinking about trying but hadn’t quite fleshed out yet. It is right on the edge of where I’m at in my learning curve. Perfect timing, thanks.

Do you have any advice when the packages are mostly python based instead of makefiles?

doot@social.bug.expert · 1 year ago

for python, a bunch of venvs should do it

DryTomatoes@lemmy.world · 1 year ago

This method should work with any command that’s installing files on your disk but it’s probably not worth the headache when virtual environments exist for python.

j4k3@lemmy.world · 1 year ago

Python, in these instances, is being used as the installer script. As far as I can tell it involves all of the same packaging and directory issues as what make is doing. Like, most of the packages have a Python startup script that takes a text file and installs everything from it. This usually includes a pip git+address or two. So far, just getting my feet wet to try out AI has been enough for me to overlook what all is happening behind the curtain. The machine is behind an external whitelist firewall all by itself. I am just starting to get to the point where I want to dial everything in so I know exactly what is happening.

I’ve noticed a few oddball times during installations pip said something like “package unavailable; reverting to base system.” This was while it is inside conda, which itself is inside a distrobox container. I’m not sure what “base system” it might be referring to here or if this is something normal. I am probing for any potential gotchas revolving around python and containers. I imagine it is still just a matter of reading a lot of code in the installation path.

voluntaryexilecat@lemmy.dbzer0.com · 1 year ago

I use a mixture of systemd-nspawn and different user logins. This is sufficient for experimentation, for actual use I try to package (makepkg) those tools to have them organized by my package manager.

Also LVM thinpools with snapshots are a great tool. You can mount a dedicated LV to each single user home to keep everything separated.

demesisx@infosec.pub · 1 year ago

Lily33@kbin.social · 1 year ago

NixOS containers could do what OP’s asking for, but it’ll be trickier with just nix (on other distro). It’ll handle build dependencies and such, but you’ll still need to keep your home or other directories clean some other way.

demesisx@infosec.pub · 1 year ago

OP could use flakes to create these dev environments and clean them up without a trace once done.

Lily33@kbin.social · 1 year ago

Any files created by programs running in the dev environments will remain.

demesisx@infosec.pub · 1 year ago

nix-collect-garbage

Lily33@kbin.social · 1 year ago

Does NOT delete any files that were written to, for example, ~/.local by anything ran in the dev shell.

jet@hackertalks.com · 1 year ago

Qubes: you can install software inside of its own disposable VM. Or it can be a persistent VM we’re only the data in home persists. Or it can be a VM where the root persists. You have a ton of control. And it’s really useful to see what’s changed in the system.

All the other solutions here are talking about in the operating system, qubes is doing it outside the operating system

Kangie@lemmy.srcfiles.zip · 1 year ago

I use Gentoo where builds from source are supported by the package manager. ;)

Overall though, any containerisation option such as Docker / Podman or Singularity is what I would typically do to put things in boxes.

For semi-persistent envs a chroot is fine, and I have a nice Gentoo-specific chroot script that makes my life easier when reproing bugs or testing software.

j4k3@lemmy.world · 1 year ago

Wait. Does emerge support building packages natively when they are not from Gentoo?

Most of the stuff I’m messing with is mixed repos with entire projects that include binaries for the LLMs, weights, and such. Most of the “build” is just setting up the python environment with the right dependency versions for each tool. The main issues are the tools and libraries like transformers, pytorch, and anything that interacts with CUDA. These get placed all over the file system for each build.

Reborn2966@feddit.it · 1 year ago

it it does not need a gui, use docker and log in into it. do the stuff and when you are done, docker rm and everything disappear.

you can enable cuda inside the container, follow the docs for that.

bonus point, vs code can open itself inside a container.

meteokr@community.adiquaints.moe · 1 year ago

You can use GUI stuff in docker as well, though it can be a big fiddly to setup.

Gamey@lemmy.world · 1 year ago

I think Podman should do a good job but I never used it myself, Distrobox is build on it and a lot easier to use so that’s what I would recommend!

Sims@lemmy.ml · 1 year ago

Haven’t tried it (and don’t use docker), so a wild shot: https://github.com/jupyterhub/repo2docker

‘repo2docker fetches a repository (from GitHub, GitLab, Zenodo, Figshare, Dataverse installations, a Git repository or a local directory) and builds a container image in which the code can be executed. The image build process is based on the configuration files found in the repository.’

That way you can perhaps just delete the docker image and everything is gone. Doesn’t seem to depend on jupyter…

user8e8f87e@berlin.social · 1 year ago

@j4k3 https://github.com/89luca89/distrobox/blob/main/docs/useful_tips.md#create-a-distrobox-with-a-custom-home-directory

Justin@apollo.town · 1 year ago

I’ve never worried about this but I’d use Flatpak. The whole install goes in a specific directory and the metadata/config/data files go in their own specific directory.

akik@lemmy.world · edit-2 1 year ago

export LDFLAGS="-Wl,-rpath=/sw/app/version/lib"
./configure --prefix=/sw/app/version
make
sudo make install

socphoenix@midwest.social · 1 year ago

Chroot would be fine for this and not overly complicated

Sneaky Bastard@feddit.de · 1 year ago

Not sure if that’s a good idea but if you use Fedora, you also have your root on a BTRFS partition after a default installation. You could utilize the snapshot features of BTRFS to roll back after testing.

triplenadir@lemmygrad.ml · 1 year ago

software like stow keeps track of files installed, and helps you remove it later

gabriele97@lemmy.g97.top · 1 year ago

Give a look at distrobox