[D] How do you set up a multi-user data-science workstation?
I’m becoming the de-facto sysadmin for a single machine, to be used as a server for a small group of people. I’ve been using Linux for almost a decade now, but I’ve never set up a server for multiple users.
I imagine I’m not the only person who has done this. My plan is roughly as follows:
- Making a new user for everyone using
useradd. - Setting everyone up to use
sshand public-private key pairs so they can connect. (Everyone is on Mac OSC and/or some flavor of Linux.) - Installing
anacondato/opt, and modifying everyone’s.bashrc(or, alternatively, modifying/etc/profileif that’s better?) - Setting up a default environment with
numpy,jupyter-lab, and other common data science tools. - Showing everyone how to SSH tunnel
jupyter-laband run it in their browser. - Basic security measures (
fail2ban, no SSH password auth, etc.) and checking with my organizations IT to make sure things work properly.
I’m still wondering… * Lock the base conda environment, so that people have to make child environments and can’t modify the base. * Is this the best way to set up Anaconda for multiple users? Is /usr/bin a better place to put it? Etc. * Would there be any bottlenecking issues with multiple people using ssh and tunneling on the same port?
submitted by /u/gnulynnux
[link] [comments]