package ui templ About() { <div> <h1>about</h1> <p> Converge is a utility for troubleshooting builds on continuous integration servers. It solves a common problem where the cause of job failure is difficult to determine. This is complicated further by the fact that build jobs are usually run on a build farm where there is no access to the build agents or in more modern envrionments when jobs are run in ephemeral containers. </p> <p> With Converge it is possible to get remote shell access to such jobs. This works by configuring the build job to connect to a Converge server using an agent program. The agent program can be downloaded from within the CI job using curl or wget. Next, an end-user can connect to the Converge server, a rendez-vous server, that connects the client and server together based on a common identifier specified by both client and server. </p> <h2>how it works</h2> <p> The steps involved are as follows: <ul> <li>The agent connects to converge server and specifies an id, the so-called rendez-vous id, identifying the agent. The agent outputs an example command that can be used to connect to this agent. </li> <li>The agent sets up multiplexing of connections together with converge server which allows it to listen on incoming connections. </li> <li>This is used by the agent for running an embedded SSH server that is listening for incoming connection requests from clients. </li> <li>The client/user connects to the converge server using the command specified by the agent. This uses the same id as that used by the agent. The converge server can now match these ids an set up an end-to-end connection from client to agent. The role of converge server is simply in matching these ids and connecting the two websocket connections (from agent and from client) together by copying data between them as it arrives. </li> <li>The embedded SSH server now performs authentication, after successful login, a shell is spawned and the session is established. The shell can be any linux shell but also command prompt and powershell are possible. The connection is practically identical to a regular terminal connection. To achieve this, the shell is made to beiieve that it is connected to a terminal. </li> </ul> </p> <p>With regards to the rendez-vous id there are the following remarks: <ul> <li> If no id is specified than an id is generated. </li> <li> If the agent uses an id already in use by another agent, then converge server will generate a new id. </li> </ul> The agent will always print the id and command required to connect to it to standard output. </p> <h2>security</h2> <p> The setup is such that the connection from client (end-user) to server (agent on CI job) is end-to-end encrypted. The Converge server itself is no more than a bitpipe which pumps data between client and agent. </p> <p>Using authorized keys is a secure way of connecting. When running the agent, the authorized keys must be put in a file, allowing only the designated users to connect. The file containing authorized keys can also be edited during a session with the agent, allowing more people to be added when required without having to start over again. Using authorized keys is made easy through the <a href="usage.html">usage</a> page, which provides the exact commands to execute based on the target environment. If users are hesitant to use their public key it is also possible to generate a separate ssh key-pair using <code>ssh-keygen</code> and use that instead. </p> <p>To be able to use Converge, you must already have access to the configuration of a build job. Having that access means it is possible to execute any command on a build agent. The Converge agent is started by the build job and does not have any additional rights compared to what you could script in the continous integration job definition. </p> <p>Converge does not provide any stealth features to hide it. The public sessions page show all agents and clients including details about the clients and the agents. The idea is that it should be light-weight and easy to use. There is no reason to hide the fact that someone is debugging a continuous integration job. Also, all sessions are logged,both using standard kubernetes tooling such as (fluentbit/filebeat, and loki/elasticsearch depending on the environment). This logging includes only the details about the sessions, but not what the user is doing inside a session. Also, Converge provides a prometheus metrics endpoint which allows user sessions to be tracked over time after the fact. Thie data is also made accessible using a grafana dashboard. </p> <h2>SSH and SFTP</h2> <p> Both ssh and sftp are supported. Multiple concurrent sessions to same agent are allowed as well as multiple agents are also allowed. </p> <h2>timeouts</h2> <p> There is a timeout mechanism in the agent such that jobs do not hang indefinitely waiting for a connection. This mechanism is useful to make sure build agents do not keep build agents occupied for a long time. By default, the agent exits with status 0 when the last client exits after logging in. The timeout is an inactivity timeout. Activity is detected as follows: <ul> <li><b>ssh</b>: any key press is considered activity</li> <li><b>sftp</b>: any output from the server side is considered activity. This is done to make sure that longer downloads cannot be killed by a timeout. A simple <code>ls</code> command in an sftp session will also lead to activity since the server will output the result of the command. </li> </ul> </p> <p>When the user touches a .hold file, the agent keeps waiting for connections even after the last client logs out, taking into account the timeout. By default the agent exits when the last user has logged out. </p> <h2>remote shell usage</h2> <p> The agent supports a --shells command-line option by which a comma-separated list of shells can be prepended to the default search path for shells, e.g. <code>--shells zsh,csh,sh</code> (linux) or <code>cmd,powershell</code> for windows. </p> <p> The agent sets an <coder>agentdir</coder> environment variable that points to the directory where the agent is running. </p> <p>The user will get notifications from the agent any time something important happens such as the session being close to timeout. </p> <h2>other tools</h2> <p>Using available existing tools such as <a href="https://github.com/namespacelabs/breakpoint">breakpoint</a> in combination with a websocket tunneling tool such as <a href="https://github.com/erebe/wstunnel">wstunnel</a> a similar solution can be obtained. There are however some problems with these solutions that converge is trying to address: </p> <p> <ul> <li>Breakpoint uses an embedded SSH server which is a really good idea but uses the QUIC protocol for connecting to a rendez-vous server. The rendez-vous server then exposes a random port for every client. This make deployment on kubernetes really hard where fixed ports must be used and QUIC is also not a widely supported protocol.</li> <li>The problem with the random ports can be solved by using wstunnel running together with breakpoint server in a kubernetes pod, where wstunnel can forward traffic over an external websocket connection to the local random port that breakpoint server is listening on.</li> <li>breakpoint leaves it open on how users install the breakpoint executable (agent). </li> <li>Because of the hacky nature of this setup, it is very difficult for users to use and troubleshoot when things go wrong. </li> </ul> </p> Converve server addresses these issues in the following ways: <ul> <li>Use the websocket protocol both for agents and for clients, providing a fixed port and a supported protocol for kubernetes deploymment. Websockets are also supported by kubernetes ingress controllers so this makes it easy to deploy on kubernetes. </li> <li>Providing online documentation where the instructions take into account the hostname and protocol where converge is running allowing users to cut and paste instructions that can be used without modification. In the usage page the users can even generate the correct agent startup commands and client connection commands based on the type of shell they are connecting to. </li> <li>Converge server provides out of the box downloads of required software. This makes sure client and server are always up to date and can be downloaded in any continuous integration job without having to package the required executables in an ad-hoc way. In addition a protocol version check is done. </li> <li>User-friendly error messages can be given to users in most cases when things do not work out because of <code>wsproxy</code>. This is an SSH proxy command that communicates with converge and provides additional information to the user. </li> <li>A live screen showing the current sessions that are running. The sessions webpage provides additional feedback about the running sessions. </li> <li>Interactivity in the user's session with notifications about timeouts and a very simple inactivity timeout mechanism. </li> <li>Possibility for the user to define the remote shell to use. </li> <li>Support for unix like bash shells and command prompt and powershell. </li> </ul> <p> </p> </div> } templ AboutTab() { @BasePage(1) { @About() } }