Good to know. I went to recheck because I could have sworn I didn't see anything about that when I looked earlier, but they now say 3.2 Tbps InfiniBand... not sure if they changed it or I was just blind.
Hetzner has excellent connectivity: https://www.hetzner.com/unternehmen/rechenzentrum/
They are always working to increase their connectivity. I'd even go so far as to claim that in many parts of the world they outperform certain hyperscalers.
I used to have a dedicated server there, and what happened to me is that my uploads were fast but my downloads were slow. Looking at an MTR trace, it was clear that the return route to me was different (perhaps a cheaper transit path?). With Google Drive, for example, I could always max out my gigabit connection. Same with rsync.net.
I also know that some cheaper home ISPs cheap out on peering.
Now, this was some time ago, so things might have changed, just as you suggested.
Less than $50 will be really hard to find, at least in any kind of professional setup (so not hosted in a random basement ;) ).
Our lowest at Genesis Cloud at this time are instances with an RTX 3060 Ti for $0.20/hour, which adds up to about $146/month ( https://www.genesiscloud.com/pricing#nvidia3060ti )
That said, this includes free storage and no egress fees, and it has a lot more power than a Jetson.
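For anyone comparing hourly cloud pricing against a monthly budget, the conversion is just rate × hours. A quick sketch (the 730-hour figure assumes the instance runs 24/7 for an average month):

```python
# Convert an hourly cloud GPU rate to an always-on monthly cost
hourly_rate = 0.20        # $/hour (RTX 3060 Ti example above)
hours_per_month = 730     # 8760 hours/year / 12 months
monthly_cost = hourly_rate * hours_per_month
print(f"${monthly_cost:.0f}/month")  # -> $146/month
```

With per-minute billing you only pay for hours actually used, so a part-time workload comes in well under that figure.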
If you need to optimize for low-cost hosting, have you already checked whether you actually need a GPU for your use case? Modern CPUs have some impressive capabilities.
Here we have another instance of self-promotion that does not align with what GGP mentioned. Thank you, Genesis and Lambda, for promoting yourselves in a startup thread. Given your long-standing presence in this industry, I would expect better engagement from you.
You were doing so well, self promoting and engaging with the community on your post earlier. I didn't expect to see you stoop to this level of commenting.
Maybe it's time to step away from the keyboard for a while?
I appreciate and respect every user who contributes by asking questions, providing feedback, or sharing suggestions. However, it is disappointing and unreasonable to witness self-promotion from companies that have been established in this industry for a considerable period of time under a startup thread.
Moreover, the fact that their self-promotion does not align with the intent of the original discussion and GGP reveals their purpose: their primary goal is not genuinely assisting or finding a solution.
In such cases, as you can imagine, it's challenging for me to maintain respect.
Many years ago at university I got to play with a system built by a former student that used wooden blocks (children's toys) to build structures on top of a table monitored by Kinect cameras. It would then identify features and generate a floor plan.
Now imagine combining this! It would allow for a whole new level of exploration of ideas.
Can you share a bit what setup you use to generate the images? Do you run your own GPUs?
Competitive prices (billing by the minute, only pay when you actually run an instance).
High reliability (professional DCs, customized hardware to suit requirements).
Good connectivity (traffic is also free, no in-/egress fees).
High security level (full VMs with dedicated GPUs and proper separation of customers, instead of shared hosts with Docker).
Free storage.
A great support team.
Green energy (no greenwashing by carbon offsetting, we use energy sources that are renewable and carbon free at the source (geothermal/hydro)).
I could go on...
Would love it if you just tried our services; after sign-up there are free credits available for risk-free testing.
While I do not have an A100 handy right now, I have an instance running on Genesis Cloud with 4x RTX 3090.
A quick, very unscientific test using oobabooga/text-generation-webui with some models I tried earlier gives me:
* oasst-sft-7-llama-30b (spread over 4x GPU): Output generated in 28.26 seconds (5.77 tokens/s, 163 tokens, context 55, seed 1589698825)
* llama-30b-4bit-128g (only using 1 GPU as it is so small): Output generated in 12.88 seconds (6.29 tokens/s, 81 tokens, context 308, seed 1374806153)
* llama-65b-4bit-128g (only using 2 GPU): Output generated in 33.36 seconds (3.81 tokens/s, 127 tokens, context 94, seed 512503086)
* llama (vanilla, using 4x GPU): Output generated in 5.75 seconds (4.69 tokens/s, 27 tokens, context 160, seed 1561420693)
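The tokens/s figures above are simply tokens generated divided by elapsed wall-clock time; a quick sanity check over the first three runs:

```python
# Recompute tokens/s from the (tokens, seconds) pairs reported above
runs = [
    ("oasst-sft-7-llama-30b", 163, 28.26),
    ("llama-30b-4bit-128g",    81, 12.88),
    ("llama-65b-4bit-128g",   127, 33.36),
]
for name, tokens, secs in runs:
    # Matches the reported 5.77 / 6.29 / 3.81 tokens/s
    print(f"{name}: {tokens / secs:.2f} tokens/s")
```

Note these single-shot numbers also fold in prompt-processing time and depend on context length, so treat them as rough indicators rather than a benchmark.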
They all feel fast enough for interactive use. If you do not have an interface that streams the output (so you can see it progressing), it might feel a bit odd to regularly wait ~30 s for the whole output chunk.
If you want to try one, reach out to me (email in profile). We rent those out in the cloud, which would allow you to confirm performance before buying one for local use.
Source: Was personally involved in design of that deployment.