Launch HN: GradientJ (YC W23) – Build NLP Applications Faster with LLMs

ccooffee · on April 4, 2023

I have no experience with LLMs, so here's some website feedback to be taken with a chunk of salt:

1. The Youtube video at the bottom of your page is very tiny and cannot be fullscreened without first loading the video in Youtube. The video itself merely shows some basic-seeming workflows with some (to-me) terrible background music. The video does not seem to emphasize anything. It's just...wandering around a web application...

2. Between reading the webpage content and watching the video, I don't have a good idea of what you are actually offering as a product and why it is so valuable. The pitch summary in this HN post is much more helpful than your website.

3. Your website is not very accessible at the moment due to low contrast and overuse of opacity for style. I can barely understand what your images are attempting to convey. Your app doesn't appear very accessible according to the Youtube video, again due to low contrast among colors.

mcconaughey · on April 4, 2023

Daniel, co-founder of GradientJ here!

Appreciate the feedback. We have a more detailed demo video that is linked in the YT description. Will be sure the resolution is adequate.

We will definitely work on improving the website and demos. Want it to be easily accessible for everybody.

danvayn · on April 4, 2023

Don't discount yourself; your feedback is true regardless of LLM experience.

antonioevans · on April 4, 2023

This application is highly intriguing. It holds potential as an excellent instrument for experimenting with models and fine-tuning them. However, the $500 price tag for simply trying it out is excessively expensive and inhibits accessibility. I cannot even test things you had in your video.

ttul · on April 4, 2023

If their target market is enterprise, $500 for trying it out is not going to be a huge barrier. Perhaps their strategy is to ensure that the people trying out their app are real buyers?

siva7 · on April 5, 2023

The barrier is getting those $500 approved for just trying something out.

IVCrush · on April 4, 2023

First time we're really opening up access so still iterating on what's open to everyone.

Happy to get give you and anyone else full access if you shoot an email to: oscar at gradientj.com

KRAKRISMOTT · on April 4, 2023

Where do we upload/download the actual LLM models? What's your privacy policy on the finetuned deltas?

IVCrush · on April 4, 2023

Since most of our early users are just using foundational LLM models over API (like OpenAI models), we're still working on the best way to manage uploading custom weights and NLP models. However, for users that need it asap, we can upload and download fine-tuned weights/architectures manually.

In terms of privacy policy, we haven't had many users doing much with fine-tuned deltas, but we think of it the same way we think of all model data: All inference and benchmarking data belongs to the user and we don't aggregate it across other users or shared between orgs.

anoy8888 · on April 5, 2023

Isn’t LLM a good NLP? What is the need to build another not as good NLP. Just curious

IVCrush · on April 5, 2023

If I'm understanding correctly, the question is, why would you ever need to move off just the base LLMs if they're already fantastic at NLP tasks?

The main reason why you'd want to move from a prompted LLM to a smaller, fine-tuned NLP model (even if it's still an LLM) is usually to save latency and money on compute.

Out-of-the-box, the popular LLMs are pretty great for most NLP tasks. Because of this, you can quickly bootstrap a first version of your NLP applications (text analytics, unstructured data extraction, etc.) using just prompting.

For a lot of these tasks, though, you don't need the full expressive power of the base LLM. So the idea is you take the data you collect from the first prompted version and use it to either fine-tune a smaller LLM, or even a more simple, traditional model.

These smaller models are usually faster and cheaper to run which can save you a lot of money at scale.

paulgb · on April 5, 2023

Thanks for the explanation, this is very interesting! Just so that I’m sure I’m understanding, this doesn’t have to do with what GradientJ currently offers, or does it?

IVCrush · on April 5, 2023

It does! Though right now we're focused on what teams need to get that first version out the door, ultimately, we want to offer people a platform that lets them manage their NLP app throughout its lifecycle (LLM or otherwise).

Going through that process of idea -> first model -> optimized model is the core "loop" of the LLM lifecycle. The problem is to do that effectively you need to set up the right infrastructure to both aggregate the data going into and coming out of your model AND set up benchmarks to run experiments.

Having this data-eval engine set up is what lets you easily (or even autonomously) evaluate whether it makes sense to switch from that prompted model to a smaller model.

Right now, GradientJ lays some of the rudimentary groundwork for this loop by letting you set up testing for prompt-based LLM models and automatically aggregate the input/output data that goes through your model in production. We've got some basic fine-tuning capabilities, but really we're still working on refining the tools to use that data to evaluate across multiple NLP models (both LLM and non-LLM).

paulgb · on April 5, 2023

I see, very cool!

artembugara · on April 5, 2023

They literally got me signing a contract after explaining that haha.

anoy8888 · on April 5, 2023

Thanks for explaining

redskyluan · on April 5, 2023

what will be the target user of this service?

mcconaughey · on April 5, 2023

Developers and data scientists working on NLP applications in their companies use the platform. We’ve found that Product Managers also use the UI to collaborate on prompt development.

We’re open to teams and companies of all sizes so we can learn how the problems evolve at different scales.