Lower GPT-4 cap?

mozz@mbin.grits.dev · 7 months ago

Lower GPT-4 cap?

viking@infosec.pub · 7 months ago

With the API I’m paying less than 10% of the subscription fee.

Just how massive are we talking about?

Grimy@lemmy.world · 7 months ago

Don’t you need a subscription to use the gpt-4 API?

Last I checked, gpt-4 is 0.02 for 1000 tokens. Every message in chat also has a summary of the whole convo plus the most recent messages. I feel like that’s busting the 10% pretty quickly if it’s intensive daily use.

viking@infosec.pub · 7 months ago

The tokens don’t have a fixed price, the 2 cents are an average depending on the complexity. I’m using it moderately almost every day, and have rarely paid more than $2/month.

And no subscription needed, it’s prepaid.

habanhero@lemmy.ca · 7 months ago

You don’t need a subscription. You just buy credits and it’s pay-as-you-go.

Source: me as that’s how I’m using it. No subscription fee / recurring payment needed.

mozz@mbin.grits.dev · 7 months ago

You need a subscription either way

GPT-4 costs from 0.01 to 0.12 per 1000 tokens depending on some details – but regardless of that, it’s not like chat type chat where you might have tons of small messages which each depend on the full 32k or whatever of context; each singular message usually has an explicit context for the stuff you want to tell it, and no more than 50-100 of them per day to implement your thing at most, so like 50 cents to a few dollars a day even at an obscene level of usage. Might be more than $20/month in total but more likely less.

habanhero@lemmy.ca · 7 months ago

You don’t need any subscriptions to use the API. It’s pay-as-you-go.

https://openai.com/pricing

mozz@mbin.grits.dev · 7 months ago

Oh, I was meaning in terms of “you have to pay for it” – yes, you’re 100% right, you don’t have to be on the $20/month thing in order to use GPT-4 on the API.

brbposting@sh.itjust.works · 7 months ago

Do you use DALLE via API?

TypingMind w/lifetime license works beautifully for cheap simultaneous GPT-4 Turbo & Claude-3-Opus when it comes to text. And can upload images. Generating would be interesting, don’t believe it can do it.

viking@infosec.pub · 7 months ago

Nope I’m only interested in the word processor / coding bit, so GPT 4 is all I need. I’m accessing it through https://github.com/Bin-Huang/chatbox.

AwkwardLookMonkeyPuppet@lemmy.world · 7 months ago

So you just set up your own interface and then make requests there? I did set up a MERN stack app for chatting with it as an experiment, but I never did anything else with it after that.

viking@infosec.pub · edit-2 7 months ago

Not even that, I’m using chatbox where you can simply add the API code in the settings and be done with it.

The software integrates a bunch of other AI’s, some of them in Chinese, but I’ve removed most from the quick access menu and only really work with GPT.

mozz@mbin.grits.dev · edit-2 7 months ago

This morning was 177kb in and out, so call it 2/3 of it is input and 1/3 output, would mean roughly:

118k bytes input ≈ 29k tokens = 29 cents 59k bytes output ≈ 15k tokens = 45 cents

I think you may be correct in your assessment

AwkwardLookMonkeyPuppet@lemmy.world · 7 months ago

So you pay for it and you received that message? I pay for it and paste hundreds of lines of code into it, making request after request for hours at a time and have never received that message. Maybe the system was overloaded? I’ve found that when I try using it through the app it will error out sometimes, and take a long time other times, but I’ve never encountered that on a desktop.

DavidGarcia@feddit.nl · 7 months ago

I unsubscribed because they blasted me with 5 min “are you a robot” tests for every request.

Now I’m using Perplexity, that sucks too but at least it is usable.

mozz@mbin.grits.dev · 7 months ago

Don’t forget “we have detected suspicious activity coming from your system” and “there was an error constructing a response” (or whatever the wording is) which form roughly one-third and one-third respectively of the responses I get from it, along with the one-third that are successful responses.