- cross-posted to:
- technology@lemmy.world
- cross-posted to:
- technology@lemmy.world
AI-infused hiring programs have drawn scrutiny, most notably over whether they end up exhibiting biases based on the data they’re trained on.
Every algorithm in the world is suddenly “AI” now
It’s like 10-15 years ago suddenly all the companies were claiming they used big data. Unfortunately it’s just buzz words to entice investors or lazy reporting.
They can ‘prove’ they don’t explicitly train the models on race or gender but that doesn’t really prove anything. A model will inevitably take into account data that it will correlate to race or gender- names, zip codes, education and financial history, etc, and those correlations will result in similarly biased decisions that regular human racism and sexism produce. Weeding that out completely may not even be possible.
I figure you’d audit it by examining the results, and if bias isn’t detectable in the results then I’d argue that’s at the very least still better than the human-based systems we’ve been relying on up til now.
Unequal outcomes isn’t evidence of bias.
Not inherently, but things can be tested.
If you have a bunch of otherwise identical résumés, with the only difference being the racial connotation of the name, and the AI gives significantly different results, there’s an identifiable problem.
That makes sense: empirical tests of the AI as you describe.
What would unbiased results look like?
When the demographics of the output are roughly equivalent to the demographics of the input. If ten men and fifty women apply, and eight men and two women are hired, that is worth investigating.
That would be a pretty extreme bias to have, so yeah that would make sense. If it’s not so drastic it might be harder to spot by just looking at the results.
It’s a flag, not the entire investigation. If something seems suspicious, that’s the queue to investigate, not just immediately slap cuffs on someone.
Right, that’s why I was trying to ask you for your opinion on what the threshold for “investigation worthy” results
I’m not a policy expert, author of the bill, or in charge of the department that will lead these investigations. Even if I were an expert on the subject, what I’d do and what this department will do aren’t likely to be the same.
I just support civilian oversight and audits of these algorithms and LLMs as they take up a more prominent position in hiring and firing.
I think part of this is basically asking companies to keep a record of what information is used.
Hey, I am a machine learning engineer that works with people data. Generally you measure bias in the training data, the validation sets, and the outcomes ( in an ongoing fashion - AIF 360 is a common library and approach ). There are lots of ways to measure bias and or fairness. Just checking if a feature was used isn’t considered “enough” by any standards or practitioner. There are also ways to detect and mitigate some of the proxy relationships you’re pointing to. That being said, I am 100% skeptical that any hiring algorithm isn’t going to be extremely bias. A lot of big companies have tried and quit because despite using all the right steps the models were still bias https://www.reuters.com/article/us-amazon-com-jobs-automation-insight-idUSKCN1MK08G. Also many of the metrics used to report fairness have some deep flaws ( disprate impact ).
All that being said the current state is that there are no requirements for reporting so vendors don’t do the minimum 90% of the time because if they did it would cost a lot more and get in the way of the “AI will solve all your problems with no effort” narrative they want to put forward so I am happy to see any regulation coming into place even if it won’t be perfect.
The so-called AI parses your resume looking for keywords that match the job description. They anonymize and provide a summary. I don’t think there is much room for bias. Maybe if you use crappie software that doesn’t make the summary anonymous.
BTW write your resume for the algorithm not the manager.
AI resume screeners are very much at risk of bias. There have been stories about exactly this in years past. The ML models need to be trained, so they get fed resumes of candidates that were hired and not hired so the model can learn to differentiate the two and make decisions on new resumes in the future. That training, though, takes any bias that went into previous decisions and brings it forward.
From the Amazon I linked above, the model was prioritizing white men over women and people of color. When you think back to how these models were trained, though, that’s exactly what you’d expect to happen. No one was intentionally introducing bias to the AI process, but software teams have historically been very male and white, and when referrals and references come into play, those demographics were further emphasized. And then let’s not pretend that none of those recruiters or hiring managers were bringing their own bias to the table.
If you feed that into your model as it’s training data, of course the model is going to continue to favor white men, not because it’s actually looking for men, but because resumes that men typically submit are the kinds that get hired. Then they found that resumes that mention a professional women’s organization or historically black or women only colleges were typically not hired. The model isn’t “thinking” about why that is - it just knows that when certain traits exist, the resume is ranked lower, so it replicates that.
Building a truly unbiased AI system is actually incredibly difficult, not the least due to the fact that the demographics of the data scientists working on these systems are themselves predominantly male and white themselves. We’ve also seen this issue in the past with other AI systems, including facial recognition systems, where these systems built by teams of white men can’t seem to make reliable determinations when looking at a picture of a black woman (with accuracy rates 20-30% lower for black woman compared to white men).
It depends how “bias” has been defined. The Ibram Kendi definition is unequal outcomes. Since no two groups are identical, such definitions require bias to be “unbiased.” Australia tried to employ blind recruitment and hired fewer women and minorities. That’s true unbiased recruitment, but I suspect it wouldn’t be praised today.
To your last point.
Use every single buzzword in any job listing on your resume. DO NOT CONJUGATE. Their “AI” is looking for “Rapid Analysis” not “Analyzes rapidly.”
We’ll the problem with that is you’d have to prove your hiring requirements aren’t exclusionary. Which isn’t going to happen until people start to examine their biasses on a societal level. The problem with AI isn’t that it’s biased. The problem is that both the dataset and the task given are biased. Which will always result in a biased system
It’s also not clear how the law will be enforced or to what extent.
No shit. Isn’t that the point? Use outrage to justify the growth of an impenetrable body of law addressing all social and economic behavior, then selectively enforce subjective interpretations to satisfy powerful groups and remain in power. So it goes for any population center whose rapid growth creates the illusion of independence.
Can we sell NYC to Canada yet? We’d make a bundle, they wouldn’t even be mad, and I’d sleep a lot better with the border of a superpower between me and a stack of nine million people who think privacy is a sin.
Isn’t the whole point of AI decision making to provide plausible deniability for these sort of things?
Yes, but if you train an AI on racist/sexist data, it will naturally do the same.
Depends how the law is applied…
Kinda like if a self driving car kills someone, who is liable, driver, manufacturer, seller?
I guess you pay insurance and they take on liability is another option.
Well, when AI takes over everything, at least it won’t be racist, unlike current police force.
It will disrespect and threaten all of us equally bad.