Fix LMF Negative Sampling to Sample Uniformly#747
Fazel94 wants to merge 3 commits into benfred:main
Conversation
benfred left a comment
thanks for the fixes!
I'll admit I haven't ever looked closely at the LMF code, and I should have reviewed it previously.
I'm pretty positive on most of these changes - but I have my doubts about switching from popularity based negative sampling to uniform sampling, since I've seen that change hurt performance in previous experiments with BPR (where I ended up having to switch to popularity based sampling to match performance for BPR in lightfm). Would it be possible to run a quick experiment to verify that this change helps out performance?
The other fixes look great, and I appreciate how deeply you've dug into this here
    for _ in range(n_factors):
        deriv[_] -= reg * user_vectors[u, _]
        deriv_sum_sq[u, _] += deriv[_] * deriv[_]
    # Sample uniformly from [0, n_items); reject any item the user has
I'm unsure about this one change to move to uniform sampling.
When I was originally adding the BPR code here - I noticed that uniformly sampling the negative items performed much worse than sampling based off popularity. The issue seemed to be that because the positive samples are biased towards popularity - sampling uniformly for the negatives produced much weaker negatives.
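The bias the comment describes can be seen in a small NumPy sketch (the interaction matrix and counts here are made up for illustration): drawing a random offset into the global CSR `indices` array yields items in proportion to their interaction counts, while drawing ids uniformly from `[0, n_items)` treats a never-interacted item the same as the most popular one.

```python
import numpy as np
from scipy.sparse import csr_matrix

# Toy 4-user / 4-item matrix: item 0 is very popular, item 3 has no
# interactions at all.
user_items = csr_matrix(np.array([
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
]))

rng = np.random.default_rng(0)
n_items = user_items.shape[1]

# Uniform sampling: every item id is equally likely, including item 3.
uniform = rng.integers(0, n_items, size=10_000)

# Popularity-weighted sampling: a random offset into the global CSR
# indices array draws each item in proportion to its interaction count,
# so item 3 can never be drawn.
popular = user_items.indices[rng.integers(0, user_items.nnz, size=10_000)]

uniform_counts = np.bincount(uniform, minlength=n_items)
popular_counts = np.bincount(popular, minlength=n_items)
```

With popular positives and uniform negatives, the model mostly gets asked to rank a popular item above an obscure one, which is an easy (weak) training signal.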
That could explain the underperformance of LMF relative to ALS and BPR in my tests.
Should I implement popularity-based sampling in this PR?
    i = rng.generate(thread_id)
    # indices[indptr[u]:indptr[u+1]] is sorted (guaranteed by fit()),
    # so binary_search gives O(log k) rejection per sample.
    while binary_search(&indices[indptr[u]], &indices[indptr[u + 1]], i):
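The rejection check in the diff can be sketched in pure Python/NumPy (the function name and signature are illustrative, not the Cython code in the PR): because each user's column indices are sorted, membership can be tested with a binary search instead of a linear scan.

```python
import numpy as np

def sample_negative(rng, indptr, indices, u, n_items):
    """Draw an item id that user u has NOT interacted with.

    Assumes indices[indptr[u]:indptr[u + 1]] is sorted, so membership can
    be checked with a binary search (np.searchsorted) in O(log k) per try.
    """
    liked = indices[indptr[u]:indptr[u + 1]]
    while True:
        i = int(rng.integers(0, n_items))   # uniform over the catalogue
        pos = np.searchsorted(liked, i)     # O(log k) membership test
        if pos == len(liked) or liked[pos] != i:
            return i                        # i is a true negative for u

# Example: user 0 interacted with items 0 and 2 of a 4-item catalogue,
# so only items 1 and 3 can ever be returned.
indptr = np.array([0, 2])
indices = np.array([0, 2])
rng = np.random.default_rng(1)
draws = {sample_negative(rng, indptr, indices, 0, 4) for _ in range(200)}
```

Rejection sampling stays cheap as long as each user has interacted with a small fraction of the catalogue, which is the usual implicit-feedback regime.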
thanks for adding this check here - I had previously noticed in the BPR code that verifying negative samples was essential #103 (comment), and this should have been in place here too
Bug A: item_vectors.shape[1] returned n_factors + 2, not n_items. Fix: use shape[0].

Bug B: the RNGVector range was [0, nnz - 1] and i = indices[index] only samples from already-interacted items (popularity-biased, and never zero-interaction items). Fix: sample i directly from [0, n_items).

Bug C: the outer negative-sample loop and the inner factor loops all used _ as the loop variable. Each inner loop left _ == n_factors, so the outer loop ran at most once regardless of neg_prop. Fix: use f for the inner factor loops.

Bug D: a single RNG seeded with nnz - 1 was shared by the user-update pass (which needs item IDs) and the item-update pass (which needs user IDs). Fix: two separate RNGVector instances with the correct ranges.
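Bug C is easy to miss because it only bites in compiled code: in Cython, `for _ in range(n)` compiles to a C for-loop over `_` itself, so an inner loop reusing `_` corrupts the outer counter (plain CPython is immune, since `range()`'s iterator drives the loop). A minimal reproduction emulating the C semantics with while-loops, using the parameter values from the benchmark in this PR (neg_prop=30, n_factors=32):

```python
def buggy_outer_iterations(neg_prop, n_factors):
    # Both loops share `_`, mimicking the compiled C for-loops.
    iterations = 0
    _ = 0
    while _ < neg_prop:           # outer negative-sample loop
        iterations += 1
        _ = 0
        while _ < n_factors:      # inner factor loop clobbers `_`
            _ += 1                # leaves _ == n_factors on exit
        _ += 1                    # outer increment resumes from n_factors
    return iterations

def fixed_outer_iterations(neg_prop, n_factors):
    # The fix: the inner loop gets its own variable `f`.
    iterations = 0
    _ = 0
    while _ < neg_prop:
        iterations += 1
        f = 0
        while f < n_factors:
            f += 1
        _ += 1
    return iterations
```

With n_factors >= neg_prop, as here, the buggy version runs the outer loop exactly once; the fixed version runs it neg_prop times.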
Force-pushed from 721b42e to 31bc5bf
Switched back to popularity-weighted sampling: the RNG now generates an offset into the global CSR indices array, so negatives are drawn in proportion to interaction counts. I ran both variants on MovieLens-100k (factors=32, iterations=30, neg_prop=30, same seed): popularity wins, consistent with what you saw in BPR. Script is at
Force-pushed from 31bc5bf to bda49d6
Fix four bugs in lmf_update that gutted negative sampling
The negative sampling loop in lmf_update had four bugs that together
meant the model was barely seeing true negatives:
- item_vectors.shape[1] returned n_factors + 2 instead of n_items,
  capping the negative loop at ~34 regardless of catalogue size
- the RNG only ever drew already-interacted
  items, never zero-interaction ones, and popularity-biased on top
- the outer loop variable _ got clobbered by the inner factor loops,
  so it ran once instead of neg_prop * seen_items times
- a single RNG seeded with nnz - 1 was shared by the user-update and
  item-update passes; each needs its own with the correct range

Fix: shape[0] for n_items, sample item/user IDs directly from [0, n)
with rejection for positives, distinct loop variables, two RNGVectors.
Added five regression tests covering each bug and the overall cluster
recovery behavior.
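In that spirit, a regression test for the sampling-range bug might look roughly like this (the test name and toy matrix are illustrative, not the actual tests in the PR): with a correct sampler, items nobody interacted with must be reachable, and positives must always be rejected.

```python
import numpy as np
from scipy.sparse import csr_matrix

def test_negatives_cover_zero_interaction_items():
    # One user who interacted with items 0 and 2 of a 5-item catalogue.
    user_items = csr_matrix(np.array([[1, 0, 1, 0, 0]]))
    indptr, indices = user_items.indptr, user_items.indices
    n_items = user_items.shape[1]
    rng = np.random.default_rng(42)

    drawn = set()
    for _ in range(1000):
        i = int(rng.integers(0, n_items))       # sample from [0, n_items)
        liked = indices[indptr[0]:indptr[1]]
        pos = np.searchsorted(liked, i)
        if pos < len(liked) and liked[pos] == i:
            continue                            # reject: i is a positive
        drawn.add(i)

    # Positives 0 and 2 never survive rejection; every zero-interaction
    # item is reachable, which the pre-fix sampler could not do.
    assert drawn == {1, 3, 4}

test_negatives_cover_zero_interaction_items()
```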
AI use disclosure:
I have used LLMs extensively for understanding the issue and for cleaning up and generating comments for my code and PR.
I have written the code and am responsible for it.