Decouple batch size and number of negatives #263
base: main
Conversation
@cla-bot check
Thanks for tagging me. I looked for a signed form under your signature again, and updated the status on this PR. If the check was successful, no further action is needed. If the check was unsuccessful, please see the instructions in my first comment.
Pull Request Overview
This PR introduces the ability to decouple batch size from the number of negative samples in contrastive learning by adding a new `num_negatives` parameter to both the `Loader` and `CEBRA` classes. This allows for more stable training behavior by providing additional negative examples to the InfoNCE loss, independent of the batch size.
- Adds `num_negatives` parameter to `Loader` base class and `CEBRA` class APIs
- Updates all loader implementations to use `num_negatives` instead of duplicating `batch_size`
- Modifies goodness of fit computation to use `num_negatives` instead of `batch_size` (see the sketch after this list)
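For context on the last bullet: if the goodness-of-fit conversion follows the usual InfoNCE chance-level form, the change amounts to swapping the batch size for the number of negatives in the log term. A minimal sketch of that idea (not the actual code in `cebra/integrations/sklearn/metrics.py`; the exact formula there may differ):

```python
import numpy as np

def goodness_of_fit_sketch(infonce_loss: float, num_negatives: int) -> float:
    """Illustrative conversion of an InfoNCE loss to a goodness-of-fit score in bits.

    With this PR, the chance level is derived from ``num_negatives``
    rather than ``batch_size``.
    """
    chance_level = np.log(num_negatives)  # loss of an uninformative model
    nats_to_bits = np.log2(np.e)          # convert nats to bits
    return (chance_level - infonce_loss) * nats_to_bits
```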
Reviewed Changes
Copilot reviewed 9 out of 9 changed files in this pull request and generated 5 comments.
| File | Description |
|---|---|
| cebra/data/base.py | Adds `num_negatives` field to base `Loader` class with validation and deprecates `num_samples` parameter |
| cebra/data/single_session.py | Updates single session loaders to use `num_negatives` and removes `num_samples` parameter |
| cebra/data/multi_session.py | Updates multi-session loaders to use `num_negatives` and adds validation for unified loader |
| cebra/data/multiobjective.py | Updates multiobjective loaders to use `num_negatives` instead of `batch_size` |
| cebra/integrations/sklearn/cebra.py | Adds `num_negatives` parameter to `CEBRA` class and passes it to loaders |
| cebra/integrations/sklearn/metrics.py | Updates goodness of fit computation to use `num_negatives_` instead of `batch_size` |
| tests/test_loader.py | Comprehensive test updates to validate `num_negatives` functionality across all loader types |
| tests/test_sklearn.py | Adds basic test for `num_negatives` parameter in `CEBRA` class |
| tests/test_sklearn_metrics.py | Updates goodness of fit tests to validate `num_negatives` behavior |
hey @stes are any of the copilot reviews useful? If not, we can merge, thanks for this!!
Force-pushed from 1412484 to 0f7ac51
@MMathisLab fine to merge from my end. Went through the copilot comments and made some further edits.
This PR adds a new argument, `num_negatives`, to both the `Loader` and `CEBRA` classes (torch and sklearn API). This allows stabilizing training behavior by providing additional negative examples to the InfoNCE loss, independent of the batch size. We leveraged this logic for the models trained in DCL.

Behavior
If `num_negatives = None` (the default), the previous behavior is obtained for backwards compatibility and `Loader.batch_size` negative examples are drawn. If a different value is set, the number of negative examples in all loaders will be set to `num_negatives` instead of `Loader.batch_size`. The goodness of fit computation was also adapted to use `num_negatives`.
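As a usage illustration on the sklearn side, here is a minimal sketch; the model architecture and training settings below are placeholders, and only `num_negatives` is the argument added by this PR:

```python
from cebra import CEBRA

# Draw 1024 negatives per step while keeping a smaller batch of
# reference/positive pairs; num_negatives=None restores the previous
# behavior of drawing batch_size negatives.
model = CEBRA(
    model_architecture="offset10-model",
    batch_size=512,
    num_negatives=1024,
    max_iterations=10_000,
)
# model.fit(neural_data)  # data loading omitted in this sketch
```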
API modification

While implementing this functionality, I noticed an inconsistency between `single_session` and `multi_session` solvers. In the single session case, we passed `self.batch_size` through the `get_indices` function, while in `multi_session` we use `self.batch_size` directly. The second behavior makes more sense in the context of the general class design. I deprecated passing the `num_samples` parameter and adapted the samplers accordingly.
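To make the loader-side intent concrete, here is a toy sketch (not the actual cebra classes) of a loader that exposes `num_negatives` alongside `batch_size` and no longer threads `num_samples` through `get_indices`:

```python
import random
from typing import List, Optional, Tuple


class LoaderSketch:
    """Toy stand-in for the loader classes touched by this PR."""

    def __init__(self, batch_size: int, num_negatives: Optional[int] = None):
        self.batch_size = batch_size
        # None keeps the old behavior: draw batch_size negatives.
        self.num_negatives = batch_size if num_negatives is None else num_negatives

    def get_indices(self) -> Tuple[List[int], List[int]]:
        # Previously the single-session path called
        # get_indices(num_samples=self.batch_size); here the sampler reads
        # the sizes from the loader itself, so the reference and negative
        # sample counts can differ.
        reference = self._sample(self.batch_size)
        negatives = self._sample(self.num_negatives)
        return reference, negatives

    def _sample(self, n: int) -> List[int]:
        return [random.randrange(1_000) for _ in range(n)]
```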