OpenMOA v0 Changelog

Release April 2026

v0.2 — Code Quality, Bug Fixes, and Performance

OpenMOA v0.2 is a quality-focused release. It does not change any algorithm's mathematical behaviour or experimental results. Instead, it fixes correctness bugs that were silently producing wrong outputs, eliminates code duplication, hardens reproducibility guarantees, and delivers measurable speed-ups on the hot paths of several classifiers.

Every change is covered by the 61 new test cases added in this release.

What Changed at a Glance

Category	Count
Refactoring	2
Algorithm bug fixes	7
Stream Wrapper bug fixes	5
Performance optimisations	8
New test cases	61 (3 files)
Files modified or created	15

Refactoring

Q1 — `SparseInputMixin`: Eliminating Copy-Pasted Logic

The _get_sparse_x helper — which converts an incoming instance into a (indices, values) sparse pair — was duplicated almost verbatim across five classifier files. Any future fix to the three-branch dispatch logic would have needed to be applied five times.

A new SparseInputMixin class in src/openmoa/base/_sparse_mixin.py centralises the logic:

class SparseInputMixin:
    def _get_sparse_x(self, instance) -> tuple[np.ndarray, np.ndarray]:
        if hasattr(instance, "feature_indices"):
            return np.asarray(instance.feature_indices), np.asarray(instance.x, dtype=float)
        if hasattr(instance, "x_index") and hasattr(instance, "x_value"):
            return instance.x_index, instance.x_value
        x = np.asarray(instance.x, dtype=float)
        valid_mask = (x != 0) & (~np.isnan(x))
        return np.where(valid_mask)[0], x[valid_mask]

Affected files: _fesl_classifier.py, _oasf_classifier.py, _fobos_classifier.py, _ftrl_classifier.py, _rsol_classifier.py

Q2 — Instance-Level RNG: Fixing Global Random State Pollution

Multiple classifiers and stream wrappers called np.random.seed(random_seed) to initialise randomness. Setting a global seed has a well-known problem: any other code that touches np.random in the same process will alter the drawn sequence, making results non-reproducible between runs.

All classifiers and wrappers now carry their own np.random.RandomState instance:

# Before — pollutes global state
np.random.seed(random_seed)
noise = np.random.randn(d)

# After — isolated per object
self._rng = np.random.RandomState(random_seed)
noise = self._rng.randn(d)

Algorithm Bug Fixes

A1 — FOBOS / FTRL: `IndexError` on Growing Feature Streams

Symptom

Running FOBOS or FTRL on an OpenFeatureStream in incremental mode crashes with IndexError: index N is out of bounds for axis 0 with size M.

Cause

Both classifiers allocate their weight arrays at construction time using the initial feature count. As OpenFeatureStream introduces new features, their global IDs exceed the array dimensions.

Fix

A _ensure_dimension(target_dim) method resizes the weight array on demand with a 1.5× growth factor. FTRL resizes three arrays (z, n, w) simultaneously.

def _ensure_dimension(self, target_dim: int):
    if target_dim <= self.n_features:
        return
    new_dim = max(target_dim, int(self.n_features * 1.5))
    new_W = np.zeros((new_dim, self.n_outputs))
    new_W[:self.n_features] = self.W
    self.W = new_W
    self.n_features = new_dim

A2 — ORF3V: Wrong Feature IDs in Weight Dictionary

Symptom

ORF3V's per-feature forests accumulate weights under incorrect keys, causing the model to learn associations between the wrong features and trees.

Cause

_update_weights used range(len(x)) as feature IDs — local position indices within the current instance vector. With OpenFeatureStream, position 0 in instance.x may correspond to global feature ID 3, 7, or any other value.

Fix

Use getattr(instance, "feature_indices", range(len(x))) to obtain the correct global IDs before iterating.

A3 — OLD3S: Normalisation Statistics Never Updated

Symptom

OLD3S's online Min-Max normalisation is effectively disabled from the first instance onward, meaning the VAE always receives unnormalised input.

Cause

A one-character variable name error in the update condition made it always False.

# Before — always False → stats never update
if len(x_raw) == len(stats['min']):
    stats['min'] = np.minimum(stats['min'], x_sub)

# After
if len(x_sub) == len(stats['min']):
    stats['min'] = np.minimum(stats['min'], x_sub)

This bug affects every OLD3S experiment run with v0.1.

A4 — OVFM: Ordinal Variable Initialisation Uses Global RNG

Cause

_init_z_ordinal sampled from np.random.uniform(...) instead of self._rng.uniform(...), breaking per-instance seed isolation introduced by Q2.

Fix

Replace all global np.random calls in _ovfm_classifier.py with the instance-level RNG.

A5 — OSLMF: DensityPeaks Called Once Per Instance Instead of Once Per Batch

Symptom

OSLMF is dramatically slower than expected and its learning behaviour diverges from the published algorithm.

Cause

The original paper (Wu et al., AAAI 2023) specifies that DensityPeaks label propagation runs once per batch. The implementation called it inside train(), triggering an O(buffer²) computation at every instance.

Fix

Introduce a batch_size parameter (default 50). Instances accumulate in an internal buffer; DensityPeaks, EM covariance update, and SGD all run together when the buffer is full.

A6 — ORF3V: Dead Code Removed

_update_feature_stats was defined in _orf3v_classifier.py but never called. Its logic also overlapped with the FeatureStatistics class. The method has been deleted.

A7 — FESL: Inconsistent Indentation

Several method bodies in _fesl_classifier.py mixed 2-space and 4-space indentation. All code is now uniformly 4-space indented.

Stream Wrapper Bug Fixes

W1 — `restart()` Did Not Reset the RNG

Symptom

Restarting a stream and replaying it produces a different random sequence than the first pass, breaking reproducibility.

Fix

All wrappers now reinitialise self._rng and regenerate derived schedules inside restart().

W2 — `ShuffledStream` Swallowed All Exceptions

# Before — silently catches KeyboardInterrupt, MemoryError, everything
try:
    ...
except Exception:
    break

# After — separates expected from unexpected
try:
    ...
except StopIteration:
    break
except Exception as e:
    warnings.warn(f"ShuffledStream: unexpected error: {e}", RuntimeWarning)
    break

W3 — `OpenFeatureStream` Pre-allocated O(N) Index Cache

Symptom

Constructing an OpenFeatureStream over a 100,000-instance dataset consumed significant memory before the first instance was read.

Fix

Lazy evaluation — _get_active_indices(t) is called on demand in next_instance(). For random modes, determinism is preserved by deriving the RNG seed from base_seed + t.

W4 — Magic Number in TDS

The constant n_stages = 10 in TrapezoidalStream was undocumented. A comment now explains its origin: the TDS paper (Gao et al.) defines 10 birth stages.

W5 — Duplicated EDS Boundary Formula

OpenFeatureStream (EDS mode) and EvolvableStream each had their own copy of the same boundary computation. The formula is now in a single module-level function _calc_eds_boundaries() shared by both classes.

Performance Optimisations

P1 — OASF: O(d × L) Roll → O(1) Ring Buffer

The sliding-window weight matrix was shifted every step with np.roll(W, -1, axis=1), which copies the entire matrix. A ring buffer with a write pointer performs the same logical operation in O(1):

# Before
self.W = np.roll(self.W, -1, axis=1)
self.W[:, -1] = w_new

# After
self.W[:, self._ptr] = w_new
self._ptr = (self._ptr + 1) % self.L

P2 — OSLMF: `list.pop(0)` → `deque(maxlen=...)`

The DensityPeaks buffer used a Python list with pop(0) to evict the oldest entry — an O(n) operation on every step. Replaced with collections.deque(maxlen=buffer_size), which evicts automatically in O(1).

P3 — FESL: Vectorised Prediction via Mapping Matrix

The inner loop over old-space weights was replaced with a single NumPy dot product, eliminating Python-level iteration over feature IDs.

P4 — ORF3V: Vectorised CDF Computation

# Before
return sum(1 for v in arr if v < split_val) / len(arr)

# After
return float(np.sum(arr < split_val)) / len(arr)

P5 — OSLMF: Vectorised DensityPeaks Delta Computation

The core O(n²) loop that finds each point's nearest higher-density neighbour was replaced with a rank-mask matrix approach — no Python loops, two NumPy min calls. Numerical equivalence verified by unit tests (error < 1e-10).

rank_mask = rank[np.newaxis, :] < rank[:, np.newaxis]   # (n, n) bool
dist_masked = np.where(rank_mask, dist_matrix, np.inf)
delta = dist_masked.min(axis=1)
nearest_higher = np.argmin(dist_masked, axis=1)

Buffer size	Before	After	Speedup
n = 50	0.151 ms	0.047 ms	3.2×
n = 100	0.430 ms	0.253 ms	1.7×
n = 200 (default)	1.492 ms	1.109 ms	1.3×

P6 — OSLMF / OVFM: `statsmodels.ECDF` → `np.searchsorted`

The Gaussian Copula transform constructed a statsmodels.ECDF object per feature per batch. The operation reduces to a sorted search with the standard Hájek smoothing factor. statsmodels is no longer imported in either file, removing a heavy optional dependency.

# Before
ecdf = ECDF(window_clean)
u = (len(window_clean) / (len(window_clean) + 1)) * ecdf(x_obs)

# After — numerically identical
sorted_w = np.sort(window_clean)
u = np.searchsorted(sorted_w, x_obs, side='right') / (len(sorted_w) + 1)

Observations	Before	After	Speedup
10	0.0146 ms	0.0042 ms	3.5×
100	0.0154 ms	0.0038 ms	4.0×

P7 — `np.array` → `np.asarray` (14 call sites)

np.array(x) always allocates and copies. np.asarray(x) returns the original array unchanged when dtype already matches. This affects train() and predict() — called on every instance — so the gain accumulates over long streams.

Case	`np.array`	`np.asarray`	Speedup
float64, no conversion needed	0.30 µs	0.07 µs	4.2×
dtype mismatch (must copy)	same		—

P8 — OLD3S: Vectorised HBP Weight Update

# Before — Python loop, one temporary tensor per layer
for i, l in enumerate(losses):
    bundle['hbp_weights'][i] *= torch.pow(decay, l)
bundle['hbp_weights'] /= bundle['hbp_weights'].sum()

# After — single in-place operation, no loop, no temporaries
losses_t = torch.stack([l.detach() for l in losses])
bundle['hbp_weights'].mul_(decay.pow(losses_t))
bundle['hbp_weights'].div_(bundle['hbp_weights'].sum())

Measured speedup: 1.4× for 3-layer MLP; gains scale with depth.

End-to-End Benchmark

Environment: Windows 11, Intel CPU, Python 3.13, NumPy 2.x. 400 train calls, d = 8.

Classifier	Total time	Per instance
OSLMF (batch = 50)	51.0 ms	0.128 ms
OVFM	31.8 ms	0.080 ms

Test Coverage

Three new test files, 61 test cases, all passing.

`tests/test_stream_wrappers.py` — 23 cases

Covers correct feature_indices attachment, monotone dimension trends, EDS ID ranges, restart() reproducibility for all wrappers, missing_ratio=0 produces no NaN, invalid arguments raise exceptions, ShuffledStream yields every instance exactly once.

`tests/test_uol_classifiers.py` — 27 cases

Covers smoke tests (train + predict does not crash) for all 10 classifiers, predict_proba shape/range/sum-to-one, FOBOS/FTRL dynamic dimension expansion without IndexError, ORF3V weight keys are valid global feature IDs, OSLMF correct batch gating, schema mismatch raises ValueError.

`tests/test_optimizations.py` — 11 cases

Verifies numerical equivalence: searchsorted vs ECDF error < 1e-12; vectorised DensityPeaks vs loop-based, element-wise error < 1e-10 across four matrix sizes; OVFM, OSLMF, and ORF3V with the same seed produce bit-identical prediction sequences across two runs.

Compatibility

v0.2 is fully backwards compatible with v0.1. The public API (train, predict, predict_proba, all stream wrapper constructors) is unchanged. The only user-visible behavioural difference is that OLD3S normalisation now works correctly — experiments that relied on the buggy (unnormalised) behaviour may see different accuracy numbers.

Release January 2026

v0.1 — Introducing Utilitarian Online Learning for Dynamic Feature Spaces

OpenMOA is a Python library for Utilitarian Online Learning (UOL) — a paradigm designed for real-world streaming environments where the feature space itself evolves over time. Built on top of CapyMOA, OpenMOA provides a clean, unified API that integrates MOA online learners, CapyMOA's stream learning backend, and PyTorch deep models.

Why OpenMOA?

Most online learning libraries assume a fixed feature space: every instance arrives with the same set of features, in the same order, with the same semantics. Real-world data streams routinely violate this assumption.

Sensors go offline. New signals are added mid-deployment. Features phase in or out according to external schedules. In these settings, standard online learners silently produce wrong predictions — or crash outright.

OpenMOA addresses this gap with two purpose-built layers:

Stream Wrappers that simulate diverse feature-space dynamics on top of any static dataset
UOL Classifiers that are designed from the ground up to handle evolving feature spaces

Installation

OpenMOA requires Java (for MOA) and PyTorch.

# Check Java
java -version

# Install PyTorch (CPU)
pip install torch torchvision --index-url https://download.pytorch.org/whl/cpu

# Install OpenMOA
pip install openmoa

# Verify
python -c "import openmoa; print(openmoa.__version__)"

For GPU support or troubleshooting, see the installation guide.

Core Components

Stream Wrappers

OpenMOA wraps any static dataset and transforms it into a dynamic stream that simulates feature-space evolution. Five wrapper types cover the major paradigms studied in the literature.

`OpenFeatureStream` — Variable-Length Output with Global Index Mapping

The flagship wrapper. At each time step, it exposes only the currently active features and attaches a feature_indices attribute to every instance — a list of global feature IDs corresponding to the values in instance.x. Downstream learners use this to maintain consistent weight indexing across changing feature sets.

Supports six evolution modes: EDS (Evolvable), TDS (Trapezoidal), CDS (Capricious), random, cyclic, and custom schedules.

from openmoa.stream import OpenFeatureStream
from capymoa.datasets import Electricity

stream = OpenFeatureStream(
    base_stream=Electricity(),
    mode="EDS",
    n_segments=5,
    overlap_ratio=0.5,
)

instance = stream.next_instance()
print(instance.feature_indices)  # e.g. [0, 2, 5] — active global feature IDs
print(instance.x)                # values for those three features only

`TrapezoidalStream` — Features Phase In Gradually (TDS)

Implements the Trapezoidal Data Stream paradigm (Gao et al.). Features enter the stream in stages: each feature has a birth stage and a retirement stage, creating a trapezoidal activation window. Inactive features are represented as NaN in a fixed-width vector.

`CapriciousStream` — Random Missing Features (CDS)

At each time step, features are independently masked with a configurable missing_ratio. Missing values appear as NaN. Models must tolerate sparse, unpredictably incomplete input.

from openmoa.stream import CapriciousStream

stream = CapriciousStream(base_stream=..., missing_ratio=0.3)
# ~30% of feature values will be NaN at each step

`EvolvableStream` — Segmented Feature Rotation (EDS)

Divides the stream into alternating stable and transition segments. During stable segments, a fixed feature subset is active. During transitions, old features fade out while new ones phase in. Output is fixed-width with NaN fill.

`ShuffledStream` — Eliminates Label-Sorting Bias

Many benchmark datasets are sorted by class label, which artificially inflates the performance of any learner with a memory component. ShuffledStream buffers the entire dataset in memory and shuffles instances before replay, ensuring evaluation reflects truly random arrival order.

UOL Classifiers

Ten algorithms, all implementing a common interface (train(instance) / predict(instance) / predict_proba(instance)), all compatible with OpenFeatureStream's variable-length input.

Algorithm	Description
FESL	Feature-Evolving Sparse Learning. The core UOL algorithm: maintains a sparse weight vector in a global feature ID space and handles feature-space transitions via explicit mapping matrices.
OASF	Online Adaptive Sparse Filtering. Sliding-window weight matrix; adapts to changing feature relevance over time.
RSOL	Randomly Sparse Online Learning. Random projection-based sparse updates with provable regret bounds.
FOBOS	Forward-Backward Splitting. Proximal gradient descent with L1 regularization for online sparse learning.
FTRL	Follow The Regularized Leader. Per-coordinate adaptive learning rates with L1+L2 regularization.
ORF3V	Online Random Forest for Variable-length feature Vectors. Extends random forests to streams with dynamic feature spaces by maintaining per-feature-ID tree structures.
OVFM	Online Variational Factorization Machine. Models feature interactions via a Gaussian Copula; designed for mixed continuous/ordinal feature types.
OSLMF	Online Semi-supervised Local Matrix Factorization. Combines DensityPeaks clustering with a Copula-based latent representation for label-efficient learning.
OLD3S	Online Deep Dynamic Semi-Supervised Streams. VAE-based deep model with hierarchical Bayesian pruning (HBP); supports unlabeled instances.
OWSS	Online Weighted Sparse Streams. Importance-weighted sparse updates for non-stationary distributions.

from openmoa.stream import OpenFeatureStream
from openmoa.classifier import FESLClassifier
from capymoa.datasets import Electricity
from capymoa.evaluation import prequential_evaluation

stream = OpenFeatureStream(Electricity(), mode="EDS", n_segments=5)
learner = FESLClassifier(schema=stream.get_schema())

results = prequential_evaluation(stream, learner, window_size=500)
print(results.accuracy())

Datasets

OpenMOA v0.1 ships with benchmark support for 17 datasets — 11 binary classification and 6 multi-class — drawn from UCI, MOA's built-in repository, and domain-specific streaming benchmarks. All 17 datasets are evaluated under all three feature-space paradigms (TDS, CDS, EDS), yielding a comprehensive 10-algorithm × 17-dataset × 3-paradigm benchmark grid.

Evaluation Protocol

All experiments follow the prequential (test-then-train) protocol: each instance is first used for evaluation, then for training. This produces unbiased estimates of online accuracy without a held-out test set. Prequential evaluation is the standard protocol in stream learning research and is natively supported by CapyMOA's evaluation utilities.

Cite

@misc{ZhiliWang2025OpenMOA,
    title={{OpenMOA}: A Python Library for Utilitarian Online Learning},
    author={Zhili Wang, Heitor M. Gomes and Yi He},
    year={2025},
    archivePrefix={arXiv},
    url={https://arxiv.org/abs/},
}

Links

Documentation GitHub PyPI Discord

OpenMOA v0 Changelog

v0.2 — Code Quality, Bug Fixes, and Performance

What Changed at a Glance

Refactoring

Q1 — SparseInputMixin: Eliminating Copy-Pasted Logic

Q2 — Instance-Level RNG: Fixing Global Random State Pollution

Algorithm Bug Fixes

A1 — FOBOS / FTRL: IndexError on Growing Feature Streams

A2 — ORF3V: Wrong Feature IDs in Weight Dictionary

A3 — OLD3S: Normalisation Statistics Never Updated

A4 — OVFM: Ordinal Variable Initialisation Uses Global RNG

A5 — OSLMF: DensityPeaks Called Once Per Instance Instead of Once Per Batch

A6 — ORF3V: Dead Code Removed

A7 — FESL: Inconsistent Indentation

Stream Wrapper Bug Fixes

W1 — restart() Did Not Reset the RNG

W2 — ShuffledStream Swallowed All Exceptions

W3 — OpenFeatureStream Pre-allocated O(N) Index Cache

W4 — Magic Number in TDS

W5 — Duplicated EDS Boundary Formula

Performance Optimisations

P1 — OASF: O(d × L) Roll → O(1) Ring Buffer

P2 — OSLMF: list.pop(0) → deque(maxlen=...)

P3 — FESL: Vectorised Prediction via Mapping Matrix

P4 — ORF3V: Vectorised CDF Computation

P5 — OSLMF: Vectorised DensityPeaks Delta Computation

P6 — OSLMF / OVFM: statsmodels.ECDF → np.searchsorted

P7 — np.array → np.asarray (14 call sites)

P8 — OLD3S: Vectorised HBP Weight Update

End-to-End Benchmark

Test Coverage

tests/test_stream_wrappers.py — 23 cases

tests/test_uol_classifiers.py — 27 cases

tests/test_optimizations.py — 11 cases

Compatibility

v0.1 — Introducing Utilitarian Online Learning for Dynamic Feature Spaces

Why OpenMOA?

Installation

Core Components

Stream Wrappers

OpenFeatureStream — Variable-Length Output with Global Index Mapping

TrapezoidalStream — Features Phase In Gradually (TDS)

CapriciousStream — Random Missing Features (CDS)

EvolvableStream — Segmented Feature Rotation (EDS)

ShuffledStream — Eliminates Label-Sorting Bias

UOL Classifiers

Datasets

Evaluation Protocol

Cite

Links

Q1 — `SparseInputMixin`: Eliminating Copy-Pasted Logic

A1 — FOBOS / FTRL: `IndexError` on Growing Feature Streams

W1 — `restart()` Did Not Reset the RNG

W2 — `ShuffledStream` Swallowed All Exceptions

W3 — `OpenFeatureStream` Pre-allocated O(N) Index Cache

P2 — OSLMF: `list.pop(0)` → `deque(maxlen=...)`

P6 — OSLMF / OVFM: `statsmodels.ECDF` → `np.searchsorted`

P7 — `np.array` → `np.asarray` (14 call sites)

`tests/test_stream_wrappers.py` — 23 cases

`tests/test_uol_classifiers.py` — 27 cases

`tests/test_optimizations.py` — 11 cases

`OpenFeatureStream` — Variable-Length Output with Global Index Mapping

`TrapezoidalStream` — Features Phase In Gradually (TDS)

`CapriciousStream` — Random Missing Features (CDS)

`EvolvableStream` — Segmented Feature Rotation (EDS)

`ShuffledStream` — Eliminates Label-Sorting Bias