Advancements in Neural Architectures and Activation Functions: Toward More Powerful, Stable, and Domain-Aligned Models
Research on neural architectures and nonlinear activation functions is moving quickly. Recent innovations are expanding the expressive power of models while also addressing crucial challenges of stability, interpretability, and alignment with domain-specific principles. Together, these developments point toward AI systems that are more reliable, more efficient, and better able to model complex real-world phenomena with high fidelity.
Architectural Innovations: Enhancing Stability, Physical Consistency, and Efficiency
Continuous-Time Piecewise-Linear RNNs
Building on traditional recurrent neural networks (RNNs), continuous-time piecewise-linear RNNs have emerged as a promising approach for modeling temporal and dynamical systems. Because their vector field is composed of linear regions, properties such as fixed points and local stability can be analyzed region by region, which improves interpretability and makes these models well-suited to signal processing, real-time forecasting, and physical system simulation. They capture long-term dependencies while maintaining predictable, stable behavior, which is crucial for deployment in safety-critical environments.
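To make the idea concrete, here is a minimal sketch of a continuous-time piecewise-linear RNN cell. The Euler discretization, the ReLU nonlinearity, and the specific parameterization are illustrative assumptions rather than the formulation of any particular paper.

```python
# Minimal sketch of a continuous-time piecewise-linear RNN cell.
# Assumptions: explicit Euler discretization, ReLU as the piecewise-linear branch.
import torch
import torch.nn as nn

class PiecewiseLinearRNNCell(nn.Module):
    def __init__(self, state_dim: int, input_dim: int, dt: float = 0.05):
        super().__init__()
        self.A = nn.Parameter(torch.randn(state_dim, state_dim) * 0.1)  # linear recurrence
        self.W = nn.Parameter(torch.randn(state_dim, state_dim) * 0.1)  # gain on the ReLU branch
        self.C = nn.Parameter(torch.randn(state_dim, input_dim) * 0.1)  # input coupling
        self.dt = dt

    def forward(self, z, s):
        # dz/dt = -z + A z + W relu(z) + C s  (piecewise-linear vector field)
        dz = -z + z @ self.A.T + torch.relu(z) @ self.W.T + s @ self.C.T
        return z + self.dt * dz  # one explicit Euler step
```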
Physics-Informed Architectures and PINNs
Physics-Informed Neural Networks (PINNs) have seen rapid adoption across scientific and biomedical domains. By embedding conservation laws, material properties, and other physical constraints into the training objective (and, in some variants, the architecture itself), PINNs encourage physically consistent predictions even when data are limited. Recent work has demonstrated their effectiveness in fluid dynamics, climate simulation, and medical imaging, where outputs that respect fundamental physical laws enhance trustworthiness and interpretability.
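The core mechanism is a residual penalty on the governing equation. The sketch below shows such a physics loss for the 1D heat equation u_t = alpha * u_xx; the choice of PDE, network size, and collocation scheme are illustrative assumptions, not the specific setups cited above.

```python
# Hedged sketch of a physics-informed loss for the 1D heat equation u_t = alpha * u_xx.
import torch
import torch.nn as nn

net = nn.Sequential(nn.Linear(2, 64), nn.Tanh(), nn.Linear(64, 64), nn.Tanh(), nn.Linear(64, 1))

def pde_residual(xt, alpha=0.01):
    xt = xt.requires_grad_(True)          # columns: (x, t)
    u = net(xt)
    grads = torch.autograd.grad(u.sum(), xt, create_graph=True)[0]
    u_x, u_t = grads[:, 0:1], grads[:, 1:2]
    u_xx = torch.autograd.grad(u_x.sum(), xt, create_graph=True)[0][:, 0:1]
    return u_t - alpha * u_xx             # zero wherever the PDE is satisfied

xt_colloc = torch.rand(1024, 2)           # random collocation points in (x, t)
physics_loss = pde_residual(xt_colloc).pow(2).mean()
# total loss = data_loss + lambda * physics_loss  (data term omitted in this sketch)
```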
Compact Implicit Neural Representations: The Rise of SMNs
Subtractive Modulative Networks (SMNs) have gained attention for their ability to compactly represent complex signals. Leveraging learnable periodic activations and innovative parameterizations, SMNs achieve high-fidelity reconstructions with fewer parameters, significantly reducing computational demands. This efficiency is vital in applications like high-fidelity 3D reconstructions, neural radiance fields (NeRFs), and biomedical imaging, where resource constraints often limit model complexity.
Transformation-Aware Neural Implicit Methods
Recent research has focused on transformable neural implicit representations for deformable image registration. By utilizing stationary velocity fields on matrix groups, these models incorporate physical deformation constraints, resulting in more accurate, interpretable, and physically plausible outcomes. Such methods are particularly relevant in biomedical imaging, where modeling tissue deformation accurately is essential for diagnostics and treatment planning.
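As a toy illustration of the group-exponential idea, the sketch below parameterizes a global affine transform as the matrix exponential of a learnable velocity matrix, which keeps the transform invertible by construction. The registration methods referenced above learn richer, spatially varying velocity fields; this example only conveys the underlying principle.

```python
# Illustrative sketch: a transform parameterized by a stationary velocity matrix.
# exp(velocity) always lies in the matrix group GL(4), so the map is invertible.
import torch
import torch.nn as nn

class MatrixGroupTransform(nn.Module):
    def __init__(self):
        super().__init__()
        self.velocity = nn.Parameter(torch.zeros(4, 4))  # element of the Lie algebra

    def forward(self, points_h):
        # points_h: (N, 4) homogeneous 3D points
        transform = torch.linalg.matrix_exp(self.velocity)
        return points_h @ transform.T
```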
Formally Verified Neural PDE Solvers: The BEACONS Framework
The development of BEACONS, a formally verified neural PDE solver, marks a milestone in creating trustworthy AI systems. These models undergo rigorous correctness and safety verification, ensuring adherence to physical laws and safety standards, which is crucial for medical diagnostics, scientific simulations, and high-stakes decision-making. This integration of formal methods underscores a broader trend toward robust, reliable AI capable of operating in critical environments.
Expanding the Nonlinear Activation Toolbox: From Rational to Fractal Functions
Rational and Distribution-Inspired Activations
- Rational activations employ ratios of trainable polynomials, enabling models to adapt their nonlinearities during training and improving flexibility and performance (a minimal sketch follows this list).
- Skewed Student-t Linear Units (SSLU) build heavy-tailed distributions into the activation, making networks more robust to outliers and better suited to heavy-tailed, real-world data.
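Below is a minimal sketch of a trainable rational (Pade-style) activation, assuming a degree-3 numerator and degree-2 denominator. The identity-like initialization and the absolute value that keeps the denominator positive are common conventions, not the definitive formulation of any specific paper.

```python
# Trainable rational activation R(x) = P(x) / (1 + |Q(x)|), sketched with small degrees.
import torch
import torch.nn as nn

class RationalActivation(nn.Module):
    def __init__(self):
        super().__init__()
        self.p = nn.Parameter(torch.tensor([0.0, 1.0, 0.0, 0.0]))  # numerator coeffs (deg 3)
        self.q = nn.Parameter(torch.tensor([0.0, 0.0]))            # denominator coeffs (deg 2)
        # initialized so that R(x) = x before training

    def forward(self, x):
        powers = torch.stack([x**k for k in range(4)], dim=-1)     # 1, x, x^2, x^3
        num = (powers * self.p).sum(-1)
        den = 1.0 + (torch.stack([x, x**2], dim=-1) * self.q).sum(-1).abs()
        return num / den
```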
Data-Adaptive and Periodic Activations
- Distribution-inspired activations shape the nonlinearity to match the statistics of the data flowing through the network, which can make learning more efficient.
- Learnable periodic functions, such as sinusoidal activations with trainable frequency and phase, are particularly effective in Implicit Neural Representations (INRs) like NeRFs. They capture high-frequency details, enabling the modeling of intricate textures, signals, and multi-scale patterns (a minimal sketch follows this list).
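The following sketch shows a sinusoidal activation with learnable per-feature frequency and phase, in the spirit of SIREN-style INRs. The default frequency of 30 and the per-feature parameterization are illustrative assumptions; the methods discussed here differ in exactly how the periodicity is parameterized.

```python
# Sinusoidal activation with learnable frequency and phase (illustrative defaults).
import torch
import torch.nn as nn

class LearnableSine(nn.Module):
    def __init__(self, features: int, omega_init: float = 30.0):
        super().__init__()
        self.omega = nn.Parameter(torch.full((features,), omega_init))  # per-feature frequency
        self.phase = nn.Parameter(torch.zeros(features))                # per-feature phase shift

    def forward(self, x):
        return torch.sin(self.omega * x + self.phase)
```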
Fractal Activation Functions
A novel frontier is the development of fractal activation functions, inspired by self-similar, multi-scale structures. These functions introduce hierarchical nonlinearities, empowering neural networks to capture complex, multi-resolution patterns. Early studies suggest that fractal activations significantly enhance the expressive capacity of models, especially in tasks involving multi-scale analysis or complex pattern recognition.
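Because fractal activations are still an emerging idea, the sketch below is purely illustrative: it builds a self-similar, multi-scale nonlinearity as a truncated Weierstrass-like sum of a base activation applied at geometrically spaced scales. Published fractal activations may be defined quite differently.

```python
# Illustrative "fractal" activation: a truncated self-similar sum of tanh across scales.
import torch
import torch.nn as nn

class FractalActivation(nn.Module):
    def __init__(self, num_scales: int = 4, decay: float = 0.5, dilation: float = 2.0):
        super().__init__()
        self.num_scales = num_scales
        self.decay = nn.Parameter(torch.tensor(decay))        # amplitude ratio between scales
        self.dilation = nn.Parameter(torch.tensor(dilation))  # frequency ratio between scales

    def forward(self, x):
        out = torch.zeros_like(x)
        for k in range(self.num_scales):
            out = out + (self.decay ** k) * torch.tanh((self.dilation ** k) * x)
        return out
```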
Subtractive Modulative Networks (SMNs) with Periodic Activations
Research into learnable periodic activations within SMNs has demonstrated their capacity to represent complex signals efficiently. These architectures are compact yet highly expressive, making them ideal for 3D modeling, biomedical imaging, and neural fields, where detail fidelity and computational efficiency are critical.
The Interplay of Optimization Algorithms and Activation Functions
Recent insights reveal that optimization algorithms—notably Adam and Muon—exert a profound influence on the implicit bias of learned functions, especially when combined with smooth, homogeneous activation functions.
Key findings include:
- Optimizer-induced biases shape the smoothness, generalization, and stability of neural models.
- When paired with smooth, homogeneous activations, optimizers like Adam guide models toward more stable and interpretable solutions.
- Recognizing this interplay emphasizes the importance of jointly designing architectures, activation functions, and training procedures to realize robust, trustworthy models.
Advances in Implicit Neural Representation Techniques
Building on this understanding, several advanced INR techniques have emerged:
- SIREN employs sinusoidal activations to model high-frequency signals effectively (a minimal SIREN-style layer is sketched after this list).
- WIRE (Wavelet Implicit neural REpresentation) uses complex Gabor wavelet activations for improved fidelity and robustness.
- FINER introduces variable-periodic activation functions that let the spectral bias of the representation be tuned, supporting multi-scale signal modeling.
- F-INR (Functional Tensor Decomposition for INRs), a recent addition, uses tensor decomposition techniques to achieve more efficient, compact representations of signals, further pushing the boundaries of resource-efficient modeling.
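For reference, here is a compact SIREN-style coordinate MLP under common default choices (omega_0 = 30 and the 1/omega_0-scaled uniform initialization in hidden layers). WIRE, FINER, and F-INR modify the activation or the factorization rather than this basic skeleton.

```python
# Compact SIREN-style coordinate MLP mapping 2D coordinates to RGB.
import math
import torch
import torch.nn as nn

class SirenLayer(nn.Module):
    def __init__(self, in_f, out_f, omega_0=30.0, is_first=False):
        super().__init__()
        self.omega_0 = omega_0
        self.linear = nn.Linear(in_f, out_f)
        # SIREN initialization: wider range for the first layer, scaled by 1/omega_0 afterwards
        bound = 1.0 / in_f if is_first else math.sqrt(6.0 / in_f) / omega_0
        nn.init.uniform_(self.linear.weight, -bound, bound)

    def forward(self, x):
        return torch.sin(self.omega_0 * self.linear(x))

# Minimal implicit image representation: (x, y) -> (r, g, b)
siren = nn.Sequential(SirenLayer(2, 256, is_first=True), SirenLayer(256, 256), nn.Linear(256, 3))
```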
Practical Benefits and Future Directions
These combined innovations offer tangible advantages:
- Higher expressivity with fewer parameters, reducing computational and memory costs.
- Enhanced stability and physical consistency, vital for scientific and medical applications.
- Better alignment with domain principles, improving trustworthiness and interpretability.
- Resource efficiency, enabling deployment on edge devices, medical imaging systems, and real-time environments.
Looking ahead, research is increasingly focused on integrating physics-based modeling, probabilistic reasoning, and adaptive nonlinearities. The goal is to develop compact, reliable, and domain-aware AI systems capable of addressing complex scientific, medical, and visual challenges with unprecedented precision.
The Significance of the WACV2026 F-INR Development
A notable recent milestone is the presentation of F-INR: Functional Tensor Decomposition for Implicit Neural Representations at WACV2026. This approach leverages tensor decomposition techniques within neural representations to achieve highly efficient, scalable, and expressive models. The F-INR framework exemplifies the trend toward compactness and efficiency, enabling models to represent intricate signals with less computational overhead and greater interpretability. As showcased in the accompanying YouTube video, F-INR demonstrates promising capabilities in modeling complex signals, paving the way for more resource-effective AI systems.
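The text above does not spell out the exact decomposition, so the following is a hedged sketch of the general idea behind factorized INRs: a rank-R, CP-style functional decomposition in which one small MLP per coordinate axis produces R factor functions whose elementwise product is summed. The actual F-INR formulation and training procedure may differ.

```python
# Hedged sketch of a rank-R functional (CP-style) decomposition for a 2D signal.
import torch
import torch.nn as nn

def axis_mlp(rank: int) -> nn.Module:
    # small per-axis network producing R factor values for each coordinate
    return nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, rank))

class FactorizedINR2D(nn.Module):
    def __init__(self, rank: int = 16):
        super().__init__()
        self.fx, self.fy = axis_mlp(rank), axis_mlp(rank)

    def forward(self, x, y):
        # x, y: (N, 1) coordinate columns; output: (N,) scalar signal values
        return (self.fx(x) * self.fy(y)).sum(dim=-1)
```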
Conclusion
The ongoing wave of innovations in neural architectures and activation functions signals a new era of more powerful, stable, and domain-aligned AI models. From physics-informed architectures to fractal and periodic activations, and from formal verification frameworks like BEACONS to tensor-based INR techniques like F-INR, the field is converging toward creating systems that are not only more expressive but also trustworthy and resource-efficient.
As these developments continue to mature, they will enable AI to more faithfully model the complexities of the real world, adhere to domain principles, and operate reliably in critical applications—setting the stage for a future where AI systems are seamlessly integrated into scientific discovery, medicine, and technology with unprecedented capabilities and trust.