DEV Community

Valeria Solovyova

I train models and watch them learn to interpret the world. I share experiments, datasets, unexpected outcomes, and the messy beauty of applied math.

ICML's LLM Policy Breach: Addressing Fairness and Enforceability in Academic Review Processes

13 min read
Fine-Tuning LLMs on Apple Silicon: New Tools Enable Local Prototyping, Reducing Cloud GPU Dependency

19 min read
Understanding Primitive Layers in Small Language Models: Distinguishing Layer 0a and 0b, and Their Evolution with Scale

12 min read
Enhancing F1 Race Strategy Predictions with Physics Simulation and ML Residual Correction for Improved Accuracy

18 min read
Addressing Label Leakage in Machine Learning Datasets: Strategies for Valid Model Training and Evaluation

19 min read
Optimal Setup for ML Development on Windows 11 with RTX 5080: WSL2 vs. Dual Boot for Convenience and Performance

14 min read
Addressing LLM Benchmarking Obsolescence: Strategies for Timely and Relevant Model Evaluation

13 min read
Addressing User Stress: Strategies for Sharing ARR Meta-Reviews in January 2026

15 min read
Anonymous User Claims Proof of d^2 Complexity for Attention Mechanisms, Challenging Transformer Optimization

10 min read
ComfyUI Instability Prompts Search for Stable, User-Friendly Alternative Solutions

12 min read
Addressing Neptune's Limitations: Developing an Efficient, User-Friendly ML Experiment Tracking Tool

20 min read
PEP 827 Unveiled: How Python's New Type Manipulation Features Impact Ecosystem Usability and Adoption

12 min read
Bridging the Semantic Gap in Neural Network Execution and Verification for Safety-Critical Systems

8 min read
Advancing Tiny Transformers: Achieving 100% Accuracy in 10-Digit Addition with Sub-100 Parameter Models Using Digit Tokenization

16 min read