
kcok独立开发者
-
TransGoing – Building a Real-Time Polyglot That Works in Airplane Mode Stack Snapshot
ASR: ONNX-exported Whisper-small (244 M params) → int8 quant → 30 % speed-up via TensorRT. NMT: Transformer-base 12-layer, pruned to 320 M params; supports 200 language pairs, size…
-
ProbFace – Hollywood Face-Swap at Consumer Cost (and Ethical Guardrails) Model Architecture
ArcFace backbone → 512-D embedding → transformer-based alignment → StyleGAN2 fusion. Temporal stabiliser: LSTM looks across 7-frame window → reduces flicker from 18 % to <1 % (m…
-
ColorsFix – Reconstructing Lost Reality with GANs and Ethical Auto-Delete Pipeline Walk-through
Scratch-removal: Partial-convolution inpainting (NVIDIA’s gated conv) trained on 50 k artificially scratched vintage photos. Colourisation: Dual-branch GAN – one branch predicts lu…
-
QiblaSalat – Crafting an AR Compass That Prays as Good as It Looks Sensor Fusion Recipe
Magnetometer + accelerometer → Kalman filter → pitch/roll compensated heading. Gyroscope drift correction every 2 s using gravity vector; maintains <0.5° accuracy even when phon…
-
HealthierFruit – Teaching a Phone to Smell Ripeness Through Pixels Core Stack
Spectral reflection pipeline: CV converts RGB → CIELab → isolates chroma channel at 570–590 nm (chlorophyll-breakdown sweet-spot). Regression head predicts °Brix (sugar) within ±0.…
-
AIASMRFree: Two-Minute Videos That Melted My 3-A.M. Anxiety Spiral
I type ‘gentle ice cutting on a winter porch,’ and by the time I’ve brushed my teeth there’s a custom 1080p video humming me to sleep.
-
AIASMRFree – Engineering Two-Minute Tranquillity with VEO3 Pipeline Brief
Text encoder: fine-tuned T5-small → 77-token prompt embedding. Video decoder: diffusion-based latent model (3D U-Net) operating at 64×64×16 latent space → up-sampled to 1280×720×30…
-
TransGoing: The Pocket Polyglot That Made Me a Braver Traveler
Real-time voice, camera and local-guide AI—basically Star Trek’s Universal Translator with a passport stamp.
-
ProbFace: Where Hollywood-Grade Face-Swap Meets One-Click Simplicity
I swapped myself into a movie trailer over lunch break—and the render looked cinema-ready.
-
Building a Death-Cap Detector – The Engineering Diary Behind MushroomCheck Architectural TL;DR
Edge-first CNN (EfficientNet-B3) quantized to 8-bit INT → 3.2 MB on-device model. Federated learning loop every 14 days → new species weights without centralising user photos. Safe…
-
Memories Reborn: How ColorsFix Turned My Grandma’s Faded Photo into a Living Video
cratches erased, colours awakened, and for five ethereal seconds she smiled at me—78 years after her wedding day.
-
QiblaSalat: The Quiet App That Keeps My Faith on Course
From Jakarta to Johannesburg, one swipe points my heart to the Kaaba—no data, no drama.
-
The Grocery Genius in My Pocket: A Busy Dad Reviews HealthierFruit
It tells me which peach my toddler can bite, when it’ll go mush, and how much money I’ll save—before I even reach the cashier.
-
From Forest Fear to Fungi Fearless: How One App Changed My Wild-Mushroom Life
One photo, three seconds, zero hospital trips—MushroomCheck turned mycophobia into mycophilia.