Tag: python

#27 Web Scraping: Fetching the 20 Latest News Items from a News Site with Selenium

Post author By Kittisak Chotikkakamthorn
Post date May 13, 2024

Normally, to fetch the latest news we can simply use RSS (Really Simple Syndication) or Feedly. However, not every website supports this approach, possibly because the site does not provide an RSS link.

To work around this problem, this article introduces one commonly used method: web scraping.

Tags Archiving, coding, ELT, ETL, Feedlyt, news, programming, python, rss, Selenium, Thai news, voicetv, web scraping, วอยซ์ทีวี, สนข, สำนักข่าว, โค้ดดิ้ง

English Articles

A* Search Algorithm and the maze traversal

Post author By Kittisak Chotikkakamthorn
Post date March 10, 2024

After introducing graphs and shortest path techniques like Dijkstra’s and Bellman-Ford’s algorithms, this article will introduce the following method: A* Search Algorithm.
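As a rough illustration of the idea, here is a minimal sketch of A* on a small grid maze. It is not the code from the article: the grid encoding (0 = open cell, 1 = wall), the four-direction movement, the unit step cost, and the Manhattan-distance heuristic are all assumptions chosen for the example, and `a_star` and `maze` are hypothetical names.

```python
import heapq


def a_star(grid, start, goal):
    """Return a shortest path as a list of (row, col) cells, or None if unreachable."""

    def heuristic(cell):
        # Manhattan distance: admissible for 4-direction moves with unit cost.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    # Heap entries are (f = g + h, g, cell); g is the cost from start so far.
    open_heap = [(heuristic(start), 0, start)]
    came_from = {start: None}
    best_g = {start: 0}

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell == goal:
            # Walk the parent links back to the start to rebuild the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = came_from[cell]
            return path[::-1]
        if g > best_g[cell]:
            continue  # stale heap entry; a cheaper route was already found
        row, col = cell
        for nxt in ((row + 1, col), (row - 1, col), (row, col + 1), (row, col - 1)):
            r, c = nxt
            if 0 <= r < len(grid) and 0 <= c < len(grid[0]) and grid[r][c] == 0:
                new_g = g + 1
                if new_g < best_g.get(nxt, float("inf")):
                    best_g[nxt] = new_g
                    came_from[nxt] = cell
                    heapq.heappush(open_heap, (new_g + heuristic(nxt), new_g, nxt))
    return None


# Hypothetical 4x4 maze: 0 is an open cell, 1 is a wall.
maze = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 0, 0, 0],
    [0, 1, 1, 0],
]
path = a_star(maze, (0, 0), (3, 3))
print(path)
```

The heuristic is what separates A* from Dijkstra's algorithm: with a heuristic that always returns 0, the same loop degenerates into Dijkstra's.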

Tags A star, A* search, Algorithms, Data Structures, Graph, Graph Traversal, Maze, Maze solving, path finding, programming, python, Shortest Path

Computer

#20 – Graph and Shortest Path Algorithms

Post author By Kittisak Chotikkakamthorn
Post date March 6, 2024

Shortest path algorithms find the path between a start node and an end node in a graph that gives the lowest total edge weight.

For English, please follow this article on Medium.
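To make the definition concrete, here is a minimal sketch of Dijkstra's algorithm, one of the techniques listed in the tags. It assumes non-negative edge weights and an adjacency-list graph; the `dijkstra` function and the four-node example graph are hypothetical and not taken from the article.

```python
import heapq


def dijkstra(graph, source):
    """Return the lowest total edge weight from source to every node.

    graph maps each node to a list of (neighbor, weight) pairs;
    weights are assumed to be non-negative.
    """
    dist = {node: float("inf") for node in graph}
    dist[source] = 0
    heap = [(0, source)]
    while heap:
        d, node = heapq.heappop(heap)
        if d > dist[node]:
            continue  # stale entry; a shorter distance was already recorded
        for neighbor, weight in graph[node]:
            new_dist = d + weight
            if new_dist < dist[neighbor]:
                dist[neighbor] = new_dist
                heapq.heappush(heap, (new_dist, neighbor))
    return dist


# Hypothetical directed graph used only for illustration.
graph = {
    "A": [("B", 4), ("C", 1)],
    "B": [("D", 1)],
    "C": [("B", 2), ("D", 5)],
    "D": [],
}
distances = dijkstra(graph, "A")
print(distances)
```

In this example the direct edge A to B costs 4, but going through C costs 1 + 2 = 3, so the shorter route wins. Graphs with negative edge weights would need Bellman-Ford instead.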

Tags Algorithms, Bellman-Ford, computer, Data Structures, diary, Dijkstra, Edge, Graph, javascript, Path, programming, python, Shortest, Vertex, คอมพิวเตอร์, จาวาสคริป, เขียนโปรแกรม, โค้ดดิ้ง, ไพทอน

Computer Data

#19 Big-O, Search & Sort Used in Data Work

Post author By Kittisak Chotikkakamthorn
Post date February 28, 2024

An English version of the first part of this post, on Big-O notation, is available here.

The other day I read a Medium article about which Data Structures & Algorithms (DSA) techniques a Data Engineer needs to know. The course I took on DataTH (and courses elsewhere) also mentioned briefly that this topic is necessary knowledge to build on top of the course material.

I then noticed that the code I write runs, but its performance is not particularly good. Knowing DSA helps in writing more efficient code, so this article summarizes and shares the topic.

Tags Asymptotic Analysis, Big O, Bubble Sort, coding, data, Fibonacci Search, Heap, Heap Sort, Linear Search, Merge sort, programming, python, Quick Sort, Searching, Selection Sort, Sorting, คอมพิวเตอร์, โค้ดดิ้ง, ไพทอน

Computer Data

#15 Converting Data from Files into Structured Data

Post author By Kittisak Chotikkakamthorn
Post date February 2, 2024

The English version is available on Medium.

Unstructured data has no fixed structure of the kind found in structured and semi-structured data. Examples include files, images, video, and audio.

Computer

#12 – Getting to Know Apple MLX and Coding Linear Regression

Post author By Kittisak Chotikkakamthorn
Post date January 17, 2024

Apple MLX is a machine learning library developed by the Apple Machine Learning Research team and designed specifically for Apple Silicon (chips such as the M2 and M3). Its notable features include:

Tags apple, Apple Silicon, artificial intelligence, coding, machine learning, numpy, python, PyTorch, การเรียนรู้ของเครื่อง, ปัญญาประดิษฐ์, แอปเปิล, ไพทอน, ไลบรารี

Computer

#11 – Linear Regression by Hand

Post author By Kittisak Chotikkakamthorn
Post date January 14, 2024

Linear regression models a linear relationship between the value we want to predict and the variables used to compute it. It is a long-established technique, an easy-to-understand mathematical model, and widely applicable, from education to business.
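As a small illustration, the sketch below fits a straight line to one input variable using the closed-form least-squares formulas, with no libraries. This is an assumption about what "by hand" could look like, not the article's actual code; `fit_line` and the sample data are hypothetical.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # slope = covariance(x, y) / variance(x)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    # The fitted line always passes through the point (mean_x, mean_y).
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data generated from y = 2x + 1 with no noise.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)


def predict(x):
    # Use the fitted parameters to predict a new value.
    return slope * x + intercept
```

With noisy real data the recovered slope and intercept would only approximate the underlying relationship, and with several input variables the same idea generalizes to the normal equations or gradient descent.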

Tags artificial intelligence, coding, computer, linear, python, regression, scikit-learn, statistics, supervised learning, การถดถอยเชิงเส้น, ปัญญาประดิษฐ์, สถิติ, โค้ดดิ้ง, ไพทอน

Computer Data

Building a Synthetic Head Pose Dataset for Training AI

Post author By Kittisak Chotikkakamthorn
Post date January 22, 2023

When training a deep learning AI model, one thing the model needs is a large training dataset. One problem, however, is that a sufficiently large dataset may not exist.

The usual approach is to go out and collect more data and prepare the ground truth for training, which is fine. However, another option is to synthesize the dataset (that is, to build a synthetic dataset).

Tags 300W_LP, 3ddfa, ai, artificial intelligence, Augmentation, coding, Dataset, deep learning, Face alignment, face detection, Facial Landmark, fan, head pose estimation, python, RetinaFace, synthesis

Computer Data

How to Do Object Detection with Nanodet

Post author By Kittisak Chotikkakamthorn
Post date December 25, 2022

Object detection is the process of using AI to locate specified objects in an image, such as people, cars, and bicycles. The result is shown as rectangular bounding boxes, together with the class of each detected object.

Tags ai, artificial intelligence, COCO, coding, computer, nanodet, object detection, programming, python, PyTorch, training, คอมพิวเตอร์, โค้ดดิ้ง

Computer Data

Speeding Up ONNX Models with Static Quantization

Post author By Kittisak Chotikkakamthorn
Post date November 24, 2022

After we finish training a model, which can take anywhere from several hours to several days, we need to deploy it on a server or a small embedded device for real use. However, the model is large and demands a lot of compute, so what technique can help?

A fitting answer to this problem is quantization.
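To show the arithmetic behind the float32 and int8 tags, here is a minimal sketch of affine quantization in plain Python. It is not the ONNX Runtime API and not the article's code. It assumes that "static" means the scale and zero point are fixed ahead of time from a calibration sample; `quant_params`, `quantize`, `dequantize`, and the sample values are all hypothetical.

```python
def quant_params(calibration, qmin=-128, qmax=127):
    """Derive a fixed scale and zero point from calibration values."""
    # Include 0.0 in the range so that zero is exactly representable.
    lo = min(min(calibration), 0.0)
    hi = max(max(calibration), 0.0)
    scale = (hi - lo) / (qmax - qmin) or 1.0  # avoid a zero scale
    zero_point = round(qmin - lo / scale)
    return scale, zero_point


def quantize(values, scale, zero_point, qmin=-128, qmax=127):
    """Map floats to int8 codes, clamping anything outside the range."""
    return [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]


def dequantize(codes, scale, zero_point):
    """Map int8 codes back to approximate float values."""
    return [(q - zero_point) * scale for q in codes]


# Hypothetical calibration sample standing in for real activations.
calibration = [-1.0, 0.0, 0.5, 2.0]
scale, zero_point = quant_params(calibration)
codes = quantize(calibration, scale, zero_point)
restored = dequantize(codes, scale, zero_point)
print(scale, zero_point, codes, restored)
```

Each value is stored in one byte instead of four, at the cost of a rounding error of roughly half the scale per value. Values outside the calibrated range are clamped, which is why the calibration data should be representative.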

Tags coding, computer, float, float32, int8, onnx, programming, python, quantization, static quantization, คอมพิวเตอร์, เขียนโปรแกรม, โค้ดดิ้ง