No. of Recommendations: 4
For automation
- WSH can be good for automating native windows applications. It has been a while since I've used either, but I thought Powershell was the direction Microsoft was leaning for the future.
- BeautifulSoup can be good for automating websites and browsers. If native browsers are needed, Playwright and Selenium, which interact with browsers through management protocols. So something like "If a div element with specific test is present and visible, issue a scroll down event" is easily expressible.
- For more exotic cases, I've used to used Sikuli (
https://www.sikuli.org/) which was based on OpenCV.
The latter one will use computer vision techniques capture a screenshot and recognize elements you want to interact with visually, and determine their screen coordinates dynamically. It is a little more robust for differing screen geometries or cases where simply sending an event at a particular screen coordinate is difficult for timing or precision reasons.
It has been more than a decade since I've used Sikuli and I assume better options are now available, but it represents a powerful and generic automation technique.