{"id":2569,"date":"2025-02-18T19:29:14","date_gmt":"2025-02-18T12:29:14","guid":{"rendered":"https:\/\/mintea.blog\/?p=2569"},"modified":"2025-03-05T10:23:55","modified_gmt":"2025-03-05T03:23:55","slug":"2569","status":"publish","type":"post","link":"https:\/\/mintea.blog\/?p=2569","title":{"rendered":"Bridging the GAP: Business Expert meets ML @UEH_Sharing_2025-Feb-19"},"content":{"rendered":"<p>I recently had the chance to speak with MBA students at the University of Economics about the vital role of business experts in data and machine learning applications. Rather than focusing on technical details, I emphasized how their industry knowledge and strategic thinking can drive successful ML projects. Through a case study on a risk manager using ML for a scorecard project, I illustrated how business professionals can leverage data-driven insights without needing deep technical expertise. In this blog, I\u2019m sharing my slides and key insights to help others understand how to apply ML effectively in a business context.<\/p>\n<a href=\"https:\/\/mintea.blog\/wp-content\/uploads\/2025\/02\/2025-02-19_UEH-ML-Banking-1.pdf\" class=\"pdfemb-viewer\" style=\"\" data-width=\"max\" data-height=\"max\"  data-toolbar=\"bottom\" data-toolbar-fixed=\"off\">_2025-02-19_UEH (ML-Banking)<br\/><\/a>\n<p><strong>Q&amp;A:<\/strong><\/p>\n<p><span style=\"color: #ff0000;\">#1 &#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<\/span><br \/>\n<strong>Student asked<\/strong>: <span style=\"color: #0000ff;\">&#8220;I&#8217;m working in a fintech company, offering stock trading, what is the variables of the ML model?&#8221;<\/span><\/p>\n<p><strong>Tri answered:<\/strong><br \/>\n<span style=\"color: #0000ff;\">First, the question should be reframed. Instead of asking, \u201cWhat are the variables of the model for a fintech company?\u201d, you should ask:\ud83d\udc49 \u201cI want to protect and grow the business. How can I apply Machine Learning?\u201d<\/span><\/p>\n<p><span style=\"color: #0000ff;\"><strong>Step 1<\/strong>: Understand Your Business Portfolio &amp; Revenue Sources<\/span><br \/>\n<span style=\"color: #0000ff;\">Before jumping into ML, you need to analyze your business fundamentals:<\/span><br \/>\n<span style=\"color: #0000ff;\">. What are the sources of income? (e.g., trading fees, spreads, subscriptions, margin lending)<\/span><br \/>\n<span style=\"color: #0000ff;\">. Who are your customers? (e.g., retail traders, institutional investors, high-frequency traders)<\/span><br \/>\n<span style=\"color: #0000ff;\">. How do they behave? (e.g., trade volume, retention rate, churn rate, deposit patterns)<\/span><\/p>\n<p><span style=\"color: #0000ff;\"><strong>Step 2<\/strong>: Analyze Data Across Multiple Dimensions<\/span><br \/>\n<span style=\"color: #0000ff;\">Look at historical data from different perspectives:<\/span><br \/>\n<span style=\"color: #0000ff;\">. By stock: Which stocks drive the most trading activity?<\/span><br \/>\n<span style=\"color: #0000ff;\">. By customer segment: Are high-frequency traders more profitable than long-term investors?<\/span><br \/>\n<span style=\"color: #0000ff;\">. By demographic: Are younger users trading more actively than older ones?<\/span><br \/>\n<span style=\"color: #0000ff;\">. By trend: Is there a slowdown in a particular segment? Are certain products underperforming?<\/span><\/p>\n<p><span style=\"color: #0000ff;\"><strong>Step 3<\/strong>: Identify Key Business Challenges &amp; Make a Case for ML<\/span><br \/>\n<span style=\"color: #0000ff;\">Once you detect issues like a decline in trading volume, an increase in churn, or low engagement in certain segments, you can start building a business case:<\/span><br \/>\n<span style=\"color: #0000ff;\">. Why do you need ML?<\/span><br \/>\n<span style=\"color: #0000ff;\">. What problem will it solve?<\/span><br \/>\n<span style=\"color: #0000ff;\">. How will it increase revenue or reduce risk?<\/span><\/p>\n<p><span style=\"color: #0000ff;\"><strong>Step 4<\/strong>: Discuss with Management &amp; Defend Your Idea<\/span><br \/>\n<span style=\"color: #0000ff;\">. Present your findings and explain how ML can be a strategic tool:<\/span><br \/>\n<span style=\"color: #0000ff;\">. For retention: ML models can predict which users are likely to stop trading and suggest personalized offers.<\/span><br \/>\n<span style=\"color: #0000ff;\">. For growth: AI-driven recommendations can increase trading activity.<\/span><br \/>\n<span style=\"color: #0000ff;\">. For risk management: Fraud detection models can prevent financial losses.<\/span><\/p>\n<p><span style=\"color: #0000ff;\"><strong>Step 5<\/strong>: Dive into ML Research (or Ask ChatGPT!)<\/span><br \/>\n<span style=\"color: #0000ff;\">At this point, you can explore relevant ML models:<\/span><br \/>\n<span style=\"color: #0000ff;\">Read industry papers, case studies, or competitor strategies.<\/span><br \/>\n<span style=\"color: #0000ff;\">Use AI (like ChatGPT) to get insights on which ML models suit your business needs.<br \/>\nOnboard vendors (3 quotations from 3 vendors \/ of your in-house modelling team) then you can learn of their proposals.\u00a0<\/span><br \/>\n<span style=\"color: #0000ff;\">\ud83d\udca1 Takeaway: The right approach is business-first, data-driven, and ML-enabled\u2014not just focusing on model variables but understanding the real business impact of ML. \ud83d\ude80<\/span><\/p>\n<p><em><strong>Now, let&#8217;s check with ChatGPT further regarding the model:<span style=\"text-decoration: underline;\"><br \/>\nPrompt:<\/span><br \/>\n<\/strong><\/em>I&#8217;m working in a fintech company offering stock trading service to customer with flexible trading amount and volume,<br \/>\n1. what is the main source of revenue for my business model?<br \/>\n2. suggest me how to apply Machine Learning to grow the business, retain customer with the detail case study.<\/p>\n<p><span style=\"text-decoration: underline;\"><em><strong>ChatGPT:<br \/>\n<\/strong><\/em><\/span>1. Main Sources of Revenue for Your Fintech Stock Trading Business<br \/>\nYour company likely generates revenue from several key streams:<\/p>\n<ul data-start=\"138\" data-end=\"1169\">\n<li data-start=\"138\" data-end=\"310\"><strong data-start=\"140\" data-end=\"170\">Trading Commissions &amp; Fees<\/strong>: Charging customers a fee per trade (fixed or percentage-based). Some platforms offer commission-free trading and monetize in other ways.<\/li>\n<li data-start=\"311\" data-end=\"429\"><strong data-start=\"313\" data-end=\"331\">Spread Revenue<\/strong>: Making money on the bid-ask spread, particularly in market-making or fractional share trading.<\/li>\n<li data-start=\"430\" data-end=\"563\"><strong data-start=\"432\" data-end=\"468\">Subscription or Premium Services<\/strong>: Offering advanced tools, analytics, research, or priority support for a monthly\/annual fee.<\/li>\n<li data-start=\"564\" data-end=\"680\"><strong data-start=\"566\" data-end=\"602\">Margin Lending &amp; Interest Income<\/strong>: Earning interest on margin loans when customers trade with borrowed money.<\/li>\n<li data-start=\"681\" data-end=\"793\"><strong data-start=\"683\" data-end=\"716\">Payment for Order Flow (PFOF)<\/strong>: Earning fees from market makers or exchanges for routing customer orders.<\/li>\n<li data-start=\"794\" data-end=\"929\"><strong data-start=\"796\" data-end=\"821\">Asset Management Fees<\/strong>: If offering robo-advisory or managed portfolios, charging a percentage of assets under management (AUM).<\/li>\n<li data-start=\"930\" data-end=\"1034\"><strong data-start=\"932\" data-end=\"957\">Stock Lending Revenue<\/strong>: Lending out customers&#8217; stocks to short sellers and earning a lending fee.<\/li>\n<li data-start=\"1035\" data-end=\"1169\"><strong data-start=\"1037\" data-end=\"1070\">Partnerships &amp; Advertisements<\/strong>: Monetizing through partnerships with financial service providers, ETFs, or other fintech firms.<\/li>\n<\/ul>\n<p data-start=\"1171\" data-end=\"1245\">2. Applying Machine Learning to Grow the Business &amp; Retain Customers<\/p>\n<p data-start=\"1247\" data-end=\"1311\"><strong data-start=\"1252\" data-end=\"1309\">Case Study: Personalized Trading &amp; Retention Strategy<\/strong><\/p>\n<p data-start=\"1313\" data-end=\"1342\"><strong data-start=\"1319\" data-end=\"1340\">Problem Statement<\/strong><\/p>\n<p data-start=\"1343\" data-end=\"1536\">Your platform has a large customer base, but retaining users and keeping them engaged is a challenge. Many users sign up but do not actively trade, while others stop after a few transactions.<\/p>\n<p data-start=\"1538\" data-end=\"1578\"><strong data-start=\"1544\" data-end=\"1576\">Machine Learning Application<\/strong><\/p>\n<p data-start=\"1580\" data-end=\"1634\"><strong data-start=\"1580\" data-end=\"1632\">Step 1: Customer Segmentation &amp; Churn Prediction<\/strong><\/p>\n<ul data-start=\"1635\" data-end=\"2064\">\n<li data-start=\"1635\" data-end=\"1838\"><strong data-start=\"1637\" data-end=\"1647\">Model:<\/strong> Use clustering (e.g., K-Means, DBSCAN) or supervised learning (e.g., Random Forest, XGBoost) to segment traders into different categories\u2014active traders, casual traders, and dormant users.<\/li>\n<li data-start=\"1839\" data-end=\"1946\"><strong data-start=\"1841\" data-end=\"1854\">Features:<\/strong> Trading frequency, portfolio size, deposit patterns, risk tolerance, past churn behavior.<\/li>\n<li data-start=\"1947\" data-end=\"2064\"><strong data-start=\"1949\" data-end=\"1961\">Outcome:<\/strong> Identify users likely to stop trading in the next 30 days and target them with retention strategies.<\/li>\n<\/ul>\n<p data-start=\"2066\" data-end=\"2119\"><strong data-start=\"2066\" data-end=\"2117\">Step 2: Personalized Investment Recommendations<\/strong><\/p>\n<ul data-start=\"2120\" data-end=\"2450\">\n<li data-start=\"2120\" data-end=\"2259\"><strong data-start=\"2122\" data-end=\"2132\">Model:<\/strong> Use collaborative filtering (like in recommendation systems) to suggest stocks or ETFs based on users with similar profiles.<\/li>\n<li data-start=\"2260\" data-end=\"2359\"><strong data-start=\"2262\" data-end=\"2275\">Features:<\/strong> Past trades, industry preferences, news sentiment analysis, technical indicators.<\/li>\n<li data-start=\"2360\" data-end=\"2450\"><strong data-start=\"2362\" data-end=\"2374\">Outcome:<\/strong> Increase engagement by recommending stocks aligned with user preferences.<\/li>\n<\/ul>\n<p data-start=\"2452\" data-end=\"2494\"><strong data-start=\"2452\" data-end=\"2492\">Step 3: AI-Powered Trading Assistant<\/strong><\/p>\n<ul data-start=\"2495\" data-end=\"2785\">\n<li data-start=\"2495\" data-end=\"2623\"><strong data-start=\"2497\" data-end=\"2507\">Model:<\/strong> NLP models (e.g., ChatGPT-like models) to provide trade insights, explain market trends, and answer user queries.<\/li>\n<li data-start=\"2624\" data-end=\"2696\"><strong data-start=\"2626\" data-end=\"2639\">Features:<\/strong> Real-time market data, user queries, trading patterns.<\/li>\n<li data-start=\"2697\" data-end=\"2785\"><strong data-start=\"2699\" data-end=\"2711\">Outcome:<\/strong> Boost engagement by providing intelligent market insights via chatbots.<\/li>\n<\/ul>\n<p data-start=\"2787\" data-end=\"2826\"><strong data-start=\"2787\" data-end=\"2824\">Step 4: Fraud &amp; Anomaly Detection<\/strong><\/p>\n<ul data-start=\"2827\" data-end=\"3062\">\n<li data-start=\"2827\" data-end=\"2975\"><strong data-start=\"2829\" data-end=\"2839\">Model:<\/strong> Anomaly detection algorithms (Isolation Forest, Autoencoders) to detect unusual trading behavior, insider trading, or bot activities.<\/li>\n<li data-start=\"2976\" data-end=\"3062\"><strong data-start=\"2978\" data-end=\"2990\">Outcome:<\/strong> Improve platform security and compliance, maintaining customer trust.<\/li>\n<\/ul>\n<p data-start=\"3064\" data-end=\"3109\"><strong data-start=\"3064\" data-end=\"3107\">Step 5: Predictive Portfolio Management<\/strong><\/p>\n<ul data-start=\"3110\" data-end=\"3330\">\n<li data-start=\"3110\" data-end=\"3227\"><strong data-start=\"3112\" data-end=\"3122\">Model:<\/strong> Reinforcement learning to optimize portfolio allocation based on market trends and user risk profiles.<\/li>\n<li data-start=\"3228\" data-end=\"3330\"><strong data-start=\"3230\" data-end=\"3242\">Outcome:<\/strong> Help users maximize returns and keep them engaged with an intelligent portfolio tool.<\/li>\n<\/ul>\n<p data-start=\"3332\" data-end=\"3359\"><strong data-start=\"3338\" data-end=\"3357\">Expected Impact<\/strong><\/p>\n<ul data-start=\"3360\" data-end=\"3731\">\n<li data-start=\"3360\" data-end=\"3445\"><strong data-start=\"3362\" data-end=\"3382\">Higher Retention<\/strong>: Personalized recommendations and alerts keep users engaged.<\/li>\n<li data-start=\"3446\" data-end=\"3530\"><strong data-start=\"3448\" data-end=\"3476\">Increased Trading Volume<\/strong>: AI-powered insights encourage users to trade more.<\/li>\n<li data-start=\"3531\" data-end=\"3620\"><strong data-start=\"3533\" data-end=\"3565\">Enhanced Customer Experience<\/strong>: AI chatbots and assistants improve service quality.<\/li>\n<li data-start=\"3621\" data-end=\"3731\"><strong data-start=\"3623\" data-end=\"3643\">Lower Churn Rate<\/strong>: Early identification of at-risk customers allows for proactive retention strategies.<\/li>\n<\/ul>\n<p data-start=\"3733\" data-end=\"3864\" data-is-last-node=\"\" data-is-only-node=\"\">Would you like me to refine any of these ideas further, such as building a proof of concept or detailing an implementation plan? \ud83d\ude80<\/p>\n<p><span style=\"text-decoration: underline;\"><em><strong> Prompt:<br \/>\n<\/strong><\/em><\/span>How these models, at the end, contribute to the revenue?<\/p>\n<p><span style=\"text-decoration: underline;\"><em><strong>ChatGPT:<br \/>\n<\/strong><\/em><\/span><\/p>\n<p>Great question! Let&#8217;s directly connect each machine learning (ML) model to revenue generation.<\/p>\n<p data-start=\"98\" data-end=\"190\"><strong data-start=\"102\" data-end=\"188\">1. Customer Segmentation &amp; Churn Prediction \u2192 Higher Retention \u2192 More Trading Fees<\/strong><\/p>\n<ul data-start=\"191\" data-end=\"504\">\n<li data-start=\"191\" data-end=\"350\"><strong data-start=\"193\" data-end=\"209\">How it Works<\/strong>: Identifies users likely to stop trading and targets them with personalized incentives (e.g., discounts, exclusive insights, free trades).<\/li>\n<li data-start=\"351\" data-end=\"504\"><strong data-start=\"353\" data-end=\"371\">Revenue Impact<\/strong>: Reduces churn, keeping customers actively trading and generating <strong data-start=\"438\" data-end=\"491\">more commissions, spreads, and order flow revenue<\/strong> over time.<\/li>\n<\/ul>\n<p data-start=\"506\" data-end=\"612\"><strong data-start=\"510\" data-end=\"610\">2. Personalized Investment Recommendations \u2192 Increased Trading Volume \u2192 More Transaction Revenue<\/strong><\/p>\n<ul data-start=\"613\" data-end=\"876\">\n<li data-start=\"613\" data-end=\"726\"><strong data-start=\"615\" data-end=\"631\">How it Works<\/strong>: Recommends stocks, ETFs, or trading strategies based on user preferences and market trends.<\/li>\n<li data-start=\"727\" data-end=\"876\"><strong data-start=\"729\" data-end=\"747\">Revenue Impact<\/strong>: Encourages users to trade more often, increasing <strong data-start=\"798\" data-end=\"873\">trading commissions, spreads, and payment for order flow (PFOF) revenue<\/strong>.<\/li>\n<\/ul>\n<p data-start=\"878\" data-end=\"990\"><strong data-start=\"882\" data-end=\"988\">3. AI-Powered Trading Assistant \u2192 Higher Engagement &amp; Upselling \u2192 Subscription &amp; Margin Lending Growth<\/strong><\/p>\n<ul data-start=\"991\" data-end=\"1309\">\n<li data-start=\"991\" data-end=\"1103\"><strong data-start=\"993\" data-end=\"1009\">How it Works<\/strong>: Provides real-time market insights, news, and trading tips via chatbot or voice assistant.<\/li>\n<li data-start=\"1104\" data-end=\"1309\"><strong data-start=\"1106\" data-end=\"1124\">Revenue Impact<\/strong>: Keeps users engaged, leading them to <strong data-start=\"1163\" data-end=\"1196\">subscribe to premium services<\/strong> (e.g., advanced research, AI-powered alerts) and <strong data-start=\"1246\" data-end=\"1268\">use margin lending<\/strong>, which generates <strong data-start=\"1286\" data-end=\"1306\">interest revenue<\/strong>.<\/li>\n<\/ul>\n<p data-start=\"1311\" data-end=\"1394\"><strong data-start=\"1315\" data-end=\"1392\">4. Fraud &amp; Anomaly Detection \u2192 Risk Mitigation \u2192 Protects Revenue &amp; Trust<\/strong><\/p>\n<ul data-start=\"1395\" data-end=\"1691\">\n<li data-start=\"1395\" data-end=\"1511\"><strong data-start=\"1397\" data-end=\"1413\">How it Works<\/strong>: Detects insider trading, bot activity, and fraudulent transactions, reducing compliance risks.<\/li>\n<li data-start=\"1512\" data-end=\"1691\"><strong data-start=\"1514\" data-end=\"1532\">Revenue Impact<\/strong>: Prevents financial losses and regulatory fines, ensuring smooth business operations and maintaining <strong data-start=\"1634\" data-end=\"1652\">customer trust<\/strong>, which indirectly increases revenue.<\/li>\n<\/ul>\n<p data-start=\"1693\" data-end=\"1796\"><strong data-start=\"1697\" data-end=\"1794\">5. Predictive Portfolio Management \u2192 Better Performance \u2192 Asset Management &amp; Advisory Revenue<\/strong><\/p>\n<ul data-start=\"1797\" data-end=\"2079\">\n<li data-start=\"1797\" data-end=\"1918\"><strong data-start=\"1799\" data-end=\"1815\">How it Works<\/strong>: Uses reinforcement learning to optimize portfolios, helping users make better investment decisions.<\/li>\n<li data-start=\"1919\" data-end=\"2079\"><strong data-start=\"1921\" data-end=\"1939\">Revenue Impact<\/strong>: Increases AUM (Assets Under Management), generating <strong data-start=\"1993\" data-end=\"2043\">higher advisory fees and premium subscriptions<\/strong> for automated portfolio services.<\/li>\n<\/ul>\n<hr data-start=\"2081\" data-end=\"2084\" \/>\n<p><strong data-start=\"2090\" data-end=\"2141\">Summary: How ML Directly Contributes to Revenue<\/strong><\/p>\n<table data-start=\"2142\" data-end=\"2648\">\n<thead data-start=\"2142\" data-end=\"2185\">\n<tr data-start=\"2142\" data-end=\"2185\">\n<th data-start=\"2142\" data-end=\"2167\">Machine Learning Model<\/th>\n<th data-start=\"2167\" data-end=\"2185\">Revenue Impact<\/th>\n<\/tr>\n<\/thead>\n<tbody data-start=\"2231\" data-end=\"2648\">\n<tr data-start=\"2231\" data-end=\"2299\">\n<td><strong data-start=\"2233\" data-end=\"2253\">Churn Prediction<\/strong><\/td>\n<td>More retained traders = More trading fees<\/td>\n<\/tr>\n<tr data-start=\"2300\" data-end=\"2387\">\n<td><strong data-start=\"2302\" data-end=\"2332\">Investment Recommendations<\/strong><\/td>\n<td>Higher trading volume = More commissions &amp; spreads<\/td>\n<\/tr>\n<tr data-start=\"2388\" data-end=\"2480\">\n<td><strong data-start=\"2390\" data-end=\"2414\">AI Trading Assistant<\/strong><\/td>\n<td>More engagement = More premium subscriptions &amp; margin lending<\/td>\n<\/tr>\n<tr data-start=\"2481\" data-end=\"2570\">\n<td><strong data-start=\"2483\" data-end=\"2502\">Fraud Detection<\/strong><\/td>\n<td>Prevents financial loss = Maintains customer trust &amp; compliance<\/td>\n<\/tr>\n<tr data-start=\"2571\" data-end=\"2648\">\n<td><strong data-start=\"2573\" data-end=\"2599\">Portfolio Optimization<\/strong><\/td>\n<td>Higher AUM = More advisory &amp; management fees<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>By implementing these ML strategies, you can <strong data-start=\"2695\" data-end=\"2793\">increase customer lifetime value (CLV), boost trading activity, and unlock new revenue streams<\/strong>. \ud83d\ude80<\/p>\n<p>Would you like a deep dive into a specific model or an implementation plan?<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"color: #ff0000;\">#2 &#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<\/span><\/p>\n<p><em><strong>Student Asked: <\/strong><\/em><span style=\"color: #0000ff;\"><em>Should I read a paper, and use that as a base to develop my model at my business, e.g. use the same set of variable, hypothesis, etc.?<\/em><\/span><\/p>\n<p><em><strong>Tri Answered:<\/strong> <\/em><span style=\"color: #0000ff;\">Not really, you should start from business insight, paper can be used as reference source, but not the golden source where you can get the variable.<\/span><\/p>\n<p><em><span style=\"text-decoration: underline;\"><strong>Check with ChatGPT:<\/strong><\/span><\/em><\/p>\n<p data-start=\"186\" data-end=\"232\"><strong data-start=\"190\" data-end=\"232\">1. Define the Business Problem Clearly<\/strong><\/p>\n<p data-start=\"233\" data-end=\"293\">Before choosing independent variables, you must be clear on:<\/p>\n<ul data-start=\"294\" data-end=\"776\">\n<li data-start=\"294\" data-end=\"467\"><strong data-start=\"296\" data-end=\"375\">What business outcome (dependent variable) you want to predict or optimize.<\/strong>\n<ul data-start=\"380\" data-end=\"467\">\n<li data-start=\"380\" data-end=\"467\">Example: Customer default probability, trading volume prediction, or fraud detection.<\/li>\n<\/ul>\n<\/li>\n<li data-start=\"468\" data-end=\"601\"><strong data-start=\"470\" data-end=\"512\">How this outcome impacts the business.<\/strong>\n<ul data-start=\"517\" data-end=\"601\">\n<li data-start=\"517\" data-end=\"601\">Example: If predicting customer churn, reducing it can increase retention revenue.<\/li>\n<\/ul>\n<\/li>\n<li data-start=\"602\" data-end=\"776\"><strong data-start=\"604\" data-end=\"676\">What actions the business can take based on the model\u2019s predictions.<\/strong>\n<ul data-start=\"681\" data-end=\"776\">\n<li data-start=\"681\" data-end=\"776\">Example: If predicting trading volume, can you design better promotions to increase activity?<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<hr data-start=\"778\" data-end=\"781\" \/>\n<p data-start=\"783\" data-end=\"837\"><strong data-start=\"787\" data-end=\"837\">2. Brainstorm Potential Drivers of the Outcome<\/strong><\/p>\n<p data-start=\"838\" data-end=\"1079\">Since you are working in fintech with stock trading services, consider what factors logically influence the outcome you&#8217;re predicting. Use <strong data-start=\"977\" data-end=\"997\">domain knowledge<\/strong> and <strong data-start=\"1002\" data-end=\"1032\">hypothesis-driven thinking<\/strong> to brainstorm potential independent variables.<\/p>\n<p data-start=\"1081\" data-end=\"1120\">Example: Predicting Trading Volume<\/p>\n<p data-start=\"1121\" data-end=\"1152\">Possible independent variables:<\/p>\n<ul data-start=\"1153\" data-end=\"1456\">\n<li data-start=\"1153\" data-end=\"1234\"><strong data-start=\"1155\" data-end=\"1172\">User behavior<\/strong>: Number of past trades, frequency of logins, deposit amounts.<\/li>\n<li data-start=\"1235\" data-end=\"1313\"><strong data-start=\"1237\" data-end=\"1255\">Market factors<\/strong>: Stock index movements, volatility (VIX), interest rates.<\/li>\n<li data-start=\"1314\" data-end=\"1388\"><strong data-start=\"1316\" data-end=\"1337\">User demographics<\/strong>: Experience level, portfolio size, risk tolerance.<\/li>\n<li data-start=\"1389\" data-end=\"1456\"><strong data-start=\"1391\" data-end=\"1409\">App engagement<\/strong>: Time spent on the platform, clickstream data.<\/li>\n<\/ul>\n<p data-start=\"1458\" data-end=\"1508\">Start broad\u2014then refine based on what\u2019s available.<\/p>\n<hr data-start=\"1510\" data-end=\"1513\" \/>\n<p data-start=\"1515\" data-end=\"1549\"><strong data-start=\"1519\" data-end=\"1549\">3. Check Data Availability<\/strong><\/p>\n<p data-start=\"1550\" data-end=\"1700\">Now, validate which of these variables you actually have access to. Sometimes, the most impactful variables are not collected or not easily available.<\/p>\n<ul data-start=\"1702\" data-end=\"1992\">\n<li data-start=\"1702\" data-end=\"1792\"><strong data-start=\"1704\" data-end=\"1721\">Internal data<\/strong>: Do you track user behavior, trade history, deposits, and withdrawals?<\/li>\n<li data-start=\"1793\" data-end=\"1882\"><strong data-start=\"1795\" data-end=\"1812\">External data<\/strong>: Can you access market trends, economic indicators, social sentiment?<\/li>\n<li data-start=\"1883\" data-end=\"1992\"><strong data-start=\"1885\" data-end=\"1908\">Feature engineering<\/strong>: Can you derive useful insights from raw data? (e.g., trend-based features, ratios)<\/li>\n<\/ul>\n<hr data-start=\"1994\" data-end=\"1997\" \/>\n<p data-start=\"1999\" data-end=\"2057\"><strong data-start=\"2003\" data-end=\"2057\">4. Perform Initial Exploratory Data Analysis (EDA)<\/strong><\/p>\n<p data-start=\"2058\" data-end=\"2132\">Once you shortlist variables, explore their relationship with the outcome:<\/p>\n<ul data-start=\"2133\" data-end=\"2371\">\n<li data-start=\"2133\" data-end=\"2213\"><strong data-start=\"2135\" data-end=\"2159\">Correlation analysis<\/strong>: Identify which variables have a strong relationship.<\/li>\n<li data-start=\"2214\" data-end=\"2297\"><strong data-start=\"2216\" data-end=\"2241\">Feature distributions<\/strong>: Ensure variables are well-distributed and informative.<\/li>\n<li data-start=\"2298\" data-end=\"2371\"><strong data-start=\"2300\" data-end=\"2324\">Missing values check<\/strong>: See if you have enough data for each feature.<\/li>\n<\/ul>\n<hr data-start=\"2373\" data-end=\"2376\" \/>\n<p data-start=\"2378\" data-end=\"2418\"><strong data-start=\"2382\" data-end=\"2418\">5. Feature Selection &amp; Iteration<\/strong><\/p>\n<p data-start=\"2419\" data-end=\"2444\">At this stage, you might:<\/p>\n<ul data-start=\"2445\" data-end=\"2686\">\n<li data-start=\"2445\" data-end=\"2501\">Use <strong data-start=\"2451\" data-end=\"2471\">domain intuition<\/strong> to remove unlikely variables.<\/li>\n<li data-start=\"2502\" data-end=\"2594\">Use <strong data-start=\"2508\" data-end=\"2529\">statistical tests<\/strong> (e.g., mutual information, chi-square tests) to select features.<\/li>\n<li data-start=\"2595\" data-end=\"2686\">Run a <strong data-start=\"2603\" data-end=\"2661\">basic model (e.g., decision tree, logistic regression)<\/strong> to get initial insights.<\/li>\n<\/ul>\n<hr data-start=\"2688\" data-end=\"2691\" \/>\n<p data-start=\"2693\" data-end=\"2714\"><strong data-start=\"2697\" data-end=\"2714\">Key Takeaways<\/strong><\/p>\n<ul data-start=\"2715\" data-end=\"2914\">\n<li data-start=\"2715\" data-end=\"2761\"><strong data-start=\"2717\" data-end=\"2761\">Start with a business problem, not data.<\/strong><\/li>\n<li data-start=\"2762\" data-end=\"2820\"><strong data-start=\"2764\" data-end=\"2820\">Brainstorm potential drivers using domain knowledge.<\/strong><\/li>\n<li data-start=\"2821\" data-end=\"2854\"><strong data-start=\"2823\" data-end=\"2854\">Validate data availability.<\/strong><\/li>\n<li data-start=\"2855\" data-end=\"2890\"><strong data-start=\"2857\" data-end=\"2890\">Perform EDA to find patterns.<\/strong><\/li>\n<li data-start=\"2891\" data-end=\"2914\"><strong data-start=\"2893\" data-end=\"2914\">Test and iterate.<\/strong><\/li>\n<\/ul>\n<p data-start=\"2916\" data-end=\"3044\" data-is-last-node=\"\" data-is-only-node=\"\">Would you like to discuss this in the context of your specific fintech business case? I can help refine the approach further! \ud83d\ude80<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I recently had the chance to speak with MBA students at the University of Economics about the vital role of business experts in data and machine learning applications. Rather than focusing on technical details, I emphasized how their industry knowledge and strategic thinking can drive successful ML projects. Through a case study on a risk &hellip; <a href=\"https:\/\/mintea.blog\/?p=2569\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Bridging the GAP: Business Expert meets ML @UEH_Sharing_2025-Feb-19<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[35],"tags":[96],"class_list":["post-2569","post","type-post","status-publish","format-standard","hentry","category-books","tag-ueh-machinelearning-analytic-data"],"_links":{"self":[{"href":"https:\/\/mintea.blog\/index.php?rest_route=\/wp\/v2\/posts\/2569","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mintea.blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mintea.blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mintea.blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mintea.blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2569"}],"version-history":[{"count":22,"href":"https:\/\/mintea.blog\/index.php?rest_route=\/wp\/v2\/posts\/2569\/revisions"}],"predecessor-version":[{"id":2638,"href":"https:\/\/mintea.blog\/index.php?rest_route=\/wp\/v2\/posts\/2569\/revisions\/2638"}],"wp:attachment":[{"href":"https:\/\/mintea.blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2569"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mintea.blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2569"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mintea.blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2569"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}