The U.S. stock market has become highly sensitive to the rapid spread of information driven by the rise of social media, where a single news article or tweet can trigger short-term volatility. To address this, the project uses a natural language processing (NLP)-based artificial intelligence model to analyze unstructured data such as news articles, blogs, and comments, thereby quantifying market participants' psychological responses.
Beyond sentiment analysis alone, the system also incorporates key market indicators, including the VIX volatility index to capture overvaluation and undervaluation, the RSI oscillator to identify overbought and oversold conditions, and moving averages to assess price trends. These elements are integrated to calculate a Market Sentiment Index, expressed as a numerical value ranging from 0 to 100.
This system is designed not only for individual investors with limited expertise in financial indicators but also for those constrained by time, providing a practical decision-support tool to better understand the prevailing psychological state of the U.S. stock market.
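To make one of the indicators above concrete, here is a minimal RSI sketch. It assumes the conventional 14-period window and Wilder's smoothing; the function name `rsi` and its signature are illustrative only, and the project's own logic in `upstock/indicators/core.py` may differ.

```python
from typing import Sequence


def rsi(closes: Sequence[float], period: int = 14) -> float:
    """Return the RSI of the last close using Wilder's smoothing (assumed method)."""
    if period < 1 or len(closes) <= period:
        raise ValueError("need more than `period` closing prices")

    # Split day-over-day price changes into gains and losses.
    deltas = [curr - prev for prev, curr in zip(closes, closes[1:])]
    gains = [max(d, 0.0) for d in deltas]
    losses = [max(-d, 0.0) for d in deltas]

    # Seed the averages with a simple mean over the first window.
    avg_gain = sum(gains[:period]) / period
    avg_loss = sum(losses[:period]) / period

    # Wilder's smoothing for the remaining observations.
    for gain, loss in zip(gains[period:], losses[period:]):
        avg_gain = (avg_gain * (period - 1) + gain) / period
        avg_loss = (avg_loss * (period - 1) + loss) / period

    if avg_loss == 0.0:
        # Flat series is treated as neutral; a series with only gains saturates at 100.
        return 50.0 if avg_gain == 0.0 else 100.0
    rs = avg_gain / avg_loss
    return 100.0 - 100.0 / (1.0 + rs)
```

By convention, readings above 70 are read as overbought and readings below 30 as oversold. How the RSI is weighted inside the Market Sentiment Index is not specified here.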
upstock-python/
├── upstock/
│   │
│   ├── builders/
│   │   └── pipeline.py       # Pipeline builder
│   │
│   ├── indicators/
│   │   ├── core.py           # Calculation logic
│   │   └── indexer.py        # Indicator indexing/management
│   │
│   ├── models/
│   │   └── artifacts.py      # Model artifact management
│   │
│   ├── nodes/
│   │   ├── predict.py        # Predict node: news prediction and market prediction
│   │   └── train.py          # Train node
│   │
│   ├── storage/
│   │   ├── downloader.py     # Data download management
│   │   └── market_data.py    # Stock data processing
│   │
│   └── config.py             # Environment configuration
│
└── main.py                   # Main entry point
| Text | Label |
|---|---|
| ANA with the conference tomorrow, I guess we will hit 10.05 after all... | 1 |
| NFX - hearing PJCO out cautious following comScore data: Following Weak December, Overall 4Q Traffic Again Turns Negative user | 0 |
| se pullback 2 initiate quick trades, not invest, long. Keep stops tight & sell up. AAP EGN OCN AMH DDD NSM | 1 |
| Piper Jaffray making negative comments on NFX and CST ed Box : comScore Q4 data weak for edbox, Netflix | 0 |
| Maybe a good time to buy TD the 44 level appears to be holding as a key support level | 1 |
| CSN option trader buys 1,500 of the Jan 11-16 call spread against low OI indicating entering a position for .50. Bet on data bef. Jan 19 | 1 |
| AA there goes the kids college fund! | 1 |
- Author(s). Dataset Title. Kaggle. Published 2025.
doi: 10.34740/kaggle/dsv/1217821
License: Data files © Original Authors
| Epoch | Loss | Accuracy | Val Loss | Val Accuracy |
|---|---|---|---|---|
| 1 | 0.6352 | 0.6516 | 0.5148 | 0.7498 |
| 2 | 0.3841 | 0.8314 | 0.4343 | 0.8007 |
| 3 | 0.1724 | 0.9359 | 0.5599 | 0.7869 |
| 4 | 0.0806 | 0.9743 | 0.7665 | 0.7826 |
| 5 | 0.0433 | 0.9853 | 0.9666 | 0.7627 |
Early stopping: best epoch is 2 (lowest validation loss, 0.4343)
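The best-epoch choice can be reproduced from the validation losses in the table. The sketch below is a plain-Python simulation of patience-based early stopping. The helper `simulate_early_stopping` and the patience value of 3 are assumptions; 3 is chosen only because it is consistent with training ending after epoch 5. The actual callback settings are not stated here.

```python
from typing import List, Tuple

# Validation losses for epochs 1-5, copied from the training log above.
VAL_LOSSES = [0.5148, 0.4343, 0.5599, 0.7665, 0.9666]


def simulate_early_stopping(val_losses: List[float], patience: int) -> Tuple[int, int]:
    """Return (best_epoch, stopped_epoch), both 1-indexed."""
    best_loss = float("inf")
    best_epoch = 0
    epochs_without_improvement = 0

    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best_loss:
            # New best validation loss: remember it and reset the counter.
            best_loss, best_epoch = loss, epoch
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return best_epoch, epoch

    # Training ran out of epochs before patience was exhausted.
    return best_epoch, len(val_losses)


best_epoch, stopped_epoch = simulate_early_stopping(VAL_LOSSES, patience=3)
print(best_epoch, stopped_epoch)
```

Training loss keeps falling after epoch 2 while validation loss rises, which is the usual sign of overfitting and the reason the epoch-2 weights are the ones worth keeping.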
| Layer (type) | Output Shape | Param # |
|---|---|---|
| InputLayer | (None, 141) | 0 |
| Embedding | (None, 141, 128) | 1,304,192 |
| Bidirectional(LSTM) | (None, 141, 128) | 98,816 |
| GlobalMaxPooling1D | (None, 128) | 0 |
| Dense | (None, 64) | 8,256 |
| Dropout | (None, 64) | 0 |
| Dense | (None, 32) | 2,080 |
| Dense | (None, 1) | 33 |
Total params: 1,413,377 (5.39 MB)
Trainable params: 1,413,377
Non-trainable params: 0
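The parameter counts in the summary can be checked by hand. The sketch below uses the standard formulas for embedding, LSTM, and dense layers. The vocabulary size of 10,189 and the 64 LSTM units per direction are not stated above: they are inferred from the summary (1,304,192 / 128, and a concatenated bidirectional output of width 128), so treat them as assumptions.

```python
# Inferred values (assumptions, not stated in the summary above).
VOCAB_SIZE = 10_189   # 1,304,192 embedding params / 128 dimensions
EMBED_DIM = 128
LSTM_UNITS = 64       # per direction; forward and backward outputs concatenated to 128


def embedding_params(vocab_size: int, dim: int) -> int:
    # One trainable vector of length `dim` per vocabulary entry.
    return vocab_size * dim


def lstm_params(input_dim: int, units: int) -> int:
    # Four gates, each with input weights, recurrent weights, and a bias.
    return 4 * (units * (input_dim + units) + units)


def dense_params(inputs: int, outputs: int) -> int:
    # Weight matrix plus one bias per output unit.
    return inputs * outputs + outputs


counts = {
    "Embedding": embedding_params(VOCAB_SIZE, EMBED_DIM),
    "Bidirectional(LSTM)": 2 * lstm_params(EMBED_DIM, LSTM_UNITS),
    "Dense(64)": dense_params(2 * LSTM_UNITS, 64),
    "Dense(32)": dense_params(64, 32),
    "Dense(1)": dense_params(32, 1),
}
total = sum(counts.values())
size_mb = total * 4 / 1024 ** 2  # float32 weights, 4 bytes each

for name, value in counts.items():
    print(f"{name:<22}{value:>12,}")
print(f"{'Total':<22}{total:>12,} ({size_mb:.2f} MB)")
```

Over 92% of the weights sit in the embedding layer, so vocabulary size is the main lever for shrinking the model.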
Prediction results are stored in two separate tables: News Sentiment and Market Sentiment Index.
SELECT text, percent, label FROM news_sentiment LIMIT 10
| text | percent | label |
|---|---|---|
| Dow jumps 400 points to record as August inflation increase likely won't derail Fed rate cut | 0.0548475 | negative |
| Dow jumps and S&P touches all-time high while Treasury yields fall as Wall Street reacts to rising inflation and jobless claims | 0.25916 | negative |
| The Fed's 2% inflation target might not be hit for years, says Janus portfolio manager - unless there's a recession | 0.295348 | negative |
| Dow, S&P 500 and Nasdaq push deeper into record territory | 0.88824 | positive |
| Stocks, Bonds Rise as Data Seal September Fed Cut: Markets Wrap | 0.813471 | positive |
| Mexico hikes China auto tariff, S. Korea warns on US investment | 0.141901 | negative |
| Stocks, Gold Set New Records in Run-Up to Fed Meet: Markets Wrap | 0.821933 | positive |
| Vietnam Urges US to Rethink Seafood Ban as Trade Talks Grind On | 0.212265 | negative |
| Stocks climb toward record closes | 0.813072 | positive |
| Shell LNG Plant Wins Place on Carney's List of Favored Projects | 0.924645 | positive |
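The `label` column in these rows is consistent with thresholding `percent` at 0.5. The sketch below shows that mapping. The cutoff and the helper name `label_from_score` are assumptions inferred from the sample rows, not confirmed project behavior.

```python
def label_from_score(score: float, threshold: float = 0.5) -> str:
    """Map a score in [0, 1] to a sentiment label (threshold is assumed)."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must lie in [0, 1]")
    return "positive" if score >= threshold else "negative"


# A few (percent, label) pairs copied from the query result above.
SAMPLE_ROWS = [
    (0.0548475, "negative"),
    (0.25916, "negative"),
    (0.88824, "positive"),
    (0.813471, "positive"),
    (0.141901, "negative"),
    (0.924645, "positive"),
]

for score, expected in SAMPLE_ROWS:
    print(f"{score:.4f} -> {label_from_score(score)} (stored: {expected})")
```

The first sample row shows the limits of headline-only scoring: a headline about the Dow reaching a record receives a score near 0.05.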
SELECT date_utc, score, zone, rsi, vix, macd_val FROM market_sentiment_index LIMIT 10
| date_utc | score | zone | rsi | vix | macd_val |
|---|---|---|---|---|---|
| 2025-09-18 | 58 | Neutral | 67.892115296438 | 15.5600004196167 | 6.48942949393609 |
TensorFlow 2.10
CPU: AMD 5600
RAM: 32 GB
GPU: RTX 4060 8 GB
For questions or feedback, please contact NE7K or NeighborSoft.
