SI
SI
discoversearch

We've detected that you're using an ad content blocking browser plug-in or feature. Ads provide a critical source of revenue to the continued operation of Silicon Investor.  We ask that you disable ad blocking while on Silicon Investor in the best interests of our community.  If you are not using an ad blocker but are still receiving this message, make sure your browser's tracking protection is set to the 'standard' level.
Non-Tech : Kirk's Market Thoughts -- Ignore unavailable to you. Want to Upgrade?


To: Elroy who wrote (25843)9/9/2025 10:07:20 AM
From: toccodolce  Respond to of 26439
 
Today's flavor is $CWD :)



To: Elroy who wrote (25843)9/9/2025 10:15:33 AM
From: Kirk ©  Read Replies (1) | Respond to of 26439
 
I recall a decade or two ago Google had a project here to scan and digitize every book...

share.google

I read data centers are moving from HD to SSD to store their training data. You'd think there would be a very secure way to store it for decades with backups... more HDs or are SSDs reliable enough?



To: Elroy who wrote (25843)9/13/2025 1:27:08 PM
From: Hank Scorpio2 Recommendations

Recommended By
7kidstofeed
Kirk ©

  Respond to of 26439
 
Just ask AI. Grok:







Hi Grok. Where do you get your information for any of the queries that you receive. Rank them in a top 10 list. Also, where is this information stored, ranked as a top 10 list. Thank you.









As Grok 3, I don’t rely on a fixed set of sources or storage locations that can be neatly ranked, since my knowledge is dynamically generated and updated through a complex interplay of systems. However, I can give you a sense of how I process queries and where the information generally comes from, based on my design and capabilities. Since I don’t have explicit rankings for sources or storage, I’ll frame this as a conceptual breakdown of the types of resources and systems I rely on, in rough order of importance or frequency of use.Where I Get My Information (Top 10 Conceptual Sources)
  1. Internal Knowledge Base (Training Data): My core knowledge comes from a vast, anonymized dataset used during training, which includes a wide range of texts from books, articles, and other publicly available sources up to my last update.
  2. Real-Time Web Search: For queries requiring current information, I can perform web searches to fetch the latest data, articles, or reports relevant to your question.
  3. X Platform Data: I can access public posts and user profiles on X to provide insights or context, especially for trending topics or specific user-related queries.
  4. User-Provided Content: If you upload files (e.g., images, PDFs, text), I analyze them directly to inform my response.
  5. xAI Knowledge Systems: Proprietary datasets or tools curated by xAI, which may include structured information or specialized databases for certain domains.
  6. Public APIs and Databases: For specific queries (e.g., scientific or technical), I may draw on open-access APIs or databases like PubMed, arXiv, or similar repositories.
  7. General Knowledge Patterns: My ability to reason and synthesize answers is informed by patterns and structures learned during training, even if I don’t cite a specific source.
  8. Crowdsourced Knowledge: Indirectly, my training data includes aggregated insights from human contributions across the internet, like forums or wikis, though I don’t access these in real-time unless specified.
  9. News Aggregators: For breaking news or recent events, I may pull from aggregated news sources or summaries available on the web.
  10. Domain-Specific Resources: For niche topics (e.g., legal, medical), I prioritize authoritative sources like government websites or academic journals when available.
Where This Information Is Stored (Top 10 Conceptual Storage Locations)
  1. xAI Cloud Infrastructure: The bulk of my processed knowledge and training data resides in xAI’s secure cloud servers, where my model is hosted.
  2. Distributed Neural Network Weights: My “knowledge” is encoded in the weights of my neural network, stored across xAI’s computational infrastructure.
  3. Temporary Cache for Real-Time Queries: Web search results or X post data are temporarily cached during a session to generate responses.
  4. User Interaction Buffer: Uploaded files or query-specific inputs are stored briefly in a secure, ephemeral buffer for processing.
  5. xAI Knowledge Repositories: Curated datasets or internal databases maintained by xAI for specific domains are stored in secure, proprietary systems.
  6. External Web Servers: For real-time searches, information is pulled from external websites and stored temporarily for analysis.
  7. X Platform Servers: Public X posts or profile data are accessed from X’s servers and processed in real-time.
  8. Local Processing Memory: During a query, intermediate data (e.g., calculations, reasoning steps) is held in memory on the servers running my model.
  9. Archival Storage: Some training data may be archived in cold storage systems, accessed only during model updates or retraining.
  10. Third-Party API Responses: Data from external APIs (e.g., for weather, science) is stored temporarily in memory during query processing.
Notes
  • No Fixed Hierarchy: The “ranking” here is approximate, as the importance of a source or storage location depends on the query. For example, a question about recent news prioritizes web searches, while a math problem relies on my internal reasoning.
  • Storage Security: All data, whether internal or user-provided, is handled in secure environments, with temporary data purged after use to protect privacy.
  • Dynamic Access: I don’t have a static “database” I query like a traditional search engine; my responses are generated by integrating these sources in real-time, guided by my training.
If you’d like me to dive deeper into how I process specific types of queries or to analyze a particular source (e.g., an X post or a website), let me know!



x.com